Spark Summit 2017
“Apache® Spark™ is a powerful open source processing engine built around speed, ease of use, and sophisticated analytics. It was started at UC Berkeley in 2009 and is now developed at the vendor-independent Apache Software Foundation. Since its release, Spark has seen rapid adoption by enterprises across a wide range of industries. Internet powerhouses such as Yahoo, eBay and Netflix have deployed Spark at massive scale, processing multiple petabytes of data on clusters of over 8,000 nodes. Apache Spark has also become the largest open source community in big data, with over 1000 contributors from 250+ organizations.”
Director of Data Infrastructure, The Honest Company
How the Insnap acquisition made over Jessica Alba’s company with data analytics
One day, regular line-of-business people might be able to handle high-level data monetization themselves with a do-it-yourself tool. Today is not that day for The Honest Co. The natural home and personal care brand, co-founded by actress Jessica Alba, acquired and integrated a big data software-as-a-service startup called Insnap Inc. in 2016 to become more data-driven. Read the full blog post with highlights from his interview at SiliconANGLE.com.
Master Architect, HP
How Spark helped build a community inside a distributed company
It’s hard to appreciate just how large organizations can be until someone has to get all those departments and splintered businesses talking to each other. They need a common platform to share data. Hewlett Packard Enterprise Co., a company currently in a downsizing transition of its own, found that common platform in the open-source data management platform, Apache Spark. Read the full blog post with highlights from his interview at SiliconANGLE.com.
VP, Chief Data Science Officer, Hotels.com
Hotels.com embraces data-driven culture
Whether you are looking for a luxury hotel on a secluded beach or a place to stay in midtown Manhattan that won’t break the bank, the process of sorting through options and booking online can seem pretty straightforward. But behind most travel websites today is a complex data engine that pays close attention to what you may have booked before, and it strives to filter every possible detail in order to make recommendations that you’ll like. Read the full blog post with highlights from his interview at SiliconANGLE.com.
CEO and Co-Founder, Databricks
Will serverless functions beat DevOps in race to democratize analytics?
Enterprise data scientists and developers fed up with data that just sits there and doesn’t make money might take heart in a prediction from Ali Ghodsi, chief executive officer and co-founder of Databricks Inc. Read the full blog post with highlights from his interview at SiliconANGLE.com.
VP, Software & Service Group, Intel
Leveraging Moore’s Law to get more integration into solutions
Michael Greene, vice president of the software and service group at Intel, offered insight into how Intel is a solutions company and how software can be used to ignite the hardware. Watch his interview!
Software Developer & Author, Cloudera
Cloudera aims to change the way data is engineered
Developing an accurate data science model is a challenging process on its own. Scaling the model from a development environment to a production cluster presents another set of operational challenges that Cloudera Inc. aims to address with two new product offerings: Data Science Workbench and Altus. Read the full blog post with highlights from his interview at SiliconANGLE.com.
Could Apache Spark become a universal computation engine?
Spark Summit keynotes are known for their surprises, and this year the stand-out changes were in data streaming, with sub-millisecond times predicted for some workloads. With multiple avenues open for potential success, the community is watching as Spark matures to fulfill the promise of what it could be. But does that promise include becoming a database? Read the full blog post with highlights from his interview at SiliconANGLE.com.
Data Scientist, Riot Games
A game of data science: the analytics architecture behind Riot Games
With the help of modern analytics, Riot Games Inc. developed a highly successful computer game called League of Legends, in which players form teams of champions and compete with other players around the world. Wesley Kerr, senior data scientist at Riot Games, explained how his organization is leveraging data science to improve player experience and weed out unsavory behavior. Read the full blog post with highlights from his interview at SiliconANGLE.com.
Director of Predictive Analytics, Novetta
Automating entity recognition, extraction and resolution
Identifying and extracting relevant entities from masses of stored data is a complex and tedious task. Advanced analytics company Novetta Solutions LLC is using the functionality of Databricks Inc., a cloud-based data management service, to speed up and even automate the process. Read the full blog post with highlights from his interview at SiliconANGLE.com.
Can big data DevOps see what abstraction is hiding?
DevOps practices that have worked well in other areas of information technology can’t hack it in big data, according to Ash Munshi, chief executive officer of Pepperdata Inc. Read the full blog post with highlights from his interview at SiliconANGLE.com.
Senior Software Development Manager, Autodesk
Building software for the construction industry
Nathan Murith, senior software development manager at Autodesk, talked about what he’s excited to try after watching the keynote address and how Autodesk uses Spark. Watch his interview!
Senior Director of Product Marketing, Confluent
Confluent makes Kafka easier in the cloud, grants new data powers to small business
Where data processing is concerned, speed is life. Companies can ship off some workloads to batch processing, but they need to process the critical stuff in real time while a customer or system interaction is happening. Not all companies have the resources to make this happen, however. That’s where big data company Confluent Inc. and distributed data streaming platform Apache Kafka step in. Read the full blog post with highlights from his interview at SiliconANGLE.com.
How SnappyData is enriching Spark as a hybrid database
Enriching Apache Spark so it’s not just a platform but also a store is just part of the innovation occurring at SnappyData Inc., according to Jags Ramnarayan, founder and chief technical officer of SnappyData. Read the full blog post with highlights from his interview at SiliconANGLE.com.
Founder, Predictive Analytics World
Using predictive analytics to make key decisions in marketing, political campaigns and other applications
Eric Siegel, founder of Predictive Analytics World, shared insight into his book “Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie or Die” and how data helps make key predictions. Watch his interview!
SVP Data ONTAP Software and Systems Group, NetApp
Why customers are investing in data lake technologies
Octavian Tanase, senior vice president of the Data ONTAP software and systems group at NetApp, talked about the evolution of the mainstream, what is changing and use cases for big data. Watch his interview!
Dr. Jisheng Wang
Senior Director of Data Science, CTO Office, Hewlett Packard Enterprise
Preventing internal data attacks
Dr. Jisheng Wang, senior director of data science in the CTO office at Hewlett Packard Enterprise, provided insight into the new solution he is developing with Spark to prevent attacks from inside an organization. Watch his interview!
Director, Cloud Management, Cloudera
Cloudera aims to change the way data is engineered
Developing an accurate data science model is a challenging process on its own. Scaling the model from a development environment to a production cluster presents another set of operational challenges that Cloudera Inc. aims to address with two new product offerings: Data Science Workbench and Altus. Read the full blog post with highlights from her interview at SiliconANGLE.com.
Chief Architect & Co-Founder, Databricks
Exposing the right information so customers can see the value of a project
Reynold Xin, chief architect and co-founder at Databricks, provided thoughts on the new developments with Spark from Databricks, including streaming. Watch his interview!
Chief Technologist & Co-Founder, Databricks
Working with a one-millisecond latency
Matei Zaharia, chief technologist and co-founder at Databricks, talked about the new deep learning library coming with Spark and what customers want to see when working with Spark. Watch his interview!
For More Information: Contact theCUBE