machine learning

Modern Data Warehousing, Meet AI

We are enchanted by the possibility of digital disruption. New computing approaches, from cloud to artificial intelligence and machine learning, promise new business models and untold efficiencies. We are closing the gap between science fiction and business operations. A Quick Look Back Let’s take a quick look back at data processing, and then come back to the industry frontier. It started with data and the place to put it, which became the database. Then came a desire to understand the data...


Durable Storage for Real-Time Analytics with MemSQL and Spark

Apache Spark has made a name for itself as a powerful data processing engine for transforming large datasets in a swift, distributed manner. After using Spark to complete such transformations, you often want to store your data in a persistent and efficient format for long-term access. The common solution of storing data in HDFS solves the issue of persistence, but suffers efficiency issues as a result of the HDFS disk-based architecture. The MemSQL Spark Connector solves both of these issues by...


machine learning at scale

Video: Scoring Machine Learning Models at Scale

At Strata+Hadoop World, MemSQL Software Engineer, John Bowler shared two ways of making production data pipelines in MemSQL: 1) Using Spark for general purpose computation 2) Through a transform defined in MemSQL pipeline for general purpose computation In the video below, John runs a live demonstration of MemSQL and Apache Spark for entity resolution and fraud detection across a dataset composed of a hundred thousand employees and fifty million customers. John uses MemSQL and writes a Spark job...


ml facial recognition

Machines and the Magic of Fast Learning (Strata Keynote Recording)

How can big data and machine learning be used for good? In our keynote at Strata+Hadoop World, MemSQL CEO Eric Frenkiel shared how we are working with Thorn to provide a new approach to machine learning and real-time image recognition to combat child exploitation. About Thorn Thorn partners across the technology companies and government organizations to combat predatory behavior, rescue victims, and protect vulnerable children. Thorn has to sift through a massive amount of images daily. Images...


Machine Learning Facial Recognition

An Engineering View on Real-Time Machine Learning

About Thorn Thorn partners across the tech industry, government and NGOs, leveraging technology to combat predatory behavior, rescue victims, and protect vulnerable children. About Eric Boutin Eric leads an engineering team for MemSQL in our Seattle office. This is background information from Eric on our work with Thorn. How did you first get connected with Thorn? I was introduced to Federico Gomez Suarez, a volunteer working with Thorn, by a common friend. I was impressed by the work Thorn was...


Strata Talks

Five Talks for Solving Big Data Challenges at Strata+Hadoop World

Strata+Hadoop World in San Jose kicks-off next week on March 14, offering data engineers and business intelligence professionals a place to gather and learn about the most challenging problems, engaging use cases, and enticing opportunities in data today. MemSQL will be showcasing real-time data as a vehicle for operationalizing machine-learning, exploring advanced tools including TensorFlow, Apache Spark, and Apache Kafka. We will also be demonstrating the power of machine learning to effect...


Machine Learning Podcast

O’Reilly Radar Podcast: The 2017 Machine Learning Outlook

O’Reilly Media Editor, Jon Bruner, recently sat down with MemSQL VP of Engineering, Drew Paroski, and MemSQL CMO, Gary Orenstein, to discuss the rapid growth and impact that machine learning will have in 2017. In this podcast, Paroski and Orenstein share examples from companies using real-time technologies to power machine learning applications. They also identify key trends driving the adoption of machine learning and predictive analytics. Listen Here Podcast topics of discussion include: ...


Path to Predictive Analytics Book

The Path to Predictive Analytics and Machine Learning - Free O’REILLY Book

Organizations once waited hours, days, or even weeks to get a handle on their data. In an earlier era, that sufficed. But with today’s endless stream of zeros and ones, data must be usable right away. It’s the crux of decision making for enterprises competing in the modern era. Recognizing cross-industry interest in massive data ingest and analytics, we teamed up with O’Reilly Media on a new book: The Path to Predictive Analytics and Machine Learning. In this book, we share the latest step...


PowerStream

Using MemSQL and Spark for Machine Learning

At Spark Summit in San Francisco, we highlighted our PowerStream showcase application, which processes and analyzes data from over 2 million sensors on 200,000 wind turbines installed around the world. We sat down with one of our PowerStream engineers, John Bowler, to discuss his work on our integrated MemSQL and Apache Spark solutions. What is the relationship between MemSQL and Spark? At its core, MemSQL is a database engine, and Spark is a powerful option for writing code to transform data....


real-time predictions

Predictions 2016: the Impact of Real-Time Data

Prediction 1. The industrial internet moves to real-time data pipelines The industrial internet knits together big data, machine learning, and machine-to-machine communications to detect patterns and adjust operations in near real time. Soon the industrial internet will expand by definition to include the Internet of Things. The detection of patterns and insights often comes with a price: time. While the goal of machine learning is to develop models that will prove useful, dealing with large...