MemSQL Streamliner is now generally available! Streamliner is an integrated MemSQL and Apache Spark solution for streaming data from real-time data sources, such as sensors, IoT devices, transactions, application data and logs.
The MemSQL database pairs perfectly with Apache Spark out-of-the-box. Apache Spark is a distributed, in-memory data processing framework that provides programmatic libraries for users to work with data across a broad set of use cases, including streaming, machine...
by Ankur Goyal
In early August, a consortium of the largest German automakers including Audi, BMW, and Daimler (Mercedes) purchased Nokia’s Here mapping unit, the largest competitor to Google Maps, for $3 billion.
It is no longer easy to get lost. Quite the opposite, we expect and rely on maps for our most common Internet tasks from basic directions to on-demand transportation, discovering a new restaurant or finding a new friend.
And the battle is on between the biggest public and private companies in the...
by Gary Orenstein
To shed light on the state of the in-memory database market, we conducted a survey on the prevalent use cases for in-memory databases. Respondents included software architects, developers, enterprise executives and data scientists1. The results revealed a high demand for real-time capabilities, such as analytics and data capture, as well as a high level of interest in Spark Streaming.
Real-Time Needs for In-Memory Databases
It is no surprise that our survey results highlight real-time...
by Freja Mickos
Over the past several months, we worked closely with the Tapjoy data science and engineering team to implement MemSQL as the database to power their Mobile Marketing Automation and Monetization Platform. In order to deliver optimized ads to over 500 million global users and support over one million transactions per minute, Tapjoy needed a database that could enable HTAP, a Gartner term we refer to frequently at MemSQL, which stands for Hybrid Transactional and Analytical Processing.
by Emily Friedman
The phrase “real-time,” like love, means different things to different people.
At its most basic, the term implies near simultaneity. However, the amount of time that constitutes the “real-time window” differs across industries, professions, and even organizations. Definitions vary and the term is so often (ab)used by marketers and analysts, that some dismiss “real-time” as a meaningless buzzword.
However, there is an important distinction between “real-time” and “what we have...
by Conor Doherty
As technology weaves into our daily lives, our expectations of it continue to increase. Consider mobile devices and location information. Recently 451 Research released data that 47% of consumers would like to receive personalized information based on immediate location.
Source: 451 Research
Addressing this requires the ability to track real-time and historical data and to put both in context. Let’s examine that spectrum.
Incoming High Value Content
With a focus on ‘immediate,’ the highest...
With Hadoop Summit Europe underway today, we wanted to share some thoughts on how MemSQL fits in to the Hadoop ecosystem.
While MemSQL and Hadoop are both data stores, they fill different roles in the data processing and analytics stack. The Hadoop Distributed File System (HDFS) enables businesses to store large volumes of immutable data, but by design, it is used almost exclusively for batch processing. Moreover, newer execution frameworks, that are faster and storage agonistic, are...
by Lesia Myroshnichenko
Scaling tends to make even simple things, like counting, seem difficult. In the past, businesses used specialized databases for particular tasks, including high-speed, high-throughput event counters. Due to the constraints of legacy systems, some people still assume that relational databases cannot handle high-throughput tasks at scale. However, due to advances like in-memory storage, high-throughput counting no longer requires a specialized, single-purpose database.
Why do we even need...
by Nikita Shamgunov
In preparation for Open World, I asked some of our engineers to recreate a demo that Oracle has been using over the last year to show off their “in-memory option.” It’s impressive to look at: the demo shows the database searching through billions of records from Wikipedia search trend data for popular terms in less than a second.
The thing about Oracle’s demo is it runs on a gigantically expensive server. In fact it is the biggest one they have at 32TB of RAM and hundreds of CPU cores....