| By Bill McColl | Article Rating: |
|
| October 29, 2009 04:34 PM EDT | Reads: |
2,471 |
For twenty years, analytics has been viewed as just one specific area within the broader relational database industry. So, analytics has meant databases. Today that view is changing. Over the past year or so, a new movement, the "NoSQL" movement has emerged promoting the advantages of doing a variety of kinds of analytics without using any relational database technologies at all.
Whatever one thinks of the capabilities and limitations of distributed key-value stores relative to relational databases, one thing is clear - the stranglehold that SQL has held over all aspects of data analytics since 1990 is now coming to an end. Other non-SQL approaches to analytics such as MapReduce/Hadoop, a very simple dataflow architecture for batch computing, are now gaining ground. As the need for realtime analytics grows we will continue to see a migration away from databases and towards more scalable parallel dataflow architectures for analytics.

The main differences between databases and dataflow can be summarized as follows:
|
Database |
Dataflow |
|
Historical |
Realtime |
|
Offline |
Online |
|
Pull Model |
Push Model |
|
High latency |
Low latency |
|
Demand-driven |
Data-driven |
The shift from databases to dataflow for enterprise cloud analytics mirrors what we have recently seen in another area, the "realtime web". The old demand-driven web model of polling/querying/pulling RSS feeds has proved unable to deliver the kinds of low latency required for the numerous new realtime web services being created by Twitter and others. New data-driven, realtime, push models such as PubSubHubbub and RSSCloud are now replacing the old approaches.
Published October 29, 2009 Reads 2,471
Copyright © 2009 Ulitzer, Inc. — All Rights Reserved.
Syndicated stories and blog feeds, all rights reserved by the author.
Related Stories
More Stories By Bill McColl
Bill McColl is Founder & CEO, Cloudscale Inc. In order to found Cloudscale he left Oxford University, where for over twenty years he was Professor of Computer Science, Head of the Parallel Computing Research Center, and Chairman of the Computer Science Faculty. He has led research, product and business teams in a number of areas: massively parallel algorithms and architectures, parallel programming languages and tools, datacenter virtualization and resource management, realtime stream processing, and cloud computing. Cloudscale is his second Silicon Valley software company. He was also founder and CEO of Sychron Inc., a Silicon Valley VC-backed software company developing scalable software systems for datacenter and desktop virtualization. McColl lives in Palo Alto, CA.
- The Top 150 Players in Cloud Computing
- Cloudscale's Bill McColl to Present at Cloud Computing Conference in NYC
- Cloudscale CEO Launches Cloudcel Topic on Ulitzer
- 25 Years of Big Data: From SQL To The Cloud
- Cloud Computing For The World's Excel Users
- DEMO Spring 2010 Launches Exceptional Crop of New Emerging Technologies
- DEMO Spring Conference Unveils Diverse Lineup of Emerging Technology Products
- Cloud Supercomputing: Navigating The "Big Data" Space


































Ulitzer content is offered under Creative Commons "Attribution Non-Commercial No Derivatives" License.
For any reuse or distribution, you must make clear to others the license terms of this work.
The best way to do this is with a link to this web page.
Any of the above conditions can be waived if you get written permission from Ulitzer, Inc., the copyright holder.
Nothing in this license impairs or restricts the author's moral rights.