Recent data management trends suggest a rapid growth in the use of large server infrastructure like data centers and cloud computing platforms for scalable services. Internet services, like Google and Yahoo!, are deploying their computing infrastructure to run applications on massive amounts of input data to compute business results.The massive volume of data being used are publicly available and is increasing year over year. The emerging future is more exciting. Success in the future will be dictated to a large extent by the organization's ability to extract value from other organization's data.
The ability to process massive volume of data is made avaialble by Google's software and the closely related open-source Hadoop software. They are revolutionizing the Internet services community by building scalable systems infrastructure for data intensive applications.
As business “best practices” trend increasingly towards basing decisions off data and hard facts rather than instinct and theory, the corporate thirst for systems that can manage, process, and granularly analyze data is becoming insatiable. Venture capitalists are very much aware of this trend, and have funded no fewer than a dozen new companies in recent years that build specialized analytical data management software (e.g., Netezza, Vertica, DATAllegro, Greenplum, Aster Data, Infobright, Kickfire, Dataupia, ParAccel, and Exasol), and continue to fund them, even in pressing economic times.
Big data is data that exceeds the processing capacity of conventional database systems. The data is too big, moves too fast, or doesn’t fit the strictures of conventional database architectures By definition, Big Data is data that cannot be processed using the resources of a single machine. The term Big Data applies to information that can’t be processed or analyzed using traditional processes or tools. Within this data lie valuable patterns and information, previously hidden because of the amount of work required to extract them. The value of big data to an organization falls into two categories: analytical use and enabling new products.
With more and more businesses reporting petabyte-sized data warehouses, the system of choice for managing and analyzing massive amounts of the data ( Big Data ) will be the one that
The power of an organization’s information can be enhanced by its trustworthiness, its volume, its accessibility, and the capability of an organization to be able to make sense of it all in a reasonable amount of time in order to empower intelligent decision making.