Apache Hadoop and Big Data Analytics |
|
Hadoop big data analytics is ready for use by business, platform is now stable enough to be relied upon for big data crunching!! After six years of development, the Apache Software Foundation (ASF) has made the first official release of Hadoop 1.0 in Jan'12. What is Hadoop?Hadoop is a collection of software including: a distributed file system, which can handle large amounts of data storage; map reduce, which processes the data; and common, which is the shared infrastructure that supports the project.Hadoop is becoming the de facto data platform that enables organisations to store, process and query vast torrents of data and the new release represents an important step forward in performance, stability and security." Companies can use Hadoop for the types of analyses that business intelligence tools and big data SQL analysis tools are not designed to handle. NoSQL TechnologyThe NoSQL Hadoop technology allows applications to work quickly with thousands of nodes and petabytes of structured and unstructured data. Hadoop is an entirely open source project built by a global community of developers using the Java programming language.Who are using Hadoop?Companies can use Hadoop for the types of analyses that business intelligence tools and big data SQL analysis tools are not designed to handle.For example, Hadoop may be used by companies to analyse the past behaviour of users on their web sites, or to predict future behaviour. Hadoop is already used by some of the largest internet firms in the world, including Amazon Web Services, AOL, Apple, eBay, Facebook, Foursquare, HP, LinkedIn, Netflix, Rackspace, Twitter and Yahoo. Other companies such as IBM and EMC have integrated Hadoop into their offerings to allow their customers to process large amounts of unstructured data. According to a recent Ovum study, around one-third of organisations in the US, Europe and Asia that process large amounts of information plan to invest in big data analytics in the next year. |