Hadoop

An open source Big Data framework from the Apache Software Foundation designed to store and process huge amounts of data on clusters of servers. Storage is handled by the Hadoop Distributed File System (HDFS), and the data are sorted and summarized in parallel by Hadoop MapReduce, an open source implementation of Google's MapReduce programming model. The shared Java libraries and utilities that the other modules depend on are included in Hadoop Common, and Hadoop YARN provides cluster resource management and job scheduling.
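As a rough illustration of the "sorted and summarized" steps, the following is a minimal single-process sketch of the MapReduce pattern in Python, using word counting as the example. It does not use Hadoop or its API. The function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample input are assumptions made for illustration, and on a real cluster each phase would run in parallel across many servers rather than in one loop.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle_phase(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle/sort: group all values by key, with keys in sorted order."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(sorted(groups.items()))


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce: summarize the values collected for one key (here, a sum)."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle and reduce in sequence (hypothetical driver)."""
    mapped = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Illustrative input only; a real job would read its input from HDFS.
    sample = ["big data on big clusters", "data on servers"]
    print(word_count(sample))
```

The map and reduce functions share no state, which is what allows a framework such as Hadoop to spread them over many machines and to rerun a failed piece independently.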

Hadoop was originally written for the Nutch Web crawler, which spiders the Web. In 2008, Yahoo's Search Webmap became the first very large implementation of Hadoop, running on 10,000 Linux servers. Search Webmap ran in a third less time than Yahoo's previous search engine.

The name Hadoop comes from a favorite stuffed toy elephant that belonged to the son of developer Doug Cutting. See Google File System, MapReduce and Spark.