Hadoop



An open source big data framework from the Apache Software Foundation designed to handle huge amounts of data on clusters of servers. The storage is handled by the Hadoop Distributed File System (HDFS), and the data are sorted and summarized in parallel by Hadoop MapReduce, a version of Google's MapReduce. Required Java files are included in Hadoop Common, and Hadoop YARN provides the cluster management.
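The "sorted and summarized" flow described above can be illustrated with a small word count. This is only a minimal sketch that runs in a single Python process. It does not use Hadoop or its Java API, and the function names (map_phase, shuffle_phase, reduce_phase, word_count) are hypothetical names chosen for illustration. In a real cluster, the map and reduce steps would run in parallel on many servers, with the input read from HDFS.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle/sort step: group values by key, then order the groups by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return sorted(groups.items())


def reduce_phase(key, values):
    """Reduce step: summarize all values for one key (here, by summing them)."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle/sort, and reduce in sequence (illustrative only)."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    return [reduce_phase(key, values) for key, values in shuffle_phase(mapped)]


if __name__ == "__main__":
    sample = ["big data on big clusters", "data on servers"]
    print(word_count(sample))
```

Each input line is mapped independently of the others, which is what would allow a framework to spread the map step across a cluster. The grouping step then ensures that every value for a given key reaches the same reduce call.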

Hadoop was originally written for the Nutch Web crawler, which spiders the Web. In 2008, Yahoo!'s Search Webmap became the first very large implementation of Hadoop, running on 10,000 Linux servers. Search Webmap ran in a third less time than Yahoo!'s previous search engine.

The name Hadoop comes from a favorite stuffed elephant belonging to the son of developer Doug Cutting. See Google File System, MapReduce and Spark.
References in periodicals archive
Since Hadoop allows you to manage large data sets without throwing anything away, it's easy to track how the data is being manipulated.
The announcement was made at Strata + Hadoop World, a deep-immersion event where data scientists, innovators, and executives get up to speed on emerging techniques and technologies.
To help our customers more effectively get on the path to Hadoop success, we're excited to provide Cloudera Navigator Optimizer as a unique new tool.
Anaconda is leading the Open Data Science movement, opening up Hadoop through a Python gateway that interacts directly with YARN and HDFS.
As per the forecast, this market will witness a gradual shift towards Software as a Service (SaaS)-based Hadoop solutions as it helps enterprises save cost and gain a better user experience.
Section 2 gives the survey of related work in the area of image processing and Hadoop.
Hands-on work with Apache Hadoop will make up the major share of the training, 70%, and theory will account for the remaining 30%.
SequenceIQ provides rapid deployment tools for Hadoop, to deliver a consistent and automated solution for launching on-demand Hadoop clusters in the cloud or to any environment that supports Docker containers.
Hortonworks (NASDAQ: HDP) said it has signed an agreement with Hitachi Data Systems Corporation (HDS) to promote and support Hadoop.
Forrester Research predicts that this year, Hadoop will become a cornerstone of the business technology agenda at most organizations.
With the growing need for businesses to store, search and analyze huge volumes of unstructured data from multiple sources, big data technologies such as Hadoop are necessary for businesses to extract critical insights.
It will provide users of SAP software with a simple way to incorporate Hadoop as a complementary component of their data architectures to enable a broad range of new applications as well as enrich existing ones with additional data sources.