Hadoop

An open source Big Data framework from the Apache Software Foundation designed to store and process huge amounts of data on clusters of servers. Storage is handled by the Hadoop Distributed File System (HDFS), and the data are sorted and summarized in parallel by Hadoop MapReduce, an open source implementation of Google's MapReduce programming model. Hadoop Common contains the shared Java libraries and utilities that the other modules require, and Hadoop YARN provides cluster resource management and job scheduling.
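The "sorted and summarized" step can be illustrated with a small word count. The following is a minimal single-process sketch in Python and does not use Hadoop's actual Java API. The function names (map_phase, shuffle, reduce_phase, word_count) and the sample records are hypothetical, chosen only for illustration. In a real cluster the map and reduce calls would run in parallel on many servers, and the framework would perform the grouping step itself.

```python
from collections import defaultdict


def map_phase(record):
    """Map step: emit a (word, 1) pair for every word in one input record."""
    for word in record.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group values by key and sort the keys.

    This stands in for the grouping a MapReduce framework does
    between the map and reduce steps.
    """
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(sorted(groups.items()))


def reduce_phase(key, values):
    """Reduce step: summarize all values for one key (here, by summing)."""
    return key, sum(values)


def word_count(records):
    """Run map, shuffle, and reduce in sequence in a single process."""
    pairs = (pair for record in records for pair in map_phase(record))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical input: each string plays the role of one record in a file.
counts = word_count(["big data on clusters", "data on servers"])
print(counts)
```

Because each record is mapped independently and each key is reduced independently, both steps can be spread across many machines, which is what makes this model suitable for clusters.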

Hadoop was originally written for the Nutch Web crawler, which spiders the Web. In 2008, Yahoo's Search Webmap, running on 10,000 Linux servers, became the first very large implementation of Hadoop. Search Webmap ran in a third less time than Yahoo's previous search engine.

The name Hadoop comes from a favorite stuffed elephant that belonged to the son of developer Doug Cutting. See Google File System, MapReduce and Spark.
References in periodicals archive
Cloudera, the global provider of the fastest, easiest, and most secure data management and analytics platform built on Apache Hadoop and the latest open source technologies, announced today that Dialog, Sri Lanka's largest and most rapidly growing mobile telecommunications network provider, has adopted Cloudera Enterprise to gain a unified view of their customer insights and networks.
Fundamentals of Apache Hadoop and data ETL (extract, transform, load), ingestion, and processing with Hadoop
This new modern data architecture made it possible for Apache Hadoop to become a true data operating system and platform.
The company has deep experience contributing to existing Apache Software Foundation projects and innovating within the open source community, best exemplified by the seamless integration of Cloudbreak and Periscope with Apache Ambari and Apache Hadoop YARN.
Apache Hadoop has enabled businesses of all sizes to utilize big data cost-effectively and re-allocate the limited space in the Enterprise Data Warehouse only for that data that needs to travel first class.
based on open source software from the Apache Hadoop ecosystem and optimise
TELECOMWORLDWIRE-February 17, 2015-Hortonworks, Hitachi Data Systems to deliver Apache Hadoop
Information technology solutions firm TEKsystems Global Services, a division of TEKsystems, revealed on Thursday that it has joined the Hortonworks partner network and now has Systems Integrator Partner Program status with the provider of enterprise Apache Hadoop.
the top-ranked distribution for Apache Hadoop, and Elasticsearch Inc.
Palo Alto-based Cloudera sells and services data management software that helps clients manage and analyze large amounts of data quickly using the open source Apache Hadoop program.
25-Year Technology Veteran Brings Apache Hadoop Expertise and Enterprise Product Vision to Shape Future Direction of 100-percent Open Source Hortonworks Data Platform
US-based Intel Corporation has unveiled Intel Distribution for Apache Hadoop software, which enables more organisations and the public to use the vast amounts of data being generated, collected and stored everyday - also known as "big data".