Choose Index below for a list of all words and phrases defined in this glossary.


Hadoop

index | Index

Hadoop - definition(s)

Hadoop – an open-source framework that is built to enable the process and storage of big data across a distributed file system.

[Category=Big Data ]

Source: BigData-Startup, 29 September 2013 08:56:33, http://www.bigdata-startups.com/abc-big-data-glossary-terminology External


These advertisers support this free service


Hadoop - An open source software library project administered by the Apache Software Foundation. Apache defines Hadoop as “a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model.”

[Category=Big Data ]

Source: DataInformed, 31 October 2013 09:04:01, http://data-informed.com/glossary-of-big-data-terms/ External


Hadoop - An open source framework for data intensive distributed applications. As a framework, it encompasses things like HBase and HDFS (listed here as well).

Hadoop is designed to work on commodity hardware and scale horizontally by throwing more machines onto a problem.

Hadoop is king of Big Data_ technologies. It is the most commonly used solution when referring to Big Data_ and is oftentimes used without any real need. While Hadoop is used for real time analytics, it is really good at write operations and slow on reads, making it… not that good for real time analytics as some incorrectly assume.

[Category=Big Data ]

Tsahi Levent-Levi, 14 November 2013 09:19:47, http://bloggeek.me/my-big-data-glossary/ External


Hadoop - Is the defacto big data framework. The java-based, open-source ecosystem enables data to be chopped up and processed across a cluster of services. It’s designed to scale out from a single server to thousands of machines, with a very high degree of fault tolerance.

[Category=Big Data ]

Jamie Turner, 15 December 2014 11:19:05, http://blog.triggar.com/a-z-of-big-data-learn-the-lingo/ External


Hadoop - Apache Hadoop is one of the most widely used software frameworks in big data. It is a collection of programs which allow storage, retrieval and analysis of very large data sets using distributed hardware (allowing the data to be spread across many smaller storage devices rather than one very large one).

[Category=Big Data ]

Bernard Marr, 21 December 2014 10:20:02, http://smartdatacollective.com/bernardmarr/287086/big-data-22-key-terms-everyone-should-understand/ External


Data Quality Glossary.  A free resource from GRC Data Intelligence. For comments, questions or feedback: dqglossary@grcdi.nl