Apache Hadoop Ecosystem

Apache Hadoop Ecosystem

  • Hadoop HDFS - 2007 - A distributed file system for reliably storing huge amounts of unstructured, semi-structured and structured data in the form of files. 
  • Hadoop MapReduce - 2007 - A distributed algorithm framework for the parallel processing of large datasets on HDFS filesystem. It runs on Hadoop cluster but also supports other database formats like Cassandra and HBase. 
  • Cassandra - 2008 - A key-value pair NoSQL database, with column family data representation and asynchronous masterless replication. 
  • HBase - 2008 - A key-value pair NoSQL database, with column family data representation, with master-slave replication. It uses HDFS as underlying storage. 
  • Zookeeper - 2008 - A distributed coordination service for distributed applications. It is based on Paxos algorithm variant called Zab. 
  • Pig - 2009 - Pig is a scripting interface over MapReduce for developers who prefer scripting interface over native Java MapReduce programming. 
  • Hive - 2009 - Hive is a SQL interface over MapReduce for developers and analysts who prefer SQL interface over native Java MapReduce programming. 
  • Mahout - 2009 - A library of machine learning algorithms, implemented on top of MapReduce, for finding meaningful patterns in HDFS datasets. 
  • Sqoop - 2010 - A tool to import data from RDBMS/DataWarehouse into HDFS/HBase and export back. 
  • YARN - 2011 - A system to schedule applications and services on an HDFS cluster and manage the cluster resources like memory and CPU. 
  • Flume - 2011 - A tool to collect, aggregate, reliably move and ingest large amounts of data into HDFS. 
  • Storm - 2011 - A system to process high-velocity streaming data with 'at least once' message semantics. 
  • Spark - 2012 - An in-memory data processing engine that can run a DAG of operations. It provides libraries for Machine Learning, SQL interface and near real-time Stream Processing. 
  • Kafka - 2012 - A distributed messaging system with partitioned topics for very high scalability. 
  • SolrCloud - 2012 - A distributed search engine with a REST-like interface for full-text search. It uses Lucene library for data indexing.


  1. Much thanks to you for setting aside opportunity to composing your experience.This is extremely useful.
    Education | Article Submission sites | MBA Guide | Technology

  2. Wonderful post!!Thank you for sharing this info with us.
    Keep updating I would like to know more updates on this topic
    Very useful content, I would like to suggest this blog to my friends.

    Best Hadoop Training in Chennai
    Big Data Hadoop Training in Chennai

  3. This technical post helps me to improve my skills ,thanks for this wonder post I expect your upcoming blog, so keep sharing...
    Technology updates


  4. Pretty blog, so many ideas in a single site, thanks for the informative article, keep updating more article.
    Digital marketing course in chennai

  5. Best tutorial on hadoop for freshers among lots of available. Super work
    Hadoop Training in Chennai

  6. Thanks for sharing your knowledge with us .This will absolutely going to help me in my future .

    Big Data Training Chennai

    Best hadoop training institute in chennai

  7. It's event-driven, and builders not should depend on the ops to check their code. They'll shortly run, check and deploy their code with out getting tangled within the conventional workflow.This is great blog. If you want to know more about this visit here Internet of Things.

  8. Your good knowledge and kindness in playing with all the pieces were very useful. I don’t know what I would have done if I had not encountered such a step like this.
    Online training in USA

  9. Those guidelines additionally worked to become a good way to recognize that other people online have the identical fervor like mine to grasp great deal more around this condition.
    Click here:
    Online training in USA

  10. Thank you a lot for providing individuals with a very spectacular possibility to read critical reviews from this site.

    Online training in USA

  11. I feel really happy to have seen your webpage and look forward to so many more entertaining times reading here. Thanks once more for all the details.Online training in USA

  12. This information is impressive; I am inspired with your post . Keep posting like this, THis is very useful .. Thank you so much.. Waiting for more blogs like this.

    Best Big Data Training in Chennai

    Hadoop Big Data Training in Chennai

  13. This is an awesome post. Really very informative and creative contents. These concept is a good way to enhance the knowledge. Thank you for this brief explanation and very nice information.
    Big Data Certification in Chennai | Best Hadoop Training in Chennai | Best hadoop training institute in chennai | Hadoop Course in Chennai | Best Big Data Training in Chennai

  14. It's interesting that many of the bloggers to helped clarify a few things for me as well as giving.Most of ideas can be nice content.The people to give them a good shake to get your point and across the command.
    aws training in chennai

    hadoop training in chennai

  15. Its a wonderful post and very helpful, thanks for all this information. You are including better information regarding this topic in an effective way. T hank you so much.
    Python Training in Chennai
    Digital Marketing Course in Chennai
    Python Training classes in Chennai
    Python Training Chennai
    Digital Marketing Training Institutes in Chennai
    Digital Marketing Chennai

  16. It’s a classic great for me to go to this blog site, it offers helpful suggestions
    JAVA Training in Chennai |
    JAVA Course in Chennai |
    Best JAVA Training in Chennai

  17. Hi to all, the blog has really the dreadful information I really enjoyed a lot.
    Big Data Training in Chennai |
    Big Data Training |
    Big Data Course in Chennai

  18. Thanks for your sharing such a useful information. this was really helpful to me


  19. Great information about Medical Billing Services and EMR Software. So many people have a confusion related to the field of billing and coding. This is a kind of article that can help those people
    Hadoop Training Chennai |
    Hadoop Training in Chennai |
    Big Data Training in Chennai

  20. So nice to read.Its very useful for me to get more valuable info about Medical Billing Coding Service.Thanks for it.Keep going.
    Cloud computing Training |
    Cloud computing Training in Chennai |
    Cloud computing courses in Chennai

  21. I have gone through your blog, it was very much useful for me and because of your blog, and also I gained many unknown information, the way you have clearly explained is really fantastic. Kindly post more like this, Thank You.
    Airport management courses in chennai
    Airport Management Training in Chennai
    Airline Courses in Chennai
    airport courses in chennai

  22. Fascinating casino life online. best online gambling sites Play online casino at BGAOC.

  23. I appreciate that you produced this wonderful article to help us get more knowledge about this topic.
    I know, it is not an easy task to write such a big article in one day, I've tried that and I've failed. But, here you are, trying the big task and finishing it off and getting good comments and ratings. That is one hell of a job done!

    Selenium training in bangalore
    Selenium training in Chennai
    Selenium training in Bangalore
    Selenium training in Pune
    Selenium Online training

  24. The way of you expressing your ideas is really good.you gave more useful ideas for us and please update more ideas for the learners.
    Python Training in Chennai
    IOS Training in Chennai
    Hadoop training in chennai
    Big data training in chennai
    big data training in chennai anna nagar


Post a Comment

Popular posts from this blog

Big Data After The Internet