Monday, March 5, 2012

Cloudera


1. Hadoop is a big deal because of the flexibility it provides to companies that need data managment. One major advantage of Hadoop is that it is open source. That appeals to lots of companies in and of itself. Another advantage to Hadoop is that it can handle incredibly large amounts of unstructured data and structured data. Lots of major companies are adopting it to handle their data management needs. Its revolutionizing the way we store unstructured data.

2. Cloudera is the enterprise that offers Hadoop as a package but is free under Apache licenses. It was formed inThey offer two packages. The first is Hadoop in its raw form without any technical help. The second product offers Hadoop but also assists in consulting, setting up, and managing Hadoop.

3. PIG is a platform that is used in conjunction with Hadoop to analyze large data sets. The query language is humorously called PIG latin and queries can be created by their owners to do special processing of the data sets. The advantage of PIG is that it, much like a actual pig, can consume anything. Their is no data set that PIG can't analyze.

4. HIVE is similar to PIG. Hive does similar functions of PIG in that it analyzes large data sets of structured and unstructured data. The main advantage of HIVE is that it is based on SQL. Because SQL is already in use in most of the organizations it is one less thing to learn when using Hadoop.

5. Cassandra is a hybid non-sequel, non-relational database. The major advantage of Cassandra is that fields do not have to be predetermined before you add data to the database. This allows for the database to be scaled up without having to manually move data or restart processes. Cassandra also backs up data so that their can never be a single point of failure.

6. Mahout is a machine that learns, and interprets data sets and gives useful feedback on trends or patterns. Mahout is a extremely glorified data mining machine that learns from past experiences and provides useful business data. Mahout is open source and focuses on giving scalable machine-learning algorithms.

No comments:

Post a Comment

HTML

Sherlock Holmes: A Game of Shadows

Sherlock has his girl that he likes kidnapped and she dies of TB. His friend gets married and while on the way to the honeymoon almost get killed. They find out that the man who is trying to kill them has been buying up all of the resources needed to maintain a war. Sherlock must infiltrate a gathering and discover the man who is set to kill a political candidate in order to start the war. Sherlock saves the person and ends up killing his arch nemesis.

Shadows
  1. Robert Downey Jr. is awesome.

  2. Jude Law is awesome.

  3. I thought Rachel McAdams would be in it more.

  4. The first one was pretty good.

  5. I enjoy the mystery genre.

Sherlock Holmes: A Game of Shadows