“BIG
DATA” is a term for a collection of a large and complex volume of both
structured and unstructured data. It is so big and complex that it is very
difficult to process it using traditional software and techniques. Example of
Big data would be Petabytes or Exabyte of data consisting of trillions of
information about millions of people. Organization often faces a problem in
creating, manipulating and managing a big data. But a successful interpretation
of big data provides a competitive edge to the firm as it can provide a real
time information.
Hadoop
is an open source software that supports the processing of Big Data. It makes
possible to run applications on systems with thousands of nodes involving
thousands of terabytes. Hadoop makes sense out of your big data and reveals
answers that have always been just out of reach. In order to gain a competitive
edge by making a sense out of a big data, there is no brighter lure than Hadoop
for an organization. But in order to take best use of Hadoop a firm has to
understand Hadoop and then implement it in their data cloud.
- Hadoop doesn’t works well with the structured data. Hadoop is ideal for the data from the sources like social media, documents, graphs etc. i.e., data which can easily fits in rows and columns
- Transactional data are also not ideal for Hadoop
- Hadoop works best when it is deployed in situation such as index buildings, pattern recognitions and sentiment analysis i.e., Hadoop should be integrated within existing IT infrastructure of firm, it should not replace the existing infrastructure
- Hadoop is linearly scalable, firm has to increase storage and processing power whenever there is an addition in number of nodes.
Conclusion
In
order to have a competitive edge over competitor a firm has to analyze its Big
Data more effectively than its competition and for doing so Hadoop is a best
tool but in order to make best of use of Hadoop a firm has to understand that
Hadoop should not replace the existing system but augment Hadoop with an existing
system.
Refrences:
- http://www.cloudera.com/content/cloudera/en/why-cloudera/hadoop-and-big-data.html
- http://www.cio.com/article/708542/What_Hadoop_Can_and_Can_t_Do?page=3&taxonomyId=3028