Big data is high-volume, high-velocity and/or high-variety information assets that demand cost-effective, innovative forms of information processing that enable enhanced insight, decision making, and process automation. By Gartner
The theoretical comparison between Big Data Vs Conventional Data:
Time to explore more hands-on with Hortownworks(now merged with Cloudera)
Setup the environment
Virtual Box, VMware & Docker are the 3 options available. Virtual Box in this case as the setup less light weight compare to other 2 options.
Download & Setup Virtual Box , Link
Download .ova file from Hortonworks. Contains the complete environment of Hadoop. But need some configuration which we will cover shortly. Link
By end of this steps you should completed the configuration & able to boot-up the Hortontonworks Hadoop environment.
Start the configured Hortonworks sandbox
Once the sandbox configutation is completed, now time to look at the front end.
Type, 127.0.0.1:8080 in internet browser. Key in username, password & click Sign in.
Note: Port, username and password setup done during environment configuration process.
Can see names like HIVE, PIG, SPARK, etc etc, Yes? Bingo, now you are REALLY IN Hadoop based enterprise ready GUI based administration platform: AMBARI
AMBARI – Admin portal or can be said as gateway for most of the works we going to perform in Hontonworks Hadoop based environment. From administration to data modelling and visualization.
Things are getting interesting, we want more!!! Yes, I can hear that..
Stay tune, in my next blog will be covering “Sample Big Data Setup & Visualization in Ambari” in a Proof of Concept(POC) manner.
Disclaimer: My original post in 2017, reposted here again to centralise my digital profile in one single location, in dataChief. And to share the knowledge with any members starting with big data.