Apache Open-Source Configurations
Installation and Configuration Here I have installed and configured software from Apache Software Foundation.You guys could follow Cloudera/Edureka Blogs for reference. Apache Hadoop Hadoop is an open-source software framework for storing data and running applications on clusters of commodity hardware. It provides massive storage for any kind of data, enormous processing power and the ability to handle virtually limitless concurrent tasks or jobs. Click Here For Hive and Sqoop one should install and configure Hadoop first. Apache Hive Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy. Click Here Apache Sqoop Sqoop is a command-line interface application for transferring data between relational databases and Hadoop. Click here Apache Cassandra Apache Cassandra is a free open-source database system