Apache Pig DISTINCT Operator The Apache Pig DISTINCT operator is used to remove duplicate tuples in a relation. Initially, Pig sorts the…
-
Apache Pig SUM Function The Apache Pig SUM function is used to find the sum of the numeric values in a single-column…
-
Apache Pig FLOOR Function The Apache Pig FLOOR function is used to return the largest integer less than or equal to a…
-
Apache Pig TOKENIZE Function The Apache Pig TOKENIZE function is used to split an existing string and generate a bag of words…
-
Hive Tutorial This Hive tutorial covers the basic and advanced concepts of Hive and is designed for beginners and professionals. Apache Hive…
-
Sqoop Import We have tables in a MySQL database and we have to import them into HDFS using Sqoop. To see the content in…
-
What is Big Data Data that is very large in size is called Big Data. Normally we work on data of size…
-
Sqoop Installation The prerequisites for Sqoop installation are Java and Hadoop. Both must be preinstalled on the system. Download the latest…
-
What is Hadoop Hadoop is an open-source framework from Apache that is used to store, process, and analyze data which are…
-
Sqoop Integration with Hadoop Ecosystem Until now, data was moved from an RDBMS to HDFS. This imported data may further be required code…