Difference between var and val in spark- Interview question

Var keyword is just similar to variable declaration in Java whereas Val is little different. Once a variable is declared using Val the reference cannot be changed to point to another reference.... Read more »

Lazy Evaluation in Apache Spark

Lazy evaluation in Spark means that the execution will not start until an action is triggered. The Spark Lazy evaluation, users can divide into smaller operations. It reduces the number of passes... Read more »

Difference between Coalesce and Repartition- Hadoop Interview question

Lenovo V130 81HQA034IH 2019 14-inch Laptop (8th gen I3-8130U/4GB/1TB HDD/DOS/Integrated Graphics), Gray  (2) (as of April 3, 2020 - More infoProduct prices and availability are accurate as of the date/time indicated and... Read more »

How to create RDD in spark – Interview question

DELL Vostro 3490 14-inch Thin & Light Laptop (10th Gen Core i3-10110U/4GB/1TB HDD/Ubuntu/Integrated Graphics), Black  (3) (as of April 3, 2020 - More infoProduct prices and availability are accurate as of the... Read more »

Hive partition with example – Interview Question

HP 15 da0414tu 15.6-inch Laptop (8th Gen i3-8130U/8GB/1TB HDD/Windows 10/Intel UHD 620 Graphics), Chalkboard Gray (as of April 3, 2020 - More infoProduct prices and availability are accurate as of the date/time... Read more »

Sqoop import – Part 1- Hadoop series

MSI Modern 14 A10M-652IN Intel Core i5-10210U 10th Gen 14-inch Laptop(8GB/512GB NVMe SSD/Windows 10 Home/UMA/Grey/1.29Kg )9S7-14B361-652  (1) (as of April 3, 2020 - More infoProduct prices and availability are accurate as of... Read more »

Read CSV and JSON file format in spark 2.0

Read CSV with spark 2.0 STEP 1. Open the spark-shell and fire the following command. scala> spark.read.format(“csv”).option(“header”,”true”).load(“F:/Hadoop Youtube/customer.csv”) STEP 2. Display the result with show command scala> .show +—–+——+———–+——-+———-+—-+——+ |empno| ename|designation|manager| hire_date|... Read more »

Default Number of mapper and reducer in SQOOP job

Updated: Dec 12, 2018 #hadoop #sqoop #defaultmapper #defaultreducer #hadoopinterviewquestion In this post we are going to focus the default number of mappers and reducers in the sqoop. scope is the part of Hadoop... Read more »

Big data and Hadoop difference

This article gives the difference between hadoop and big data. I personally found many students have confusion between hadoop and big data. Actually both are different entity. In one sentense I would... Read more »

Switch your career from Oracle DBA to Hadoop Bigdata

I have seen a many people those who want to switch their career from the Oracle DBA to the Hadoop but they are not sure how to start and from where to... Read more »