Sqoop

Sqoop allows importing and exporting data between relational databases and Hadoop. It uses MapReduce to import data from a relational database into HDFS, HBase, or Hive. For import, Sqoop first connects to the database and retrieves metadata, then executes a MapReduce job to import the data. It supports importing a full table, selected columns/rows, and incremental imports. Sqoop export works similarly but in reverse, using MapReduce to export data from HDFS, HBase, or Hive to a relational database in bulk.

Sqoop Import

Imports traditional RDBMS data into HDFS, HBase, and Hive.


Prerequisites:-
A running RDBMS (e.g. MySQL)
Hadoop cluster up and running
HADOOP_HOME environment variable is set
Basic command

bin/sqoop import --connect jdbc:mysql://url --username name --password pwd
--table name --target-dir /path/for/storing/db
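
As a concrete sketch, assuming a MySQL database named shop on host dbhost with a table customers (all names hypothetical):

bin/sqoop import --connect jdbc:mysql://dbhost/shop --username sqoop_user --password secret
--table customers --target-dir /user/hadoop/customers -m 4

The imported rows land under /user/hadoop/customers on HDFS, one part file per map task.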
How import works?

First, a connection is made to the database server to pull the metadata of the
input table being imported.
Sqoop then executes a MapReduce job on the Hadoop cluster, using that metadata
to perform the actual import.
Modify Delimiters

--fields-terminated-by ','     Field separator
--lines-terminated-by '\n'     Record (line) terminator
--escaped-by '\\'              Escape character
--enclosed-by '\"'             Field-enclosing character
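
As a sketch, the delimiter options attach to an ordinary import command (hypothetical connection, table and path):

bin/sqoop import --connect jdbc:mysql://dbhost/shop --username sqoop_user --password secret
--table customers --target-dir /user/hadoop/customers
--fields-terminated-by '\t' --lines-terminated-by '\n' --escaped-by '\\' --enclosed-by '\"'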
Different file formats

--as-sequencefile     Store data as SequenceFiles
--as-avrodatafile     Store data as Avro data files
--as-textfile         Store data as plain text files (the default)

--direct              Use the database's direct (non-JDBC) fast path where available
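
For example, a minimal sketch (hypothetical names) importing the same table as Avro data files instead of plain text:

bin/sqoop import --connect jdbc:mysql://dbhost/shop --username sqoop_user --password secret
--table customers --target-dir /user/hadoop/customers_avro --as-avrodatafile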


Different table access

--columns "field1,field2"          Import selected columns
--where "condition"                Import selected rows
--columns ... --where ...          Import selected rows of selected columns
--query "SQL query"                Import the result set of an arbitrary SQL query
import-all-tables                  Import all tables of the database
-m n                               Number of map tasks to run in parallel
--split-by column_name             Column used to divide the work among map tasks
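
As a sketch of a free-form query import (hypothetical names): when --query is used with more than one mapper, Sqoop requires the literal $CONDITIONS token in the WHERE clause plus a --split-by column, and --target-dir must be given explicitly:

bin/sqoop import --connect jdbc:mysql://dbhost/shop --username sqoop_user --password secret
--query 'SELECT id, name, city FROM customers WHERE city = "Pune" AND $CONDITIONS'
--split-by id --target-dir /user/hadoop/pune_customers -m 4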
Incremental import

For importing only new or updated records.

For appending new records:
--incremental append --last-value value --check-column column_name

For appending and updating records:
--incremental lastmodified --last-value timestamp --check-column column_name
(The table must maintain a last-modified timestamp, i.e. an extra column.)
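
A sketch of an append-mode incremental import (hypothetical table with an auto-increment id column); only rows whose id is greater than the supplied --last-value are fetched:

bin/sqoop import --connect jdbc:mysql://dbhost/shop --username sqoop_user --password secret
--table orders --target-dir /user/hadoop/orders
--incremental append --check-column id --last-value 10000

At the end of the run Sqoop reports the new last value to pass to the next run; a saved Sqoop job records it automatically (see Job info).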
Job info

--create job_name     Save a Sqoop command as a named job
--delete job_name     Delete a saved job
--exec job_name       Execute a saved job
--show job_name       Show the parameters of a saved job
--list                List all saved jobs
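
These are options of the sqoop job tool. A sketch of saving and running the incremental import above as a reusable job (names hypothetical; note the bare -- separating the job options from the stored command):

bin/sqoop job --create orders_incr -- import --connect jdbc:mysql://dbhost/shop
--username sqoop_user --table orders --incremental append --check-column id --last-value 0

bin/sqoop job --list
bin/sqoop job --show orders_incr
bin/sqoop job --exec orders_incr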
Importing data into HBase

Prerequisites:-
HBase cluster up and running
HBASE_HOME environment variable is set

For importing a table with a primary key:
bin/sqoop import --connect jdbc:mysql://url --username name --password pwd
--table name --hbase-table hbase_name --column-family hbase_table_col1
--hbase-create-table

For importing a table without a primary key:
bin/sqoop import --connect jdbc:mysql://url --username name --password pwd
--table name --hbase-table hbase_name --column-family hbase_table_col1
--hbase-row-key col_name --hbase-create-table
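
A sketch with hypothetical names, loading a customers table into an HBase table customers_hb under column family cf, keyed by the id column:

bin/sqoop import --connect jdbc:mysql://dbhost/shop --username sqoop_user --password secret
--table customers --hbase-table customers_hb --column-family cf
--hbase-row-key id --hbase-create-table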
Importing a database into Hive

Prerequisites:-
Hive installed
HIVE_HOME environment variable is set

Importing a table with a primary key:
bin/sqoop import --connect jdbc:mysql://url --username name --password pwd
--table name --hive-table name --create-hive-table --hive-import --hive-home
path/to/hive/home

Importing a table without a primary key:
bin/sqoop import --connect jdbc:mysql://url --username name --password pwd
--table name --hive-table name --create-hive-table --hive-import --hive-home
path/to/hive/home --split-by col_name
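
A sketch with hypothetical names; --hive-import moves the data into Hive's warehouse and --create-hive-table generates a matching Hive schema from the RDBMS metadata:

bin/sqoop import --connect jdbc:mysql://dbhost/shop --username sqoop_user --password secret
--table customers --hive-import --create-hive-table --hive-table shop_customers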
Getting HDFS data into Hive

hive> CREATE EXTERNAL TABLE student(id INT, name STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
LINES TERMINATED BY '\n'
STORED AS TEXTFILE
LOCATION '/user/username/student';
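
Once the external table is defined over the imported directory it can be queried immediately; a minimal sketch from the shell (table name taken from the DDL above):

hive -e "SELECT COUNT(*) FROM student;"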
Sqoop export

Basic command:
bin/sqoop export --connect jdbc:location --table name --username name --password pwd
--export-dir /location
--input-fields-terminated-by ','
--input-lines-terminated-by '\n'
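
A sketch with hypothetical names, pushing comma-delimited HDFS files back into a MySQL table customers_copy (the target table must already exist in the database):

bin/sqoop export --connect jdbc:mysql://dbhost/shop --username sqoop_user --password secret
--table customers_copy --export-dir /user/hadoop/customers
--input-fields-terminated-by ',' --input-lines-terminated-by '\n'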
How export works

Sqoop first validates the metadata of the target RDBMS table.

It then executes a MapReduce job to perform the actual transfer.

Use the --staging-table argument to stage the exported data and move it into the target table in a single transaction.
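
A sketch of a staged export (hypothetical names); rows are written to the empty staging table first and moved into the target table in one transaction, so a failed job never leaves the target partially exported:

bin/sqoop export --connect jdbc:mysql://dbhost/shop --username sqoop_user --password secret
--table customers_copy --staging-table customers_copy_stg --clear-staging-table
--export-dir /user/hadoop/customers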


Export from Hive

Create an invoice table as:

CREATE TABLE invoice(
id INT NOT NULL PRIMARY KEY,
`from` VARCHAR(32), `to` VARCHAR(32));

Use command:-
bin/sqoop export --connect jdbc:location --table invoice --export-dir
location/invoice --username name --password pwd -m no.
--input-fields-terminated-by '\001' (octal of ^A, Hive's default field delimiter)
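
For example, if the invoice data sits in Hive's default warehouse path (hypothetical location and connection), the export could look like:

bin/sqoop export --connect jdbc:mysql://dbhost/shop --username sqoop_user --password secret
--table invoice --export-dir /user/hive/warehouse/invoice -m 1
--input-fields-terminated-by '\001'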
