Skip to content

Commit 244fe9b

Browse files
author
Pedro Bernardo
committed
Added pairRdd/join/JoinOperations.py
1 parent 56aa850 commit 244fe9b

File tree

1 file changed

+21
-0
lines changed

1 file changed

+21
-0
lines changed

pairRdd/join/JoinOperations.py

Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
from pyspark import SparkContext
2+
3+
if __name__ == "__main__":
4+
5+
sc = SparkContext("local", "JoinOperations")
6+
sc.setLogLevel("ERROR")
7+
8+
ages = sc.parallelize([("Tom", 29), ("John", 22)])
9+
addresses = sc.parallelize([("James", "USA"), ("John", "UK")])
10+
11+
join = ages.join(addresses)
12+
join.saveAsTextFile("out/age_address_join.text")
13+
14+
leftOuterJoin = ages.leftOuterJoin(addresses)
15+
leftOuterJoin.saveAsTextFile("out/age_address_left_out_join.text")
16+
17+
rightOuterJoin = ages.rightOuterJoin(addresses)
18+
rightOuterJoin.saveAsTextFile("out/age_address_right_out_join.text")
19+
20+
fullOuterJoin = ages.fullOuterJoin(addresses)
21+
fullOuterJoin.saveAsTextFile("out/age_address_full_out_join.text")

0 commit comments

Comments
 (0)