polyglotDataNerd/poly-flink

Apache Flink

Apache Flink Examples EMR

Dependencies:

TODO:

Batch

    flink run poly-flink-1.0-development.jar "batch" "s3a://poly-testing/covid/combined/"

Stream

    flink run poly-flink-1.0-development.jar "socketstream" "127.0.0.1" 9000 "\n"

    flink run poly-flink-1.0-development.jar "filestream" "s3a://poly-testing/covid/combined/"

Table

    flink run poly-flink-1.0-development.jar "batchtable" "s3a://poly-testing/covid/orc/combined/"

To use the Flink Scala shell on EMR, SSH into the cluster and run the commands below. The example was run on r5d.xlarge instances.

    aws s3 cp s3://bigdata-utility/jars/flink-sql-orc_2.12-1.11.0.jar .
    
    /usr/lib/flink/bin/start-scala-shell.sh yarn -s 10 -jm 1024m -tm 4096m --addclasspath /home/hadoop/flink-sql-orc_2.12-1.11.0.jar
  • -s: number of slots per TaskManager (typically set to the number of cores)
  • -jm: JobManager memory
  • -tm: memory per TaskManager
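
Since `-s` is meant to track the core count, the value can be derived instead of hardcoded. The following is a minimal sketch, not part of this repository: `scala_shell_cmd` is a hypothetical helper name, and it assumes every TaskManager node has the same number of cores as the node the function runs on. It only prints the command so it can be reviewed before running.

```shell
#!/usr/bin/env bash
# Hypothetical helper: build the start-scala-shell.sh command line.
# Assumption: TaskManager nodes have the same core count as this node;
# pass the core count explicitly as the first argument if they do not.
scala_shell_cmd() {
  local cores="${1:-$(nproc)}"   # -s: slots per TaskManager, one per core
  local jm_mem="${2:-1024m}"     # -jm: JobManager memory
  local tm_mem="${3:-4096m}"     # -tm: memory per TaskManager
  local jar="${4:-/home/hadoop/flink-sql-orc_2.12-1.11.0.jar}"
  # Print only; nothing is executed here.
  printf '%s yarn -s %s -jm %s -tm %s --addclasspath %s\n' \
    /usr/lib/flink/bin/start-scala-shell.sh \
    "$cores" "$jm_mem" "$tm_mem" "$jar"
}
```

Calling `scala_shell_cmd 4` prints the command with `-s 4` and the memory settings used above; calling it with no arguments falls back to `nproc` for the slot count.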

CLI Interaction