This repository contains the code for the book Stream Processing: Hands-on with Apache Flink.
In order to run the code samples we will need a Kafka and Flink cluster up and running. You can also run the Flink examples from within your favorite IDE in which case you don't need a Flink Cluster.
If you want to run the examples inside a Flink Cluster run to start the Pulsar and Flink clusters.
docker-compose up
When the cluster is up and running successfully run the following command for redpanda:
./redpanda-setup.sh
or this command for kafka setup
./kafka-setup.sh
CREATE FUNCTION maskfn AS 'io.streamingledger.udfs.MaskingFn' LANGUAGE JAVA USING JAR '/opt/flink/jars/spf-0.1.0.jar';
CREATE FUNCTION splitfn AS 'io.streamingledger.udfs.SplitFn' LANGUAGE JAVA USING JAR '/opt/flink/jars/spf-0.1.0.jar';
CREATE FUNCTION lookup AS 'io.streamingledger.udfs.AsyncLookupFn' LANGUAGE JAVA USING JAR '/opt/flink/jars/spf-0.1.0.jar';
CREATE TEMPORARY VIEW sample AS
SELECT *
FROM transactions
LIMIT 10;
SELECT transactionId, maskfn(UUID()) AS maskedCN FROM sample;
SELECT * FROM transactions, LATERAL TABLE(splitfn(operation));
SELECT
transactionId,
serviceResponse,
responseTime
FROM sample, LATERAL TABLE(lookup(transactionId));
- Package the application and create an executable jar file
mvn clan package
-
Copy it under the jar files to be included in the custom Flink images
-
Start the cluster to build the new images by running
docker-compose up
- Deploy the flink job
docker exec -it jobmanager ./bin/flink run \
--class io.streamingledger.datastream.BufferingStream \
jars/spf-0.1.0.jar