RocketMQ Streams

中文文档

Quick Start

Features

Lightweight deployment: RocketMQ Streams can be deployed separately or in cluster mode.
Various types of data input and output: source supports RocketMQ while sink supports databases and RocketMQ, etc.

DataStream Example

import org.apache.rocketmq.streams.client.transform.DataStream;

DataStreamSource source=StreamBuilder.dataStream("namespace","pipeline");
    source
    .fromFile("～/admin/data/text.txt",false)
    .map(message->message)
    .toPrint(1)
    .start();

Maven Repository

<dependency>
    <groupId>org.apache.rocketmq</groupId>
    <artifactId>rocketmq-streams-clients</artifactId>
    <version>1.0.0-SNAPSHOT</version>
</dependency>

Core API

RocketMQ Streams implements a series of advanced APIs, allowing users to write stream computing programs conveniently and achieve their own business requirements.

StreamBuilder

StreamBuilder is used to build the source of stream tasks. It contains two methods: dataStream() and tableStream(), which return two sources, DataStreamSource and TableStreamSource, respectively.

dataStream(nameSpaceName,pipelineName) returns an instance of DataStreamSource, used for segmented programming to achieve stream computing tasks.
tableStream(nameSpaceName,pipelineName) returns an instance of TableStreamSource, used for script programming to achieve stream computing tasks.

DataStream API

Source

DataStreamSource is a source class of segmented programming, used to interface with various data sources and obtain data from major message queues.

fromFile: reads data from the file. This method contains two parameters:
- filePath: specifies which file path to read. Required.
- isJsonData: specifies whether data is in JSON format. Optional. Default value: true.
- tags: the tags for filtering messages used by the RocketMQ consumer. Optional.
fromRocketmq: obtains data from RocketMQ, including four parameters:
- topic: the topic name of RocketMQ. Required.
- groupName: the name of the consumer group. Required.
- isJson: specifies whether data is in JSON format. Optional.
- tags: the tags for filtering messages used by the RocketMQ consumer. Optional.
from: custom data source. You can specify your own data source by implementing ISource interface.

transform

transform allows the input source data to be modified during the stream calculation process for the next step; DataStream API includes DataStream, JoinStream, SplitStream, WindowStream, and many other transform classes.

DataStream

DataStream implements a series of common stream calculation operators as follows:

map: returns a new DataStream by passing each record of the source to the func function.
flatmap: similar to map. One input item corresponds to 0 or more output items.
filter: returns a new DataStream based on the record of the source DataStream only when the ** func** function returns true.
forEach: executes the func function once for each record and returns a new DataStream.
selectFields: returns the corresponding field value for each record, and returns a new DataStream.
operate: executes a custom function for each record and returns a new DataStream.
script: executes a script for each recorded field, returns new fields, and generates a new DataStream.
toPrint: prints the result on the console and generates a new DataStreamAction instance.
toFile: saves the result as a file and generates a new DataStreamAction instance.
toDB: saves the result to the database.
toRocketmq: outputs the result to RocketMQ.
to: outputs the result to the specified storage through the custom ISink interface.
window: performs relevant statistical analysis in the window, generally used in conjunction with groupBy. window() is used to define the size of the window, and groupBy( ) used to define the main key of statistical analysis. You can specify multiple main keys:
- count: counts in the window.
- min: gets the minimum of the statistical value in the window.
- max: gets the maximum of the statistical value in the window.
- avg: gets the average of the statistical values in the window.
- sum: gets the sum of the statistical values in the window.
- reduce: performs custom summary calculations in the window.
join: associates the two streams or one stream and one physical table according to the conditions and merges them into a large stream for related calculations.
- dimJoin associate a stream with a physical table which can be a file or a db table, and all matching records are retained
- dimLeftJoin After a flow is associated with a physical table, all data of the flow is reserved and fields that do not match the physical table are left blank
- join
- leftJoin
union: merges the two streams.
split: splits a data stream into different data streams according to tags for downstream analysis and calculation.
with: specifies related strategies during the calculation, including Checkpoint and state storage strategies, etc.

Strategy

The Strategy mechanism is mainly used to control the underlying logic during the operation of the computing engine, such as the storage methods of Checkpoint and state etc. Subsequent controls for windows, dual-stream joins, and so on will be added. All control strategies are transmitted through the with operator. Multiple policy types can be transmitted at the same time.

//Specify the storage strategy for Checkpoint.
source
    .fromRocketmq("TSG_META_INFO","")
    .map(message->message+"--")
    .toPrint(1)
    .with(CheckpointStrategy.db("jdbc:mysql://XXXXX:3306/XXXXX","","",0L))
    .start();

Name		Name	Last commit message	Last commit date
Latest commit History 428 Commits
docs		docs
rocketmq-streams-channel-db		rocketmq-streams-channel-db
rocketmq-streams-channel-es		rocketmq-streams-channel-es
rocketmq-streams-channel-http		rocketmq-streams-channel-http
rocketmq-streams-channel-mqtt		rocketmq-streams-channel-mqtt
rocketmq-streams-channel-rocketmq		rocketmq-streams-channel-rocketmq
rocketmq-streams-channel-syslog		rocketmq-streams-channel-syslog
rocketmq-streams-checkpoint		rocketmq-streams-checkpoint
rocketmq-streams-clients		rocketmq-streams-clients
rocketmq-streams-commons		rocketmq-streams-commons
rocketmq-streams-configurable		rocketmq-streams-configurable
rocketmq-streams-connectors		rocketmq-streams-connectors
rocketmq-streams-db-operator		rocketmq-streams-db-operator
rocketmq-streams-dbinit		rocketmq-streams-dbinit
rocketmq-streams-dim		rocketmq-streams-dim
rocketmq-streams-examples		rocketmq-streams-examples
rocketmq-streams-filter		rocketmq-streams-filter
rocketmq-streams-lease		rocketmq-streams-lease
rocketmq-streams-runner		rocketmq-streams-runner
rocketmq-streams-schedule		rocketmq-streams-schedule
rocketmq-streams-script		rocketmq-streams-script
rocketmq-streams-serviceloader		rocketmq-streams-serviceloader
rocketmq-streams-state		rocketmq-streams-state
rocketmq-streams-transport-minio		rocketmq-streams-transport-minio
rocketmq-streams-window		rocketmq-streams-window
.asf.yaml		.asf.yaml
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
NOTICE		NOTICE
README-chinese.md		README-chinese.md
README.md		README.md
build_without_test.sh		build_without_test.sh
pom.xml		pom.xml
quick_start.md		quick_start.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RocketMQ Streams

中文文档

Quick Start

Features

DataStream Example

Maven Repository

Core API

StreamBuilder

DataStream API

Source

transform

DataStream

Strategy

About

Releases

Packages

Languages

License

hzh0425/rocketmq-streams

Folders and files

Latest commit

History

Repository files navigation

RocketMQ Streams

中文文档

Quick Start

Features

DataStream Example

Maven Repository

Core API

StreamBuilder

DataStream API

Source

transform

DataStream

Strategy

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages