TimestampExtractor
@Evolving public class LogAndSkipOnInvalidTimestamp extends java.lang.Object
Embedded metadata timestamp was introduced in "KIP-32: Add timestamps to Kafka message" for the new 0.10+ Kafka message format.
Here, "embedded metadata" refers to the fact that compatible Kafka producer clients automatically and transparently embed such timestamps into message metadata they send to Kafka, which can then be retrieved via this timestamp extractor.
If the embedded metadata timestamp represents CreateTime (cf. Kafka broker setting
message.timestamp.type
and Kafka topic setting log.message.timestamp.type
),
this extractor effectively provides event-time semantics.
If LogAppendTime is used as broker/topic setting to define the embedded metadata timestamps,
using this extractor effectively provides ingestion-time semantics.
If you need processing-time semantics, use WallclockTimestampExtractor
.
Constructor | Description |
---|---|
LogAndSkipOnInvalidTimestamp() |
Modifier and Type | Method | Description |
---|---|---|
long |
extract(ConsumerRecord<java.lang.Object,java.lang.Object> record,
long previousTimestamp) |
Extracts the embedded metadata timestamp from the given
ConsumerRecord . |
long |
onInvalidTimestamp(ConsumerRecord<java.lang.Object,java.lang.Object> record,
long recordTimestamp,
long previousTimestamp) |
Writes a log WARN message when the extracted timestamp is invalid (negative) but returns the invalid timestamp as-is,
which ultimately causes the record to be skipped and not to be processed.
|
public long onInvalidTimestamp(ConsumerRecord<java.lang.Object,java.lang.Object> record, long recordTimestamp, long previousTimestamp)
record
- a data recordrecordTimestamp
- the timestamp extractor from the recordpreviousTimestamp
- the latest extracted valid timestamp of the current record's partition˙ (could be -1 if unknown)public long extract(ConsumerRecord<java.lang.Object,java.lang.Object> record, long previousTimestamp)
ConsumerRecord
.extract
in interface TimestampExtractor
record
- a data recordpreviousTimestamp
- the latest extracted valid timestamp of the current record's partition˙ (could be -1 if unknown)ConsumerRecord