Flume Introduction

What is Flume?

Apache Flume is a tool/service/data ingestion mechanism for collecting aggregating and transporting large amounts of streaming data such as log files, events (etc…) from various sources to a centralized data store.

Flume is a highly reliable, distributed, and configurable tool. It is principally designed to copy streaming data (log data) from various web servers to HDFS.

https://s0.wailian.download/2019/06/03/apache_flume-min.pngapache_flume-min

Architecture

https://s0.wailian.download/2019/06/03/flume_architecture-min.pngflume_architecture-min

Flume Event

https://s0.wailian.download/2019/06/03/flume_event-min.pngflume_event-min

Flume Agent

https://s0.wailian.download/2019/06/03/flume_agent1-min.jpgflume_agent1-min

Data Flow

https://s0.wailian.download/2019/06/03/flume_dataflow-min.pngflume_dataflow-min