Apache Flume

Publisher page: Apache, Cloudera
Category: Other System Software
Release: TKU 2022-Aug-1

Product Description

Flume is an open-source, distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant, with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple, extensible data model that allows for online analytic applications.

Software Pattern Summary

Product Component | OS Type | Versioning   | Pattern Depth
Apache Flume      | Unix    | Active, File | Instance-based

Platforms Supported by the Pattern

The pattern supports only UNIX platforms.

Identification

Software Instance Triggers

Trigger Node | OS Type | Attribute | Condition | Argument
Apache Flume | Unix    | cmd       | matches   | java
             |         | args      | regex     | "\-Dflume"

Simple Identification Mappings

The following components are identified using simple identity mappings.

Name         | cmd matches | args matches
Apache Flume | java        | regex '\-Dflume'
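
The cmd and args conditions listed above can be sketched as a simple predicate. This is an illustration only, not the TPL pattern itself: the function name `is_flume_process` and the sample command lines are hypothetical, and both conditions are assumed to be unanchored regular-expression searches.

```python
import re

# Conditions taken from the identification table.
CMD_PATTERN = re.compile(r"java")
ARGS_PATTERN = re.compile(r"\-Dflume")


def is_flume_process(cmd: str, args: str) -> bool:
    """Return True when both the command and its arguments match.

    Assumption: each condition is an unanchored search, so a full path
    such as /usr/bin/java still satisfies the cmd condition.
    """
    return bool(CMD_PATTERN.search(cmd)) and bool(ARGS_PATTERN.search(args))


# Hypothetical process listings used only for illustration.
print(is_flume_process("/usr/bin/java", "-Xmx20m -Dflume.root.logger=INFO,console"))
print(is_flume_process("/usr/bin/java", "-Dcatalina.base=/opt/tomcat"))
```

A Java process without a `-Dflume` argument is rejected, which is what keeps the trigger from firing on unrelated JVMs.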

Obtaining key variables

Obtaining Flume Install Root

The Flume install root is obtained from the trigger process arguments using the following regular expression:

  • "\-cp\s+(/.+)/conf"
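
As a rough illustration of how this expression behaves, the sketch below applies it to a sample argument string using Python's `re` module. The function name `extract_install_root` and the sample arguments (including the `/opt/flume` path) are hypothetical and are not taken from the pattern.

```python
import re
from typing import Optional

# Regular expression quoted in the pattern documentation.
INSTALL_ROOT_PATTERN = re.compile(r"\-cp\s+(/.+)/conf")


def extract_install_root(args: str) -> Optional[str]:
    """Return the captured install root, or None if the arguments do not match."""
    match = INSTALL_ROOT_PATTERN.search(args)
    return match.group(1) if match else None


# Hypothetical argument string for a Flume agent process.
sample_args = (
    "-Xmx20m -Dflume.root.logger=INFO,console "
    "-cp /opt/flume/conf:/opt/flume/lib/* "
    "org.apache.flume.node.Application -n agent1"
)
print(extract_install_root(sample_args))  # /opt/flume
```

Note that `.+` is greedy in Python's `re` module: if the argument string contained a later `/conf` (for example an option pointing at a file under the conf directory), the capture would extend up to that last occurrence. The sample deliberately contains only one.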


Versioning

Version information for the product is currently collected using one of two possible methods.

The methods are tried in an order of precedence based on likely success and/or accuracy of the information that can be gathered. Once a result is obtained, the methods lower in precedence are not attempted. In order of precedence the methods are:

Active Versioning

BMC Atrium Discovery extracts version information from the output of the following command:

"%install_root%/bin/flume-ng version"
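
A minimal sketch of this step, under stated assumptions: the source does not say how the version is parsed out of the command output, so the `Flume\s+...` expression below is a guess, and the helper names and the sample output line are hypothetical.

```python
import re
import subprocess
from typing import List, Optional

# Assumption: the command prints a line such as "Flume 1.9.0".
# The documentation does not specify the parsing expression.
VERSION_PATTERN = re.compile(r"Flume\s+(\d+(?:\.\d+)*)")


def build_version_command(install_root: str) -> List[str]:
    """Substitute the install root into the documented command."""
    return [f"{install_root}/bin/flume-ng", "version"]


def parse_version(output: str) -> Optional[str]:
    """Extract a dotted version number from the command output, if present."""
    match = VERSION_PATTERN.search(output)
    return match.group(1) if match else None


def active_version(install_root: str, timeout: float = 30.0) -> Optional[str]:
    """Run the command and return the parsed version, or None on any failure."""
    try:
        result = subprocess.run(
            build_version_command(install_root),
            capture_output=True,
            text=True,
            timeout=timeout,
            check=False,
        )
    except (OSError, subprocess.TimeoutExpired):
        # Missing binary, permission problem, or a hung command.
        return None
    return parse_version(result.stdout)
```

Returning `None` on failure, rather than raising, lets a caller fall through to the next versioning method.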


File Versioning

If the version information was not obtained using the previous method, BMC Atrium Discovery extracts it from the RELEASE-NOTES file located in the <install_root> directory using the following regular expression:

  • Apache Flume\s+(\d+(?:\.\d+)*)
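
The file-based method and the precedence rule described in this section might be sketched as follows. The regular expression is the one quoted above; the function names, the sample file content, and the idea of passing the methods as a list of callables are illustrative assumptions rather than the pattern's actual implementation.

```python
import re
from pathlib import Path
from typing import Callable, Iterable, Optional

# Regular expression quoted in the pattern documentation.
RELEASE_NOTES_PATTERN = re.compile(r"Apache Flume\s+(\d+(?:\.\d+)*)")


def version_from_release_notes(text: str) -> Optional[str]:
    """Return the first version matched in RELEASE-NOTES content."""
    match = RELEASE_NOTES_PATTERN.search(text)
    return match.group(1) if match else None


def file_version(install_root: str) -> Optional[str]:
    """Read <install_root>/RELEASE-NOTES and extract the version, if any."""
    try:
        text = (Path(install_root) / "RELEASE-NOTES").read_text(errors="replace")
    except OSError:
        return None
    return version_from_release_notes(text)


def first_version(
    install_root: str,
    methods: Iterable[Callable[[str], Optional[str]]],
) -> Optional[str]:
    """Try each method in order of precedence and stop at the first result."""
    for method in methods:
        version = method(install_root)
        if version:
            return version
    return None


# Hypothetical RELEASE-NOTES content used only for illustration.
print(version_from_release_notes("Apache Flume 1.5.2\n\nCONTENTS\n"))  # 1.5.2
```

Calling `first_version(root, [active_method, file_version])` would mirror the documented order, with `active_method` standing in for whatever runs the `flume-ng version` command.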

Application Model Produced by Software Pattern

Product Architecture

Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple extensible data model that allows for online analytic application.

Software Pattern Model


SI Depth

The pattern produces an instance-based Software Instance for Apache Flume. The SI key is created from the SI type (Apache Flume), the name of the host that runs the Apache Flume instance, and the process with its arguments.

Software Instance Modeling


Subject Matter Expertise

Subject Matter Expert input is welcome on any other approaches, not discussed above, to improving the product's relationships to other products.

Testing

The pattern for Apache Flume was tested using actual installations of Apache Flume running on Linux.

Information Sources

Open Issues

N/A

Created By: Dmytro Ostapchuk (18 December 2014)
