Mastering Apache Storm : processing big data streams in real time /
Master the intricacies of Apache Storm and develop real-time stream processing applications with ease About This Book Exploit the various real-time processing functionalities offered by Apache Storm such as parallelism, data partitioning, and more Integrate Storm with other Big Data technologies lik...
Clasificación: | Libro Electrónico |
---|---|
Autor principal: | |
Formato: | Electrónico eBook |
Idioma: | Inglés |
Publicado: |
Birmingham :
Packt Publishing,
2017.
|
Temas: | |
Acceso en línea: | Texto completo |
Tabla de Contenidos:
- Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Customer Feedback; Table of Contents; Preface; Chapter 1: Real-Time Processing and Storm Introduction; Apache Storm; Features of Storm; Storm components; Nimbus; Supervisor nodes; The ZooKeeper cluster; The Storm data model; Definition of a Storm topology; Operation modes in Storm; Programming languages; Summary; Chapter 2: Storm Deployment, Topology Development, and Topology Options; Storm prerequisites; Installing Java SDK 7; Deployment of the ZooKeeper cluster; Setting up the Storm cluster.
- Developing the hello world exampleThe different options of the Storm topology; Deactivate; Activate; Rebalance; Kill; Dynamic log level settings; Walkthrough of the Storm UI; Cluster Summary section; Nimbus Summary section; Supervisor Summary section; Nimbus Configuration section; Topology Summary section; Dynamic log level settings; Updating the log level from the Storm UI; Updating the log level from the Storm CLI; Summary; Chapter 3: Storm Parallelism and Data Partitioning; Parallelism of a topology; Worker process; Executor; Task; Configure parallelism at the code level.
- Worker process, executor, and task distributionRebalance the parallelism of a topology; Rebalance the parallelism of a SampleStormClusterTopology topology; Different types of stream grouping in the Storm cluster; Shuffle grouping; Field grouping; All grouping; Global grouping; Direct grouping; Local or shuffle grouping; None grouping; Custom grouping; Guaranteed message processing; Tick tuple; Summary; Chapter 4: Trident Introduction; Trident introduction; Understanding Trident's data model; Writing Trident functions, filters, and projections; Trident function; Trident filter.
- Trident projectionTrident repartitioning operations; Utilizing shuffle operation; Utilizing partitionBy operation; Utilizing global operation; Utilizing broadcast operation; Utilizing batchGlobal operation; Utilizing partition operation; Trident aggregator; partitionAggregate; aggregate; ReducerAggregator; Aggregator; CombinerAggregator; persistentAggregate; Aggregator chaining; Utilizing the groupBy operation; When to use Trident; Summary; Chapter 5: Trident Topology and Uses; Trident groupBy operation; groupBy before partitionAggregate; groupBy before aggregate; Non-transactional topology.
- Trident hello world topologyTrident state; Distributed RPC; When to use Trident; Summary; Chapter 6: Storm Scheduler; Introduction to Storm scheduler; Default scheduler; Isolation scheduler; Resource-aware scheduler; Component-level configuration; Memory usage example; CPU usage example; Worker-level configuration; Node-level configuration; Global component configuration; Custom scheduler; Configuration changes in the supervisor node; Configuration setting at component level; Writing a custom supervisor class; Converting component IDs to executors; Converting supervisors to slots.