Using Kudu with Apache Spark and Apache Flume : hands-on experience using the Kudu Storage Manager with Apache Spark, Spark SQL, MLlib, and Apache Flume /
Classification: | Electronic Book |
---|---|
Other Authors: | |
Format: | Electronic Video |
Language: | English |
Published: | [Place of publication not identified] : O'Reilly Media, [2017] |
Subjects: | |
Online Access: | Full text (requires prior registration with an institutional email address) |
Summary: | "Apache Kudu, the breakthrough storage technology, is often used in conjunction with other Hadoop ecosystem frameworks for data ingest, processing, and analysis. This is a practical, hands-on course that shows you how Kudu works with four of those frameworks: Apache Spark, Spark SQL, MLlib, and Apache Flume. You'll use the Kudu-Spark module with Spark and SparkSQL to seamlessly create, move, and update data between Kudu and Spark; then use Apache Flume to stream events into a Kudu table, and finally, query it using Apache Impala. The course is designed for learners with some limited experience using Hadoop ecosystem components like HDFS, Hive, Spark, or Impala."--Resource description page. |
Notes: | Title from title screen (viewed April 4, 2017). Date of publication from resource description page. |
Physical Description: | 1 online resource (1 streaming video file (28 min., 46 sec.)) : digital, sound, color |