
From flat files to deconstructed databases : the evolution and future of the big data ecosystem /

"Julien Le Dem (WeWork) discusses the key open source components of the big data ecosystem--including Apache Calcite, Parquet, Arrow, Avro, and Kafka as well as batch and streaming systems--and explains how they relate to each other and how they make the ecosystem more of a database and less of...


Bibliographic Details
Classification: E-book
Corporate Author: O'Reilly Strata Data Conference
Format: Electronic, Congresses and conferences, Video
Language: English
Published: [Place of publication not identified] : O'Reilly Media, 2019.
Subjects: Big data; Data mining
Online Access: Full text (requires prior registration with an institutional email address)

MARC

LEADER 00000cgm a2200000 i 4500
001 OR_on1138875892
003 OCoLC
005 20231017213018.0
006 m o c
007 cr cna||||||||
007 vz czazuu
008 200203s2019 xx 044 o vleng d
040 |a UMI  |b eng  |e rda  |e pn  |c UMI  |d OCLCF  |d TOH  |d OCLCO  |d OCLCQ  |d OCLCO 
035 |a (OCoLC)1138875892 
037 |a CL0501000090  |b Safari Books Online 
050 4 |a QA76.9.B45 
049 |a UAMI 
100 1 |a Le Dem, Julien,  |e onscreen presenter. 
245 1 0 |a From flat files to deconstructed databases :  |b the evolution and future of the big data ecosystem /  |c Julien Le Dem. 
264 1 |a [Place of publication not identified] :  |b O'Reilly Media,  |c 2019. 
300 |a 1 online resource (1 streaming video file (43 min., 49 sec.)) 
336 |a two-dimensional moving image  |b tdi  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
337 |a video  |b v  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
500 |a Title from resource description page (Safari, viewed January 31, 2020). 
511 0 |a Presenter, Julien Le Dem. 
520 |a "Julien Le Dem (WeWork) discusses the key open source components of the big data ecosystem--including Apache Calcite, Parquet, Arrow, Avro, and Kafka as well as batch and streaming systems--and explains how they relate to each other and how they make the ecosystem more of a database and less of a filesystem. (Parquet is the columnar data layout to optimize data at rest for querying. Arrow is the in-memory representation for maximum throughput execution and overhead-free data exchange. Calcite is the optimizer to make the most of our infrastructure capabilities.) Julien also explores the emerging components that are still missing or haven't become standard yet to fully materialize the transformation to an extremely flexible database that lets you innovate with your data. This session was recorded at the 2019 O'Reilly Strata Data Conference in San Francisco."--Resource description page 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
650 0 |a Big data. 
650 0 |a Data mining. 
650 2 |a Data Mining 
650 6 |a Données volumineuses. 
650 6 |a Exploration de données (Informatique) 
650 7 |a Big data.  |2 fast  |0 (OCoLC)fst01892965 
650 7 |a Data mining.  |2 fast  |0 (OCoLC)fst00887946 
655 4 |a Electronic videos. 
711 2 |a O'Reilly Strata Data Conference  |d (2019 :  |c San Francisco, Calif.) 
856 4 0 |u https://learning.oreilly.com/videos/~/0636920339847/?ar  |z Full text (requires prior registration with an institutional email address) 
994 |a 92  |b IZTAP