Cargando…

Understanding metadata : create the foundation for a scalable data architecture /

One viable option for organizations looking to harness massive amounts of data is the data lake, a single repository for storing all the raw data, both structured and unstructured, that floods into the company. But that isn't the end of the story. The key to making a data lake work is data gove...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autores principales: Castanedo, Federico (Autor), Gidley, Scott (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Sebastopol, CA : O'Reilly Media, [2017]
Edición:First edition.
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)

MARC

LEADER 00000cam a2200000Ii 4500
001 OR_ocn982065210
003 OCoLC
005 20231017213018.0
006 m o d
007 cr unu||||||||
008 170410s2017 caua o 000 0 eng d
040 |a UMI  |b eng  |e rda  |e pn  |c UMI  |d STF  |d TOH  |d OCLCF  |d OCLCQ  |d COO  |d UOK  |d CEF  |d KSU  |d VT2  |d OCLCQ  |d DEBBG  |d WYU  |d C6I  |d UAB  |d CZL  |d OCLCO  |d OCLCQ 
020 |z 9781491974889 
029 1 |a GBVCP  |b 1004860935 
029 1 |a AU@  |b 000066233733 
035 |a (OCoLC)982065210 
037 |a CL0500000846  |b Safari Books Online 
050 4 |a Z666.7 
082 1 4 |a [E] 
049 |a UAMI 
100 1 |a Castanedo, Federico,  |e author. 
245 1 0 |a Understanding metadata :  |b create the foundation for a scalable data architecture /  |c Federico Castanedo and Scott Gidley. 
246 3 0 |a Create the foundation for a scalable data architecture 
250 |a First edition. 
264 1 |a Sebastopol, CA :  |b O'Reilly Media,  |c [2017] 
264 4 |c ©2017 
300 |a 1 online resource (1 volume) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 0 |a Online resource; title from title page (Safari, viewed April 10, 2017). 
520 |a One viable option for organizations looking to harness massive amounts of data is the data lake, a single repository for storing all the raw data, both structured and unstructured, that floods into the company. But that isn't the end of the story. The key to making a data lake work is data governance, using metadata to provide valuable context through tagging and cataloging. This practical report examines why metadata is essential for managing, migrating, accessing, and deploying any big data solution. Authors Federico Castanedo and Scott Gidley dive into the specifics of analyzing metadata for keeping track of your data--where it comes from, where it's located, and how it's being used--so you can provide safeguards and reduce risk. In the process, you'll learn about methods for automating metadata capture. This report also explains the main features of a data lake architecture, and discusses the pros and cons of several data lake management solutions that support metadata. These solutions include: Traditional data integration/management vendors such as the IBM Research Accelerated Discovery Lab Tooling from open source projects, including Teradata Kylo and Informatica Startups such as Trifacta and Zaloni that provide best of breed technology. 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
650 0 |a Metadata. 
650 6 |a Métadonnées. 
650 7 |a Metadata.  |2 fast  |0 (OCoLC)fst01017519 
700 1 |a Gidley, Scott,  |e author. 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781491988992/?ar  |z Texto completo (Requiere registro previo con correo institucional) 
994 |a 92  |b IZTAP