Cataloging Unstructured Data in IBM Watson Knowledge Catalog with IBM Spectrum Discover

This IBM® Redpaper publication explains how IBM Spectrum® Discover integrates with the IBM Watson® Knowledge Catalog (WKC) component of IBM Cloud® Pak for Data (IBM CP4D) to make the enriched catalog content in IBM Spectrum Discover along with the associated data available in WKC and IBM CP4D. From...

Bibliographic Details
Main authors: Dain, Joseph (Author), Selim, Abeer (Author), Patil, Anil (Author), Vollmar, Christopher (Author), De Rezende, Flavio (Author), Greco, Frank (Author), Lee, Frank (Author), Crawford, Isom (Author), Bozhinov, Ivaylo (Author), Wong, Joanna (Author), Blumert, Joshua (Author), Coyne, Larry (Author)
Corporate author: Safari, an O'Reilly Media Company
Format: Electronic eBook
Language: English
Published: IBM Redbooks, 2020.
Edition: 1st edition.
Subjects:
Online access: Full text (requires prior registration with an institutional email address)

MARC

LEADER 00000cam a22000007a 4500
001 OR_on1192526752
003 OCoLC
005 20231017213018.0
006 m o d
007 cr cnu||||||||
008 120820s2020 xx o 000 0 eng
040 |a AU@  |b eng  |e pn  |c AU@  |d UAB  |d OCLCO  |d OCLCF  |d LVT  |d OCLCO  |d OCLCQ  |d OCLCO 
019 |a 1302275428 
020 |z 9780738459028 
020 |z 073845902X 
024 8 |a 9780738459028 
029 0 |a AU@  |b 000067830083 
035 |a (OCoLC)1192526752  |z (OCoLC)1302275428 
049 |a UAMI 
100 1 |a Dain, Joseph,  |e author. 
245 1 0 |a Cataloging Unstructured Data in IBM Watson Knowledge Catalog with IBM Spectrum Discover /  |c Dain, Joseph. 
250 |a 1st edition. 
264 1 |b IBM Redbooks,  |c 2020. 
300 |a 1 online resource (108 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file 
520 |a This IBM® Redpaper publication explains how IBM Spectrum® Discover integrates with the IBM Watson® Knowledge Catalog (WKC) component of IBM Cloud® Pak for Data (IBM CP4D) to make the enriched catalog content in IBM Spectrum Discover along with the associated data available in WKC and IBM CP4D. From an end-to-end IBM solution point of view, IBM CP4D and WKC provide state-of-the-art data governance, collaboration, and artificial intelligence (AI) and analytics tools, and IBM Spectrum Discover complements these features by adding support for unstructured data on large-scale file and object storage systems on premises and in the cloud. Many organizations face challenges in managing unstructured data. Some challenges that companies face include: Pinpointing and activating relevant data for large-scale analytics, machine learning (ML) and deep learning (DL) workloads. Lacking the fine-grained visibility that is needed to map data to business priorities. Removing redundant, obsolete, and trivial (ROT) data and identifying data that can be moved to a lower-cost storage tier. Identifying and classifying sensitive data as it relates to various compliance mandates, such as the General Data Protection Regulation (GDPR), Payment Card Industry Data Security Standard (PCI-DSS), and the Health Insurance Portability and Accountability Act (HIPAA). This paper describes how IBM Spectrum Discover provides seamless integration of data in IBM Storage with IBM Watson Knowledge Catalog (WKC). Features include: Event-based cataloging and tagging of unstructured data across the enterprise. Automatically inspecting and classifying over 1000 unstructured data types, including genomics and imaging specific file formats. Automatically registering assets with WKC based on IBM Spectrum Discover search and filter criteria, and by using assets in IBM CP4D. Enforcing data governance policies in WKC in IBM CP4D based on insights from IBM Spectrum Discover, and using assets in IBM CP4D. 
Several in-depth use cases show examples from healthcare, life sciences, and financial services. IBM Spectrum Discover integration with WKC enables storage administrators, data stewards, and data scientists to efficiently manage, classify, and gain insights from massive amounts of data. The integration improves storage economics, helps mitigate risk, and accelerates large-scale analytics to create competitive advantage and speed critical research. 
542 |f Copyright 2020 © IBM  |g 2020 
550 |a Made available through: Safari, an O'Reilly Media Company. 
588 |a Online resource; Title from title page (viewed August 11, 2020) 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
650 0 |a Database management. 
650 0 |a IBM computers. 
650 0 |a Information retrieval  |x Computer programs. 
650 0 |a Information storage and retrieval systems. 
650 2 |a Information Systems 
650 6 |a Bases de données  |x Gestion. 
650 6 |a IBM (Ordinateurs) 
650 6 |a Systèmes d'information. 
650 7 |a Database management  |2 fast 
650 7 |a IBM computers  |2 fast 
650 7 |a Information retrieval  |x Computer programs  |2 fast 
650 7 |a Information storage and retrieval systems  |2 fast 
700 1 |a Selim, Abeer,  |e author. 
700 1 |a Patil, Anil,  |e author. 
700 1 |a Vollmar, Christopher,  |e author. 
700 1 |a De Rezende, Flavio,  |e author. 
700 1 |a Greco, Frank,  |e author. 
700 1 |a Lee, Frank,  |e author. 
700 1 |a Crawford, Isom,  |e author. 
700 1 |a Bozhinov, Ivaylo,  |e author. 
700 1 |a Wong, Joanna,  |e author. 
700 1 |a Blumert, Joshua,  |e author. 
700 1 |a Coyne, Larry,  |e author. 
710 2 |a Safari, an O'Reilly Media Company. 
856 4 0 |u https://learning.oreilly.com/library/view/~/9780738459028/?ar  |z Full text (requires prior registration with an institutional email address) 
936 |a BATCHLOAD 
994 |a 92  |b IZTAP