Cargando…

FUZZY DATA MATCHING WITH SQL enhancing data quality and query performance /

If you were handed two different but related sets of data, what tools would you use to find the matches? What if all you had was SQL SELECT access to a database? In this practical book, author Jim Lehmer provides best practices, techniques, and tricks to help you import, clean, match, score, and thi...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Lehmer, Jim (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Sebastopol, CA : O'Reilly Media, Inc., [2024]
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)

MARC

LEADER 00000cam a22000007a 4500
001 OR_on1402019474
003 OCoLC
005 20231017213018.0
006 m o d
007 cr |n|||||||||
008 231008s2023 cau o 001 0 eng d
040 |a YDX  |b eng  |c YDX  |d ORMDA 
020 |a 9781098152246  |q (electronic bk.) 
020 |a 1098152247  |q (electronic bk.) 
020 |z 1098152271 
020 |z 9781098152277 
035 |a (OCoLC)1402019474 
037 |a 9781098152260  |b O'Reilly Media 
050 4 |a QA76.73.S67 
082 0 4 |a 005.75/6  |2 23/eng/20231010 
049 |a UAMI 
100 1 |a Lehmer, Jim,  |e author. 
245 1 0 |a FUZZY DATA MATCHING WITH SQL  |h [electronic resource] :  |b enhancing data quality and query performance /  |c Jim Lehmer. 
260 |a Sebastopol, CA :  |b O'Reilly Media, Inc.,  |c [2024] 
300 |a 1 online resource 
500 |a Includes index. 
520 |a If you were handed two different but related sets of data, what tools would you use to find the matches? What if all you had was SQL SELECT access to a database? In this practical book, author Jim Lehmer provides best practices, techniques, and tricks to help you import, clean, match, score, and think about heterogeneous data using SQL. DBAs, programmers, business analysts, and data scientists will learn how to identify and remove duplicates, parse strings, extract data from XML and JSON, generate SQL using SQL, regularize data and prepare datasets, and apply data quality and ETL approaches for finding the similarities and differences between various expressions of the same data. Full of real-world techniques, the examples in the book contain working code. You'll learn how to: Identity and remove duplicates in two different datasets using SQL Regularize data and achieve data quality using SQL Extract data from XML and JSON Generate SQL using SQL to increase your productivity Prepare datasets for import, merging, and better analysis using SQL Report results using SQL Apply data quality and ETL approaches to finding similarities and differences between various expressions of the same data. 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
650 0 |a SQL (Computer program language) 
650 0 |a Database management. 
776 0 8 |i Print version:  |z 1098152271  |z 9781098152277  |w (OCoLC)1391324998 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781098152260/?ar  |z Texto completo (Requiere registro previo con correo institucional) 
938 |a YBP Library Services  |b YANK  |n 305747968 
938 |a YBP Library Services  |b YANK  |n 305747968 
994 |a 92  |b IZTAP