LEADER |
00000cam a2200000 a 4500 |
001 |
OR_on1134076868 |
003 |
OCoLC |
005 |
20231017213018.0 |
006 |
m o d |
007 |
cr un|---aucuu |
008 |
200104s2020 cau ob 001 0 eng d |
040 |
|
|
|a EBLCP
|b eng
|e pn
|c EBLCP
|d GW5XE
|d N$T
|d OCLCF
|d ESU
|d LQU
|d UPM
|d UKMGB
|d COO
|d OCLCQ
|d OCLCO
|d COM
|d OCLCQ
|d OCLCO
|
015 |
|
|
|a GBC066506
|2 bnb
|
016 |
7 |
|
|a 019659230
|2 Uk
|
019 |
|
|
|a 1137833011
|
020 |
|
|
|a 9781484254615
|q (electronic bk.)
|
020 |
|
|
|a 1484254619
|q (electronic bk.)
|
020 |
|
|
|z 9781484254608
|q (print)
|
020 |
|
|
|z 1484254600
|
024 |
8 |
|
|a 10.1007/978-1-4842-5
|
029 |
1 |
|
|a AU@
|b 000066483290
|
029 |
1 |
|
|a UKMGB
|b 019659230
|
035 |
|
|
|a (OCoLC)1134076868
|z (OCoLC)1137833011
|
037 |
|
|
|a com.springer.onix.9781484254615
|b Springer Nature
|
050 |
|
4 |
|a QA76.9.I52
|
082 |
0 |
4 |
|a 001.4/226
|2 23
|
049 |
|
|
|a UAMI
|
100 |
1 |
|
|a Feasel, Kevin.
|
245 |
1 |
0 |
|a PolyBase revealed :
|b data virtualization with SQL Server, Hadoop, Apache Spark, and beyond /
|c Kevin Feasel.
|
260 |
|
|
|a Berkeley, CA :
|b Apress L.P.,
|c 2020.
|
300 |
|
|
|a 1 online resource (320 pages)
|
336 |
|
|
|a text
|b txt
|2 rdacontent
|
337 |
|
|
|a computer
|b c
|2 rdamedia
|
338 |
|
|
|a online resource
|b cr
|2 rdacarrier
|
588 |
0 |
|
|a Print version record.
|
505 |
0 |
|
|a Intro -- Table of Contents -- About the Author -- About the Technical Reviewer -- Acknowledgments -- Introduction -- Chapter 1: Installing and Configuring PolyBase -- Choose the Form of Your PolyBase -- Installing PolyBase Standalone-Windows -- Installing PolyBase Scale-Out Group -- Building a Configuration File -- Installing Without a GUI -- Installing PolyBase Standalone-Linux -- Configuring PolyBase -- Configuring a Client -- Enable PolyBase -- Mandatory Configuration -- Scale-Out Group Configuration -- Troubleshooting Common Errors -- Testing for Success -- Conclusion
|
505 |
8 |
|
|a Chapter 2: Connecting to Azure Blob Storage -- Making Preparations in Azure -- Create a Storage Account -- Upload Data -- Building a Link -- Credentials -- External Data Sources -- External File Formats -- Delimited Files -- Flat File Compression -- Define an External File Format -- External Tables -- Querying External Data -- Inserting into External Tables -- PolyBase Data Insertion Considerations -- PolyBase Is Insert-Only -- Insert Only into Folders -- Conclusion -- Chapter 3: Connecting to Hadoop -- Hadoop Prerequisites -- Preparing Files in HDFS -- Gather Configuration Settings
|
505 |
8 |
|
|a Configuring SQL Server -- Update PolyBase Configuration Files -- External PolyBase Objects for Hadoop -- Credentials -- External Data Sources -- External File Formats -- Delimited Files -- RCFile -- ORC -- Parquet -- External Tables -- Querying Data in Hadoop -- Row Counts with Police Incident Data -- Newlines and Quotes with Fire Incident Data -- Going Faster with Parking Violations Data -- Inserting Data into Hadoop -- Conclusion -- Chapter 4: Using Predicate Pushdown to Enhance Query Performance -- The Importance of Predicate Pushdown -- Predicate Pushdown in PolyBase
|
505 |
8 |
|
|a Diving into Predicate Pushdown -- Packet Capture Without Predicate Pushdown -- Packet Capture with Predicate Pushdown -- When Predicate Pushdown Makes Sense -- Small Data: Raleigh Police Incidents -- Bigger Data: New York City Parking Violations -- Limitations in Pushdown-Eligible Predicates -- Limitations on Pushdown with Complex Filters -- MapReduce and Pushdown in Summary -- Conclusion -- Chapter 5: Common Hadoop and Blob Storage Integration Errors -- Finding the Real Logger -- PolyBase Log Files -- DMS Errors -- DMS Movement -- DWEngine Errors -- DWEngine Movement -- DWEngine Server
|
505 |
8 |
|
|a DMS PolyBase -- DWEngine PolyBase -- Hadoop Log Files -- Job Tracker -- YARN Resource Manager -- JobHistory UI -- NameNode Logs -- Log Files -- Configuration Issues -- SQL Server Configuration -- Check External Resources -- Check SQL Server Configuration Files -- Hadoop-Side Configuration -- Invalid User Permissions or Missing Account -- Could Not Obtain Block -- Host File Pointing to 127.0.0.1 -- Kerberos Should Be On or Off, Not Both -- PolyBase and Dockerized Data Nodes -- Data Issues -- Structural Mismatch -- Unsupported Characters or Formats -- PolyBase Data Limitations -- Curate Your Data
|
500 |
|
|
|a Predicate Pushdown Failure
|
520 |
|
|
|a Harness the power of PolyBase data virtualization software to make data from a variety of sources easily accessible through SQL queries while using the T-SQL skills you already know and have mastered. PolyBase Revealed shows you how to use the PolyBase feature of SQL Server 2019 to integrate SQL Server with Azure Blob Storage, Apache Hadoop, other SQL Server instances, Oracle, Cosmos DB, Apache Spark, and more. You will learn how PolyBase can help you reduce storage and other costs by avoiding the need for ETL processes that duplicate data in order to make it accessible from one source. PolyBase makes SQL Server into that one source, and T-SQL is your golden ticket. The book also covers PolyBase scale-out clusters, allowing you to distribute PolyBase queries among several SQL Server instances, thus improving performance. With great flexibility comes great complexity, and this book shows you where to look when queries fail, complete with coverage of internals, troubleshooting techniques, and where to find more information on obscure cross-platform errors. Data virtualization is a key target for Microsoft with SQL Server 2019. This book will help you keep your skills current, remain relevant, and build new business and career opportunities around Microsoft's product direction. You will: install and configure PolyBase as a stand-alone service, or unlock its capabilities with a scale-out cluster; understand how PolyBase interacts with outside data sources while presenting their data as regular SQL Server tables; write queries combining data from SQL Server, Apache Hadoop, Oracle, Cosmos DB, Apache Spark, and more; troubleshoot PolyBase queries using SQL Server Dynamic Management Views; tune PolyBase queries using statistics and execution plans; and solve common business problems, including "cold storage" of infrequently accessed data and simplifying ETL jobs.
|
504 |
|
|
|a Includes bibliographical references and index.
|
590 |
|
|
|a O'Reilly
|b O'Reilly Online Learning: Academic/Public Library Edition
|
650 |
|
0 |
|a Information visualization.
|
650 |
|
6 |
|a Visualisation de l'information.
|
650 |
|
7 |
|a Information visualization
|2 fast
|
776 |
0 |
8 |
|i Print version:
|a Feasel, Kevin.
|t PolyBase Revealed : Data Virtualization with SQL Server, Hadoop, Apache Spark, and Beyond.
|d Berkeley, CA : Apress L.P., ©2020
|z 9781484254608
|
856 |
4 |
0 |
|u https://learning.oreilly.com/library/view/~/9781484254615/?ar
|z Full text (requires prior registration with an institutional email address)
|
938 |
|
|
|a ProQuest Ebook Central
|b EBLB
|n EBL6000731
|
938 |
|
|
|a EBSCOhost
|b EBSC
|n 2335578
|
994 |
|
|
|a 92
|b IZTAP
|