Cargando…

Beginning Apache Spark using Azure Databricks : unleashing large cluster analytics in the Cloud /

Analyze vast amounts of data in record time using Apache Spark with Databricks in the Cloud. Learn the fundamentals, and more, of running analytics on large clusters in Azure and AWS, using Apache Spark with Databricks on top. Discover how to squeeze the most value out of your data at a mere fractio...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Ilijason, Robert (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: [Berkeley, California?] : Apress, [2020]
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)

MARC

LEADER 00000cam a2200000 a 4500
001 OR_on1159594472
003 OCoLC
005 20231017213018.0
006 m o d
007 cr |n|||||||||
008 200625s2020 cau ob 001 0 eng d
040 |a YDX  |b eng  |e pn  |c YDX  |d GW5XE  |d EBLCP  |d LQU  |d OCLCF  |d UMI  |d N$T  |d NLW  |d LIP  |d UKMGB  |d UKAHL  |d OCL  |d OCLCO  |d VLB  |d OCLCO  |d OCLCQ  |d UPM  |d OCLCQ  |d OCLCO 
015 |a GBC0G5588  |2 bnb 
016 7 |a 019828359  |2 Uk 
019 |a 1159163302  |a 1159170260  |a 1161999430  |a 1163811513  |a 1164673551  |a 1175706816  |a 1182533295  |a 1183411500  |a 1184031470  |a 1198377315  |a 1203886910  |a 1226248471 
020 |a 9781484257814  |q (electronic bk.) 
020 |a 1484257812  |q (electronic bk.) 
020 |z 1484257804 
020 |z 9781484257807 
024 7 |a 10.1007/978-1-4842-5781-4.  |2 doi 
024 8 |a 10.1007/978-1-4842-5 
029 1 |a AU@  |b 000067301061 
029 1 |a AU@  |b 000067526433 
029 1 |a UKMGB  |b 019828359 
035 |a (OCoLC)1159594472  |z (OCoLC)1159163302  |z (OCoLC)1159170260  |z (OCoLC)1161999430  |z (OCoLC)1163811513  |z (OCoLC)1164673551  |z (OCoLC)1175706816  |z (OCoLC)1182533295  |z (OCoLC)1183411500  |z (OCoLC)1184031470  |z (OCoLC)1198377315  |z (OCoLC)1203886910  |z (OCoLC)1226248471 
037 |a CL0501000147  |b Safari Books Online 
050 4 |a QA76.585  |b .I55 2020eb 
072 7 |a KJQ.  |2 bicssc 
072 7 |a BUS070030.  |2 bisacsh 
072 7 |a KJQ.  |2 thema 
082 0 4 |a 004.67/82  |2 23 
049 |a UAMI 
100 1 |a Ilijason, Robert,  |e author 
245 1 0 |a Beginning Apache Spark using Azure Databricks :  |b unleashing large cluster analytics in the Cloud /  |c Robert Ilijason. 
264 1 |a [Berkeley, California?] :  |b Apress,  |c [2020] 
264 4 |c ©2020 
300 |a 1 online resource (xvii, 274 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file 
347 |b PDF 
505 0 |a Chapter 1: Introduction to Large-Scale Data Analytics -- Chapter 2: Spark and Databricks -- Chapter 3: Getting Started with Databricks -- Chapter 4: Workspaces, Clusters, and Notebooks -- Chapter 5: Getting Data into Databricks -- Chapter 6: Querying Data Using SQL -- Chapter 7: The Power of Python -- Chapter 8: ETL and Advanced Data Wrangling -- Chapter 9: Connecting to and from Afar -- Chapter 10: Running in Production -- Chapter 11: Bits and Pieces. 
520 |a Analyze vast amounts of data in record time using Apache Spark with Databricks in the Cloud. Learn the fundamentals, and more, of running analytics on large clusters in Azure and AWS, using Apache Spark with Databricks on top. Discover how to squeeze the most value out of your data at a mere fraction of what classical analytics solutions cost, while at the same time getting the results you need, incrementally faster. This book explains how the confluence of these pivotal technologies gives you enormous power, and cheaply, when it comes to huge datasets. You will begin by learning how cloud infrastructure makes it possible to scale your code to large amounts of processing units, without having to pay for the machinery in advance. From there you will learn how Apache Spark, an open source framework, can enable all those CPUs for data analytics use. Finally, you will see how services such as Databricks provide the power of Apache Spark, without you having to know anything about configuring hardware or software. By removing the need for expensive experts and hardware, your resources can instead be allocated to actually finding business value in the data. This book guides you through some advanced topics such as analytics in the cloud, data lakes, data ingestion, architecture, machine learning, and tools, including Apache Spark, Apache Hadoop, Apache Hive, Python, and SQL. Valuable exercises help reinforce what you have learned. What You Will Learn Discover the value of big data analytics that leverage the power of the cloud Get started with Databricks using SQL and Python in either Microsoft Azure or AWS Understand the underlying technology, and how the cloud and Spark fit into the bigger picture See how these tools are used in the real world Run basic analytics, including machine learning, on billions of rows at a fraction of a cost or free This book is for data engineers, data scientists, and cloud architects who want or need to run advanced analytics in the cloud. It is assumed that the reader has data experience, but perhaps minimal exposure to Apache Spark and Azure Databricks. The book is also recommended for people who want to get started in the analytics field, as it provides a strong foundation. Robert Ilijason is a 20-year veteran in the business intelligence (BI) segment. He has worked as a contractor for some of Europes biggest companies and has conducted large-scale analytics projects within the areas of retail, telecom, banking, government, and more. Robert has seen his share of analytic trends come and go over the years, but unlike most of them, he strongly believes that Apache Spark in the cloud, especially with Azure Databricks, is a game changer. 
504 |a Includes bibliographical references and index. 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
630 0 0 |a Spark (Electronic resource : Apache Software Foundation) 
630 0 7 |a Spark (Electronic resource : Apache Software Foundation)  |2 fast 
650 0 |a Cloud computing. 
650 0 |a Big data. 
650 6 |a Infonuagique. 
650 6 |a Données volumineuses. 
650 7 |a Microsoft programming.  |2 bicssc 
650 7 |a Computer programming  |x software development.  |2 bicssc 
650 7 |a Business mathematics & systems.  |2 bicssc 
650 7 |a Computers  |x Programming  |x Microsoft Programming.  |2 bisacsh 
650 7 |a Computers  |x Programming  |x Open Source.  |2 bisacsh 
650 7 |a Business & Economics  |x Industries  |x Computer Industry.  |2 bisacsh 
650 7 |a Cloud computing  |2 fast 
650 7 |a Big data  |2 fast 
650 7 |a Computer programming  |2 fast 
650 7 |a Microsoft software  |2 fast 
650 7 |a Open source software  |2 fast 
776 0 8 |i Print version:  |z 1484257804  |z 9781484257807  |w (OCoLC)1137853990 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781484257814/?ar  |z Texto completo (Requiere registro previo con correo institucional) 
938 |a Askews and Holts Library Services  |b ASKH  |n AH37842936 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL6227078 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL6227125 
938 |a EBSCOhost  |b EBSC  |n 2498662 
938 |a YBP Library Services  |b YANK  |n 16819006 
994 |a 92  |b IZTAP