Cargando…

Practical big data analytics : hands-on techniques to implement enterprise analytics and machine learning using Hadoop, Spark, NoSQL and R /

Big Data analytics relates to the strategies used by enterprises to process and analyze large amounts of data to bring out hidden insights. With the help of open source and enterprise tools, such as R, Python, Hadoop, and Spark, you will learn how to effectively mine your Big Data. By the end of thi...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Dasgupta, Nataraj (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Birmingham, UK : Packt Publishing, 2018.
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)

MARC

LEADER 00000cam a2200000 i 4500
001 OR_on1021887799
003 OCoLC
005 20231017213018.0
006 m o d
007 cr unu||||||||
008 180206s2018 enka o 000 0 eng d
040 |a UMI  |b eng  |e rda  |e pn  |c UMI  |d EBLCP  |d STF  |d NLE  |d OCLCF  |d MERUC  |d TOH  |d N$T  |d TEFOD  |d YDX  |d CEF  |d KSU  |d VT2  |d DEBBG  |d OCLCQ  |d UKMGB  |d G3B  |d S9I  |d C6I  |d UAB  |d UKAHL  |d OCLCQ  |d OCLCO  |d NZAUC  |d OCLCQ 
015 |a GBB875018  |2 bnb 
016 7 |a 018754799  |2 Uk 
019 |a 1022199287 
020 |a 9781783554409  |q (electronic bk.) 
020 |a 1783554401  |q (electronic bk.) 
020 |a 1783554398 
020 |a 9781783554393 
020 |z 9781783554393 
024 3 |a 9781783554393 
029 1 |a GBVCP  |b 1014938392 
029 1 |a UKMGB  |b 018754799 
035 |a (OCoLC)1021887799  |z (OCoLC)1022199287 
037 |a CL0500000937  |b Safari Books Online 
037 |a 363BC7C8-514B-4F6D-B323-AF1C2B58474F  |b OverDrive, Inc.  |n http://www.overdrive.com 
050 4 |a QA76.585 
072 7 |a COM  |x 013000  |2 bisacsh 
072 7 |a COM  |x 014000  |2 bisacsh 
072 7 |a COM  |x 018000  |2 bisacsh 
072 7 |a COM  |x 067000  |2 bisacsh 
072 7 |a COM  |x 032000  |2 bisacsh 
072 7 |a COM  |x 037000  |2 bisacsh 
072 7 |a COM  |x 052000  |2 bisacsh 
082 0 4 |a 004.6782  |2 23 
049 |a UAMI 
100 1 |a Dasgupta, Nataraj,  |e author. 
245 1 0 |a Practical big data analytics :  |b hands-on techniques to implement enterprise analytics and machine learning using Hadoop, Spark, NoSQL and R /  |c Nataraj Dasgupta. 
264 1 |a Birmingham, UK :  |b Packt Publishing,  |c 2018. 
300 |a 1 online resource (1 volume) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a data file 
588 0 |a Online resource; title from title page (viewed February 5, 2018). 
505 0 |a Cover; Copyright and Credits; Packt Upsell; Contributors; Table of Contents; Preface; Chapter 1: Too Big or Not Too Big; What is big data?; A brief history of data; Dawn of the information age; Dr. Alan Turing and modern computing; The advent of the stored-program computer; From magnetic devices to SSDs; Why we are talking about big data now if data has always existed; Definition of big data; Building blocks of big data analytics; Types of Big Data; Structured; Unstructured; Semi-structured; Sources of big data; The 4Vs of big data. 
505 8 |a When do you know you have a big data problem and where do you start your search for the big data solution?Summary; Chapter 2: Big Data Mining for the Masses; What is big data mining?; Big data mining in the enterprise; Building the case for a Big Data strategy; Implementation life cycle; Stakeholders of the solution; Implementing the solution; Technical elements of the big data platform; Selection of the hardware stack; Selection of the software stack; Summary; Chapter 3: The Analytics Toolkit; Components of the Analytics Toolkit; System recommendations; Installing on a laptop or workstation. 
505 8 |a Installing on the cloudInstalling Hadoop; Installing Oracle VirtualBox; Installing CDH in other environments; Installing Packt Data Science Box; Installing Spark; Installing R; Steps for downloading and installing Microsoft R Open; Installing RStudio; Installing Python; Summary; Chapter 4: Big Data With Hadoop; The fundamentals of Hadoop; The fundamental premise of Hadoop; The core modules of Hadoop; Hadoop Distributed File System -- HDFS; Data storage process in HDFS; Hadoop MapReduce; An intuitive introduction to MapReduce; A technical understanding of MapReduce. 
505 8 |a Block size and number of mappers and reducersHadoop YARN; Job scheduling in YARN; Other topics in Hadoop; Encryption; User authentication; Hadoop data storage formats; New features expected in Hadoop 3; The Hadoop ecosystem; Hands-on with CDH; WordCount using Hadoop MapReduce; Analyzing oil import prices with Hive; Joining tables in Hive; Summary; Chapter 5: Big Data Mining with NoSQL; Why NoSQL?; The ACID, BASE, and CAP properties; ACID and SQL; The BASE property of NoSQL; The CAP theorem; The need for NoSQL technologies; Google Bigtable; Amazon Dynamo; NoSQL databases; In-memory databases. 
505 8 |a Columnar databasesDocument-oriented databases; Key-value databases; Graph databases; Other NoSQL types and summary of other types of databases ; Analyzing Nobel Laureates data with MongoDB; JSON format; Installing and using MongoDB; Tracking physician payments with real-world data; Installing kdb+, R, and RStudio; Installing kdb+; Installing R; Installing RStudio; The CMS Open Payments Portal; Downloading the CMS Open Payments data; Creating the Q application; Loading the data; The backend code; Creating the frontend web portal; R Shiny platform for developers. 
505 8 |a Putting it all together -- The CMS Open Payments application. 
520 |a Big Data analytics relates to the strategies used by enterprises to process and analyze large amounts of data to bring out hidden insights. With the help of open source and enterprise tools, such as R, Python, Hadoop, and Spark, you will learn how to effectively mine your Big Data. By the end of this book, you will have a clear understanding ... 
590 |a eBooks on EBSCOhost  |b EBSCO eBook Subscription Academic Collection - Worldwide 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
650 0 |a Big data. 
650 0 |a Cloud computing. 
650 0 |a Machine learning. 
650 6 |a Données volumineuses. 
650 6 |a Infonuagique. 
650 6 |a Apprentissage automatique. 
650 7 |a Database design & theory.  |2 bicssc 
650 7 |a Cloud computing.  |2 bicssc 
650 7 |a Information architecture.  |2 bicssc 
650 7 |a Data capture & analysis.  |2 bicssc 
650 7 |a Computers  |x Data Modeling & Design.  |2 bisacsh 
650 7 |a Computers  |x Data Processing.  |2 bisacsh 
650 7 |a COMPUTERS  |x Computer Literacy.  |2 bisacsh 
650 7 |a COMPUTERS  |x Computer Science.  |2 bisacsh 
650 7 |a COMPUTERS  |x Hardware  |x General.  |2 bisacsh 
650 7 |a COMPUTERS  |x Information Technology.  |2 bisacsh 
650 7 |a COMPUTERS  |x Machine Theory.  |2 bisacsh 
650 7 |a COMPUTERS  |x Reference.  |2 bisacsh 
650 7 |a Big data.  |2 fast  |0 (OCoLC)fst01892965 
650 7 |a Cloud computing.  |2 fast  |0 (OCoLC)fst01745899 
650 7 |a Machine learning.  |2 fast  |0 (OCoLC)fst01004795 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781783554393/?ar  |z Texto completo (Requiere registro previo con correo institucional) 
938 |a Askews and Holts Library Services  |b ASKH  |n BDZ0036187802 
938 |a EBL - Ebook Library  |b EBLB  |n EBL5254586 
938 |a EBSCOhost  |b EBSC  |n 1699227 
938 |a YBP Library Services  |b YANK  |n 15132106 
994 |a 92  |b IZTAP