Cargando…

Pro Microsoft HDInsight : Hadoop on Windows /

Pro Microsoft HDInsight is a complete guide to deploying and using Apache Hadoop on the Microsoft Windows Azure Platforms. The information in this book enables you to process enormous volumes of structured as well as non-structured data easily using HDInsight, which is Microsoft's own distribut...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Sarkar, Debarchan
Formato: Electrónico eBook
Idioma:Inglés
Publicado: New York : Apress : Distributed to the Book trade worldwide by Springer Science+Business Media New York, ©2014.
Colección:The expert's voice in big data
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)

MARC

LEADER 00000cam a2200000Ia 4500
001 OR_ocn874139155
003 OCoLC
005 20231017213018.0
006 m o d
007 cr unu||||||||
008 140321s2014 nyua o 001 0 eng d
040 |a UMI  |b eng  |e pn  |c UMI  |d S4S  |d COO  |d IDEBK  |d SFB  |d DEBBG  |d B24X7  |d GW5XE  |d E7B  |d EBLCP  |d DEBSZ  |d OCLCQ  |d YDXCP  |d OCLCF  |d OCLCQ  |d Z5A  |d LIV  |d MERUC  |d OCLCQ  |d ESU  |d NUI  |d TXI  |d VT2  |d IOG  |d N$T  |d REB  |d VLB  |d OCLCQ  |d OCLCO  |d CEF  |d INT  |d U3W  |d AU@  |d OCLCA  |d OCLCQ  |d OCLCO  |d WYU  |d YOU  |d UWO  |d OCLCQ  |d OCLCO  |d UAB  |d OCLCQ  |d OCLCO  |d OCLCQ  |d BRF  |d DCT  |d HAGCC  |d OCLCO  |d INARC  |d OCL  |d OCLCQ 
019 |a 878829397  |a 912391889  |a 966388219  |a 1026460243  |a 1048128869  |a 1065701533  |a 1067146340  |a 1082300263  |a 1204034604  |a 1300218431 
020 |a 9781430260561  |q (electronic bk.) 
020 |a 1430260564  |q (electronic bk.) 
020 |z 1430260556 
020 |z 9781430260554 
024 7 |a 10.1007/978-1-4302-6056-1  |2 doi 
029 1 |a AU@  |b 000053308459 
029 1 |a AU@  |b 000061961947 
029 1 |a CHNEW  |b 000887660 
029 1 |a CHVBK  |b 374465770 
029 1 |a DEBBG  |b BV041792646 
029 1 |a DEBBG  |b BV042031984 
029 1 |a DEBBG  |b BV042987601 
029 1 |a DEBBG  |b BV043609419 
029 1 |a DEBSZ  |b 407738525 
029 1 |a DEBSZ  |b 414174488 
029 1 |a AU@  |b 000065314570 
035 |a (OCoLC)874139155  |z (OCoLC)878829397  |z (OCoLC)912391889  |z (OCoLC)966388219  |z (OCoLC)1026460243  |z (OCoLC)1048128869  |z (OCoLC)1065701533  |z (OCoLC)1067146340  |z (OCoLC)1082300263  |z (OCoLC)1204034604  |z (OCoLC)1300218431 
037 |a CL0500000406  |b Safari Books Online 
050 4 |a QA76.9.D5  |b S375 2014 
072 7 |a COM  |x 000000  |2 bisacsh 
072 7 |a UNF  |2 bicssc 
072 7 |a UYQE  |2 bicssc 
082 0 4 |a 006.312  |2 23 
049 |a UAMI 
100 1 |a Sarkar, Debarchan. 
245 1 0 |a Pro Microsoft HDInsight :  |b Hadoop on Windows /  |c Debarchan Sarkar. 
260 |a New York :  |b Apress :  |b Distributed to the Book trade worldwide by Springer Science+Business Media New York,  |c ©2014. 
300 |a 1 online resource (1 volume) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
490 0 |a The expert's voice in big data 
588 0 |a Online resource; title from title page (Safari, viewed Mar. 13, 2014). 
500 |a Includes index. 
520 |a Pro Microsoft HDInsight is a complete guide to deploying and using Apache Hadoop on the Microsoft Windows Azure Platforms. The information in this book enables you to process enormous volumes of structured as well as non-structured data easily using HDInsight, which is Microsoft's own distribution of Apache Hadoop. Furthermore, the blend of Infrastructure as a Service (IaaS) and Platform as a Service (PaaS) offerings available through Windows Azure lets you take advantage of Hadoop's processing power without the worry of creating, configuring, maintaining, or managing your own cluster. With the data explosion that is soon to happen, the open source Apache Hadoop Framework is gaining traction, and it benefits from a huge ecosystem that has risen around the core functionalities of the Hadoop distributed file system (HDFS(Tm)) and Hadoop Map Reduce. Pro Microsoft HDInsight equips you with the knowledge, confidence, and technique to configure and manage this ecosystem on Windows Azure. The book is an excellent choice for anyone aspiring to be a data scientist or data engineer, putting you a step ahead in the data mining field. Guides you through installation and configuration of an HDInsight cluster on Windows Azure Provides clear examples of configuring and executing Map Reduce jobs Helps you consume data and diagnose errors from the Windows Azure HDInsight Service. 
505 0 |a At a Glance; Contents; About the Author; About the Technical Reviewers; Acknowledgments; Introduction; Chapter 1: Introducing HDInsight; What Is Big Data, and Why Now?; How Is Big Data Different?; Is Big Data the Right Solution for You?; The Apache Hadoop Ecosystem; Microsoft HDInsight: Hadoop on Windows; Combining HDInsight with Your Business Processes; Summary; Chapter 2: Understanding Windows Azure HDInsight Service; Microsoft's Cloud-Computing Platform; Windows Azure HDInsight Service; HDInsight Versions; Cluster Version 2.1; Cluster Version 1.6; Storage Location Options. 
505 8 |a Azure storage accountsAccessing containers; Understanding the Windows Azure Storage Blob; Uploading Data to Windows Azure Storage Blob; Windows Azure Flat Network Storage; Summary; Chapter 3: Provisioning Your HDInsight Service Cluster; Creating the Storage Account; Creating a SQL Azure Database; Deploying Your HDInsight Cluster; Customizing Your Cluster Creation; Configuring the Cluster User and Hive/Oozie Storage; Choosing Your Storage Account; Finishing the Cluster Creation; Monitoring the Cluster; Configuring the Cluster; Summary; Chapter 4: Automating HDInsight Cluster Provisioning. 
505 8 |a Using the Hadoop .NET SDKAdding the NuGet Packages; Connecting to Your Subscription; Coding the Application; Using the PowerShell cmdlets for HDInsight; Command-Line Interface (CLI); Summary; Chapter 5: Submitting Jobs to Your HDInsight Cluster; Using the Hadoop .NET SDK; Adding the References; Submitting a Custom MapReduce Job; Adding the MapReduce Classes; Running the MapReduce Job; Submitting the wordcount MapReduce Job; Submitting a Hive Job; Adding the References; Creating the Hive Queries; Running the Hive Job; Monitoring Job Status; Using PowerShell; Writing Script; Executing The Job. 
505 8 |a Using MRRunnerSummary; Chapter 6: Exploring the HDInsight Name Node; Accessing the HDInsight Name Node; Hadoop Command Line; The Hive Console; The Sqoop Console; The Pig Console; Hadoop Web Interfaces; Hadoop MapReduce Status; The Name Node Status Portal; The TaskTracker Portal; HDInsight Windows Services; Installation Directory; Summary; Chapter 7: Using Windows Azure HDInsight Emulator; Installing the Emulator; Verifying the Installation; Using the Emulator; Future Directions; Summary; Chapter 8: Accessing HDInsight over Hive and ODBC; Hive: The Hadoop Data Warehouse; Working with Hive. 
505 8 |a Creating Hive TablesLoading Data; Querying Tables with HiveQL; Hive Storage; The Hive ODBC Driver; Installing the Driver; Testing the Driver; Connecting to the HDInsight Emulator; Configuring a DSN-less Connection; Summary; Chapter 9: Consuming HDInsight from Self-Service BI Tools; PowerPivot Enhancements; Creating a Stock Report; Power View for Excel; Power BI: The Future; Summary; Chapter 10: Integrating HDInsight with SQL Server Integration Services; SSIS as an ETL Tool; Creating the Project; Creating the Data Flow; Creating the Source Hive Connection. 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
630 0 0 |a Apache Hadoop. 
630 0 0 |a Microsoft Windows (Computer file) 
630 0 7 |a Apache Hadoop.  |2 fast  |0 (OCoLC)fst01911570 
630 0 7 |a Microsoft Windows (Computer file)  |2 fast  |0 (OCoLC)fst01367862 
650 0 |a Big data. 
650 0 |a Electronic data processing. 
650 0 |a Data mining. 
650 2 |a Electronic Data Processing 
650 2 |a Data Mining 
650 6 |a Informatique. 
650 6 |a Exploration de données (Informatique) 
650 6 |a Données volumineuses. 
650 7 |a COMPUTERS  |x General.  |2 bisacsh 
650 7 |a Data mining.  |2 fast  |0 (OCoLC)fst00887946 
650 7 |a Big data.  |2 fast  |0 (OCoLC)fst01892965 
650 7 |a Electronic data processing.  |2 fast  |0 (OCoLC)fst00906956 
773 0 |t Springer eBooks 
776 0 8 |i Print version:  |z 9781430260554 
776 0 8 |i Print version:  |z 9781430260561 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781430260554/?ar  |z Texto completo (Requiere registro previo con correo institucional) 
938 |a Internet Archive  |b INAR  |n promicrosofthdin0000sark 
938 |a Books 24x7  |b B247  |n bks00062994 
938 |a EBL - Ebook Library  |b EBLB  |n EBL1694189 
938 |a ebrary  |b EBRY  |n ebr10845289 
938 |a EBSCOhost  |b EBSC  |n 1173864 
938 |a ProQuest MyiLibrary Digital eBook Collection  |b IDEB  |n cis28233594 
938 |a YBP Library Services  |b YANK  |n 11729235 
994 |a 92  |b IZTAP