Book Name: Big Data Analytics: A Hands-On Approach
Author: Arshdeep Bahga, Vijay Madisetti
Publisher: Arshdeep Bahga & Vijay Madisetti
File size: 108 Mb
File format: PDF
Big Data Analytics: A Hands-On Approach Pdf Book Description:
The publication is organized into three chief components, including a total of twelve characters. Part I provides an introduction to large data, software of large data, and large data analytics and science patterns and architectures. A publication data analytics and science program system design methodology is suggested and its recognition through usage of open-ended large data frameworks is clarified. This methodology refers to large data analytics software as understanding of this suggested Alpha, Beta, Gamma and Delta versions, which contain resources and frameworks for gathering and ingesting data from several sources to the huge data analytics infrastructure, distributed filesystems and non-relational (NoSQL) databases for information storage, processing frameworks for batch and real time data, functioning databases, net and visualization frameworks. This new methodology creates the pedagogical base of the publication.
Part II introduces the reader to different tools and frameworks for large data analytics, along with also the architectural and programming elements of the frameworks as used in the proposed design methodology. We chose Python because the main programming language with this particular book. Other languages, besides Python, are also readily used inside the Big Data heap explained within this publication. We explain tools and frameworks such as Data Acquisition such as Publish-subscribe messaging frameworks like Apache Kafka and Amazon Kinesis, Source-Sink connectors like Apache Flume, Database Connectors like Apache Sqoop, Messaging Queues for example RabbitMQ, ZeroMQ, RestMQ, Amazon SQS and habit REST-based connectors along with WebSocket-based connectors.
The reader is introduced into Hadoop Distributed File System (HDFS) and also HBase non-relational database. The batch evaluation chapter offers an comprehensive study of frameworks like Hadoop-MapReduce, Pig, Oozie, Spark and Solr. From the chapter on interactive , we explain with the assistance of illustrations, using frameworks and services like Spark SQL, Hive, Amazon Redshift and Google BigQuery. The chapter about serving databases and internet frameworks offer an introduction to favorite relational and non-relational databases (like MySQL, Amazon DynamoDB, Cassandra, and MongoDB) along with also the Django Python web framework. Part III focuses complex topics on large data such as analytics algorithms and information visualization tools. The chapter on information visualization refers to examples of producing a variety of kinds of visualizations using frameworks like Lightning, pygal and Seaborn.
DMCA Disclaimer: This site complies with DMCA Digital Copyright Laws. Please bear in mind that we do not own copyrights to these books. We’re sharing this material with our audience ONLY for educational purpose. We highly encourage our visitors to purchase original books from the respected publishers. If someone with copyrights wants us to remove this content, please contact us immediately.All books on the freebooksmedical.com are free and NOT HOSTED ON OUR WEBSITE. If you feel that we have violated your copyrights, then please contact us immediately (click here).