Big Data: A Tutorial-Based Approach, 1st Edition (Hardback) book cover

Big Data

A Tutorial-Based Approach, 1st Edition

By Nasir Raheem

Chapman and Hall/CRC

176 pages | 43 B/W Illus.

Purchasing Options:$ = USD
Hardback: 9780367183455
pub: 2019-02-25
SAVE ~$14.00
$70.00
$56.00
x
eBook (VitalSource) : 9780429060939
pub: 2019-02-21
from $12.50


FREE Standard Shipping!

Description

Big Data: A Tutorial-Based Approach explores the tools and techniques used to bring about the marriage of structured and unstructured data. It focuses on Hadoop Distributed Storage and MapReduce Processing by implementing (i) Tools and Techniques of Hadoop Eco System, (ii) Hadoop Distributed File System Infrastructure, and (iii) efficient MapReduce processing. The book includes Use Cases and Tutorials to provide an integrated approach that answers the ‘What’, ‘How’, and ‘Why’ of Big Data.

Features

  • Identifies the primary drivers of Big Data
  • Walks readers through the theory, methods and technology of Big Data
  • Explains how to handle the 4 V’s of Big Data in order to extract value for better business decision making
  • Shows how and why data connectors are critical and necessary for Agile text analytics
  • Includes in-depth tutorials to perform necessary set-ups, installation, configuration and execution of important tasks
  • Explains the command line as well as GUI interface to a powerful data exchange tool between Hadoop and legacy r-dbms databases

Reviews

[Big Data: A Tutorial-Based Approach] is a well thought-out guide, comprising of tutorials and graphic illustrations, that builds an integrated approach which clearly answers the ‘What’ and the ‘How’ and the ‘Why’ of ‘Big Data’. It takes the readers on an inquisitive journey through the information wonderland of data lakes and provides the tools and techniques to bring about the marriage of structured and unstructured data.

It is a must-read primer that keeps its eyes always set on the end goal of extracting useful business insight from ‘Big Data’ by fully exploiting the potential of Hadoop Distributed File System Infrastructure, MapReduce processing, and Agile Data Analytics to implement proper Data Migration, Data Ingestion, Data Management, Data Analytics, Data Visualization and Data Virtualization processes.

Last but not the least, this book finally tests the readers on their understanding of ‘Big Data’ in the form of a QUIZ.

-Dr. Sohail Subhani, Winona State University

Table of Contents

Chapter 1: Introduction to Big Data

Chapter 2: Big Data Implementation

Chapter 3: Big Data Use Cases

Chapter 4: Big Data Migration

Chapter 5: Big Data Ingestion, Integration, and Management

Chapter 6: Big Data Repository

Chapter 7: Big Data Visualization

Chapter 8: Structured & Un-structured Data Analytics

Chapter 9: Data Virtualization

Chapter 10: Cloud Computing

About the Author

Author

Nasir Raheem is an accomplished, innovative, and results-driven project manager, architect and business analyst with over 20 years of wide-ranging experience encompassing I.T Infra-structure design, planning and implementation of highly integrated systems that included Big Data (HIVE) Database Administration, Business Re-engineering, Asset & Data management (ServiceNow), Data Integration, Data Modeling, Disaster Recovery and ERP Database /Application cloning projects. He is an experienced manager of IT projects related to multi-billion dollar corporate mergers, migration, server upgrades, database upgrades, data conversion, cloning and integration of supply chain management ERP and CRM application modules at Wells Fargo Bank, WebTV (now Microsoft) and Hitachi Global Storage Technologies (now Western Digital). He is also a published author and instructor of an online course approved by Harvard University Innovation Lab, ‘March towards Big Data.’

Subject Categories

BISAC Subject Codes/Headings:
COM000000
COMPUTERS / General
COM012040
COMPUTERS / Programming / Games
COM018000
COMPUTERS / Data Processing
COM021000
COMPUTERS / Database Management / General
COM062000
COMPUTERS / Data Modeling & Design
COM089000
COMPUTERS / Data Visualization
COM091000
COMPUTERS / Cloud Computing