Discover millions of ebooks, audiobooks, and so much more with a free trial

Only $11.99/month after trial. Cancel anytime.

Microsoft SQL Server 2012 with Hadoop
Microsoft SQL Server 2012 with Hadoop
Microsoft SQL Server 2012 with Hadoop
Ebook159 pages1 hour

Microsoft SQL Server 2012 with Hadoop

Rating: 1 out of 5 stars

1/5

()

Read preview

About this ebook

This book will be a step-by-step tutorial, which practically teaches working with big data on SQL Server through sample examples in increasing complexity. Microsoft SQL Server 2012 with Hadoop is specifically targeted at readers who want to cross-pollinate their Hadoop skills with SQL Server 2012 business intelligence and data analytics. A basic understanding of traditional RDBMS technologies and query processing techniques is essential.
LanguageEnglish
Release dateAug 26, 2013
ISBN9781782177999
Microsoft SQL Server 2012 with Hadoop

Related to Microsoft SQL Server 2012 with Hadoop

Related ebooks

Programming For You

View More

Related articles

Reviews for Microsoft SQL Server 2012 with Hadoop

Rating: 1 out of 5 stars
1/5

1 rating0 reviews

What did you think?

Tap to rate

Review must be at least 10 words

    Book preview

    Microsoft SQL Server 2012 with Hadoop - Debarchan Sarkar

    Table of Contents

    Microsoft SQL Server 2012 with Hadoop

    Credits

    About the Author

    About the Reviewer

    www.PacktPub.com

    Support files, eBooks, discount offers and more

    Why Subscribe?

    Free Access for Packt account holders

    Instant Updates on New Packt Books

    Preface

    What this book covers

    What you need for this book

    Who this book is for

    Conventions

    Reader feedback

    Customer support

    Errata

    Piracy

    Questions

    1. Introduction to Big Data and Hadoop

    Big Data – what's the big deal?

    The Apache Hadoop framework

    HDFS

    MapReduce

    NameNode

    Secondary NameNode

    DataNode

    JobTracker

    TaskTracker

    Hive

    Pig

    Flume

    Sqoop

    Oozie

    HBase

    Mahout

    Summary

    2. Using Sqoop – The SQL Server Hadoop Connector

    The SQL Server-Hadoop Connector

    Installation prerequisites

    A Hadoop cluster on Linux

    Installing and configuring Sqoop

    Setting up the Microsoft JDBC driver

    Downloading the SQL Server-Hadoop Connector

    Installing the SQL Server-Hadoop Connector

    The Sqoop import tool

    Importing the tables in Hive

    The Sqoop export tool

    Data types

    Summary

    3. Using the Hive ODBC Driver

    The Hive ODBC Driver

    SQL Server Integration Services (SSIS)

    SSIS as an ETL – extract, transform, and load tool

    Developing the package

    Creating the project

    Creating the Data Flow

    Creating the source Hive connection

    Creating the destination SQL connection

    Creating the Hive source component

    Creating the SQL destination component

    Mapping the columns

    Running the package

    Summary

    4. Creating a Data Model with SQL Server Analysis Services

    Configuring the SQL Linked Server to Hive

    The Linked Server script

    Using OpenQuery

    Creating a view

    Creating an SSAS data model

    Summary

    5. Using Microsoft's Self-Service Business Intelligence Tools

    PowerPivot enhancements

    Power View for Excel

    Summary

    Index

    Microsoft SQL Server 2012 with Hadoop


    Microsoft SQL Server 2012 with Hadoop

    Copyright © 2013 Packt Publishing

    All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

    Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.

    Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

    First published: August 2013

    Production Reference: 1200813

    Published by Packt Publishing Ltd.

    Livery Place

    35 Livery Street

    Birmingham B3 2PB, UK.

    ISBN 978-1-78217-798-2

    www.packtpub.com

    Cover Image by Aniket Sawant (<aniket_sawant_photography@hotmail.com>)

    Credits

    Authors

    Debarchan Sarkar

    Reviewer

    Atdhe Buja Msc

    Acquisition Editor

    James Jones

    Commissioning Editor

    Shaon Basu

    Technical Editor

    Chandni Maishery

    Project Coordinator

    Akash Poojary

    Proofreader

    Mario Cecere

    Indexer

    Rekha Nair

    Tejal Soni

    Graphics

    Abhinash Sahu

    Production Coordinator

    Nilesh R. Mohite

    Cover Work

    Nilesh R. Mohite

    About the Author

    Debarchan Sarkar is a Microsoft Data Platform engineer who hails from Calcutta, the city of joy, India. He has been a seasoned SQL Server engineer with Microsoft, India for the last six years and has now started venturing into the open source world, specifically the Apache Hadoop framework. He is a SQL Server Business Intelligence specialist with subject matter expertise in SQL Server Integration Services.

    Debarchan is currently working on another book with Apress on Microsoft's Hadoop distribution, HDInsight.

    I would like to thank my parents, Devjani Sarkar and Asok Sarkar for their continuous support and encouragement behind this book.

    About the Reviewer

    Atdhe Buja Msc is a Certified Ethical Hacker, Database Administrator (MCITP, OCA11g) and a developer with good management skills. He is a DBA at Ministry of Public Administration, Pristina, RKS, where he also manages some projects of E-Governance and eight years' experience in SQL Server.

    Atdhe is a regular columnist for UBT News, currently he holds a MSc. in Computer Science and Engineering, has a Bachelor in Management and Information and continues studies for a Bachelor degree in Political Science in UP.

    Specialized and Certified in many technologies such as SQL Server 2000, 2005, 2008, 2008 R2, Oracle 11g, CEH-Ethical Hacker, Windows Server, MS Project, System Center Operation Manager, and Web Design.

    His capabilities go beyond the above mentioned knowledge!

    I thank my wife Donika Bajrami and my family Buja for all the encouragement

    Enjoying the preview?
    Page 1 of 1