Discover millions of ebooks, audiobooks, and so much more with a free trial

Only $11.99/month after trial. Cancel anytime.

Getting Started with Big Data Query using Apache Impala
Getting Started with Big Data Query using Apache Impala
Getting Started with Big Data Query using Apache Impala
Ebook73 pages25 minutes

Getting Started with Big Data Query using Apache Impala

Rating: 0 out of 5 stars

()

Read preview

About this ebook

This book is designed for anyone who learns how to get started with Apache Impala. The book covers SQL queries and data manipulation for Apache Impala. The following is a list of highlight topics:
* Introduction to Apache Impala
* Working with Apache Impala Shell
* SQL Querying with Apache Hue and Apache Impala
* Loading Dataset to Apache Impala
* Basic SQL Query for Apache Impala
* Joining Query and Subquery on Apache Impala
* Partition Data on Apache Impala
* Apache Impala Database Programming with Java
LanguageEnglish
PublisherLulu.com
Release dateFeb 6, 2021
ISBN9781716108396
Getting Started with Big Data Query using Apache Impala

Read more from Agus Kurniawan

Related to Getting Started with Big Data Query using Apache Impala

Related ebooks

Databases For You

View More

Related articles

Reviews for Getting Started with Big Data Query using Apache Impala

Rating: 0 out of 5 stars
0 ratings

0 ratings0 reviews

What did you think?

Tap to rate

Review must be at least 10 words

    Book preview

    Getting Started with Big Data Query using Apache Impala - Agus Kurniawan

    Contact

    1. Introduction to Apache Impala

    1.1 Introduction

    Apache Impala is a modern, open source, distributed SQL query engine for Apache Hadoop. With Impala, we can query data, whether stored in HDFS, Apache Hive or Apache HBase – including SELECT, JOIN, and aggregate functions. You can find the official project on this link, https://impala.apache.org/. In this book, we learn how to perform queries on Apache Impala.

    1.2 Installing Apache Impala

    In this section, I use Cloudera Manager to install Apache Impala. You can install Apache Impala to Linux manually. You can see my Cloudera Manager in Figure below.

    m1-1

    To add Hadoop service using Cloudera Manager, you can can click Add Server on a context menu as shown in Figure below.

    m1-2

    After clicked, you can install Apache Impala. Make sure you also install HDFS, HBASE and Hue.

    m1-3

    Once installed, we can start to work with Apache Impala.

    1.3 Setting up Lab Demo

    You can set up Apache Impala with Cloudera Manager or own Linux. For demo, I use Apache Impala on Cloudera environment. I deployed Apache Impala on Ubuntu Linux.

    2. Working with Apache Impala Shell

    2.1 Introduction

    Apache Impala provide a service and a shell. In this chapter, we learn how to work with Apache Impala shell. To show Impala shell version, you can type this command.

    You will see Impala shell on your Terminal. You can see my Impala shell version is shown in Figure below.

    m2-1

    Next, we will work with Impala shell.

    2.2 Connecting to Apache Impala Service

    To start Impala shell, you open a Terminal on your Apache Impala server. Then, type this command.

    This will connect to your

    Enjoying the preview?
    Page 1 of 1