Predictive Analytics Using Rattle and Qlik Sense
()
About this ebook
- Create visualizations, dashboards, and data applications with Qlik Sense and Rattle
- Load, explore, and manipulate data to Rattle to create predictions and discover hidden patterns in the data
- A step-by-step guide to learning predictive analytics in a quick and easy way
If you are a business analyst who wants to understand how to improve your data analysis and how to apply predictive analytics, then this book is ideal for you. This book assumes you have some basic knowledge of statistics and a spreadsheet editor such as Excel, but knowledge of QlikView is not required.
Related to Predictive Analytics Using Rattle and Qlik Sense
Related ebooks
QlikView Essentials Rating: 0 out of 5 stars0 ratingsLearning Qlik Sense®: The Official Guide - Second Edition Rating: 5 out of 5 stars5/5Learning Qlik® Sense: The Official Guide Rating: 0 out of 5 stars0 ratingsBuilding a Recommendation System with R Rating: 0 out of 5 stars0 ratingsLearning Tableau 2019 - Third Edition: Tools for Business Intelligence, data prep, and visual analytics, 3rd Edition Rating: 0 out of 5 stars0 ratingsLearning Tableau Rating: 0 out of 5 stars0 ratingsMaking Big Data Work for Your Business: A guide to effective Big Data analytics Rating: 0 out of 5 stars0 ratingsWeb Application Development with R Using Shiny - Second Edition Rating: 0 out of 5 stars0 ratingsPractical Data Analysis - Second Edition Rating: 0 out of 5 stars0 ratingsKNIME Essentials Rating: 0 out of 5 stars0 ratingsQlikView for Finance Rating: 0 out of 5 stars0 ratingsLearning Tableau 10 - Second Edition Rating: 4 out of 5 stars4/5Building ERP Solutions with Microsoft Dynamics NAV Rating: 0 out of 5 stars0 ratingsMastering QlikView Data Visualization Rating: 0 out of 5 stars0 ratingsGetting Started with Python Data Analysis Rating: 0 out of 5 stars0 ratingsHands-On Time Series Analysis with R: Perform time series analysis and forecasting using R Rating: 0 out of 5 stars0 ratingsQlik A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsThe Key to Successful Data Migration: Pre-Migration Activities Rating: 0 out of 5 stars0 ratingsQlikView for Developers Cookbook Rating: 0 out of 5 stars0 ratingsPYTHON FOR DATA ANALYSIS: A Practical Guide to Manipulating, Cleaning, and Analyzing Data Using Python (2023 Beginner Crash Course) Rating: 0 out of 5 stars0 ratingsSAS Visual Analytics for SAS Viya Rating: 0 out of 5 stars0 ratingsData Analysis Using SQL and Excel Rating: 3 out of 5 stars3/5Practical Data Cleaning: Bite-Size Stats, #5 Rating: 0 out of 5 stars0 ratingsBuilding the Data Warehouse Rating: 5 out of 5 stars5/5Data Modeling A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsData Visualization Strategy Standard Requirements Rating: 0 out of 5 stars0 ratingsInformation Structure Design for Databases: A Practical Guide to Data Modelling Rating: 5 out of 5 stars5/5Creating Data Stories with Tableau Public Rating: 0 out of 5 stars0 ratings
Data Visualization For You
Learning pandas - Second Edition Rating: 4 out of 5 stars4/5How to Lie with Maps Rating: 4 out of 5 stars4/5Hands-On Data Analysis with Pandas: Efficiently perform data collection, wrangling, analysis, and visualization using Python Rating: 0 out of 5 stars0 ratingsData Analytics for Beginners: Introduction to Data Analytics Rating: 4 out of 5 stars4/5The Big Book of Dashboards: Visualizing Your Data Using Real-World Business Scenarios Rating: 4 out of 5 stars4/5DAX Patterns: Second Edition Rating: 5 out of 5 stars5/5Financial Reporting with Dashboards in Power BI Rating: 0 out of 5 stars0 ratingsEffective Data Storytelling: How to Drive Change with Data, Narrative and Visuals Rating: 4 out of 5 stars4/5Teach Yourself VISUALLY Power BI Rating: 0 out of 5 stars0 ratingsLearning PySpark Rating: 0 out of 5 stars0 ratingsData Visualization: A Practical Introduction Rating: 5 out of 5 stars5/5Visual Analytics with Tableau Rating: 0 out of 5 stars0 ratingsMastering Excel: Excel Apps Rating: 3 out of 5 stars3/5Learn D3.js: Create interactive data-driven visualizations for the web with the D3.js library Rating: 0 out of 5 stars0 ratingsD3.js in Action: Data visualization with JavaScript Rating: 0 out of 5 stars0 ratingsMastering Text Mining with R Rating: 0 out of 5 stars0 ratingsProgramming ArcGIS with Python Cookbook - Second Edition Rating: 4 out of 5 stars4/5Clojure Data Analysis Cookbook - Second Edition Rating: 0 out of 5 stars0 ratingsR for Data Science Rating: 5 out of 5 stars5/5Getting to Know ArcGIS Desktop 10.8 Rating: 4 out of 5 stars4/5How to Become a Data Analyst: My Low-Cost, No Code Roadmap for Breaking into Tech Rating: 0 out of 5 stars0 ratingsCool Infographics: Effective Communication with Data Visualization and Design Rating: 4 out of 5 stars4/5Excel for Beginners 2023: A Step-by-Step and Comprehensive Guide to Master the Basics of Excel, with Formulas, Functions, & Charts Rating: 0 out of 5 stars0 ratingsSpatial Statistics Illustrated Rating: 5 out of 5 stars5/5
Reviews for Predictive Analytics Using Rattle and Qlik Sense
0 ratings0 reviews
Book preview
Predictive Analytics Using Rattle and Qlik Sense - Ferran Garcia Pagans
Table of Contents
Predictive Analytics Using Rattle and Qlik Sense
Credits
About the Author
About the Reviewers
www.PacktPub.com
Support files, eBooks, discount offers, and more
Why subscribe?
Free access for Packt account holders
Instant updates on new Packt books
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example code
Downloading the color images of this book
Errata
Piracy
Questions
1. Getting Ready with Predictive Analytics
Analytics, predictive analytics, and data visualization
Purpose of the book
Introducing R, Rattle, and Qlik Sense Desktop
Installing the environment
Downloading and installing R
Starting the R Console to test your R installation
Downloading and installing Rattle
Installing Qlik Sense Desktop
Exploring Qlik Sense Desktop
Further learning
Summary
2. Preparing Your Data
Datasets, observations, and variables
Loading data
Loading a CSV File
Transforming data
Transforming data with Rattle
Rescaling data
Using the Impute option to deal with missing values
Recoding variables
Binning
Indicator variables
Join Categories
As Category
As Numeric
Cleaning up
Exporting data
Further learning
Summary
3. Exploring and Understanding Your Data
Text summaries
Summary reports
Measures of central tendency – mean, median, and mode
Measures of dispersion – range, quartiles, variance, and standard deviation
Range
Quartiles
Variance
Standard deviation
Measures of the shape of the distribution – skewness and kurtosis
Showing missing values
Visualizing distributions
Numeric variables
Box plots
Histograms
Cumulative plots
Categorical variables
Bar plots
Mosaic plots
Correlations among input variables
The Explore Missing and Hierarchical options
Further learning
Summary
4. Creating Your First Qlik Sense Application
Customer segmentation and customer buying behavior
Loading data and creating a data model
Preparing the data
Creating a simple data app
Associative logic
Creating charts
Analyzing your data
Further learning
Summary
5. Clustering and Other Unsupervised Learning Methods
Machine learning – unsupervised and supervised learning
Cluster analysis
Centroid-based clustering the using K-means algorithm
Customer segmentation with K-means clustering
Preparing the data in Qlik Sense
Creating a customer segmentation sheet in Qlik Sense
Hierarchical clustering
Association analysis
Further learning
Summary
6. Decision Trees and Other Supervised Learning Methods
Partitioning datasets and model optimization
Decision Tree Learning
Entropy and information gain
Underfitting and overfitting
Using a Decision Tree to classify credit risks
Using Rattle to score new loan applications
Creating a Qlik Sense application to predict credit risks
Ensemble classifiers
Boosting
Random Forest
Supported Vector Machines
Other models
Linear and Logistic Regression
Neural Networks
Further learning
Summary
7. Model Evaluation
Cross-validation
Regression performance
Predicted versus Observed Plot
Measuring the performance of classifiers
Confusion matrix, accuracy, sensitivity, and specificity
Risk Chart
ROC Curve
Further learning
Summary
8. Visualizations, Data Applications, Dashboards, and Data Storytelling
Data visualization in Qlik Sense
Visualization toolbox
Creating a bar chart
The Data menu
The Sorting menu
The Add-ons menu
The Appearance menu
Data analysis, data applications, and dashboards
Qlik Sense data analysis
In-memory analysis
Associative experience
Data applications and dashboards
The DAR approach
Data storytelling with Qlik Sense
Creating a new story
Further learning
Summary
9. Developing a Complete Application
Understanding the bike rental problem
Exploring the data with Qlik Sense
Checking for temporal patterns
Visual correlation analysis
Creating a Qlik Sense App to control the activity
Using Rattle to forecast the demand
Correlation Analysis with Rattle
Building a model
Improving performance
Model evaluation
Scoring new data
Further learning
Summary
Index
Predictive Analytics Using Rattle and Qlik Sense
Predictive Analytics Using Rattle and Qlik Sense
Copyright © 2015 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.
First published: June 2015
Production reference: 1260615
Published by Packt Publishing Ltd.
Livery Place
35 Livery Street
Birmingham B3 2PB, UK.
ISBN 978-1-78439-580-3
www.packtpub.com
Credits
Author
Ferran Garcia Pagans
Reviewers
Gert Jan Feick
Miguel Ángel García
Commissioning Editor
Dipika Gaonkar
Acquisition Editor
Reshma Raman
Content Development Editor
Ajinkya Paranjape
Technical Editor
Narsimha Pai
Copy Editors
Janbal Dharmaraj
Brandt D'Mello
Kevin McGowan
Aditya Nair
Rashmi Sawant
Project Coordinator
Harshal Ved
Proofreader
Safis Editing
Indexer
Hemangini Bari
Graphics
Sheetal Aute
Production Coordinator
Komal Ramchandani
Cover Work
Komal Ramchandani
About the Author
Ferran Garcia Pagans studied software engineering at the University of Girona and Ramon Llull University. After that, he did his masters in business administration at ESADE Business School. He has 16 years of experience in the software industry, where he helped customers from different industries to create software solutions. He started his career working at the Ramon Llull University as a teacher and researcher. Then, he moved to the Volkswagen group as a software developer. After that, he worked with Oracle as a Java, SOA, and BPM specialist. Currently, he is a solution architect at Qlik, where he helps customers to achieve competitive advantages with data applications.
I would like to thank my love, Laura.
About the Reviewers
Gert Jan Feick studied informatics (language, knowledge, and interaction) at Technical University Enschede (NL). He started his career as a project manager at a medium-sized software development company, specializing in requirement analysis and project management. From 2005 onward, he was responsible for building up a company in the areas of software development, reporting and visualizations, and analysis. In 2011, he moved to Germany and became a management consultant at Infomotion GmbH, where he was responsible for the team that works on self-service and Agile BI as well as reporting and analysis.
He regularly contributes to online forums (including the Qlik Community), speaks at conventions, and writes articles. You can follow him on Twitter at @gjfeick, where he tweets about QlikView, big data, self-service and Agile BI, data visualization, and other topics in general.
Miguel Ángel García is a Business Intelligence consultant and QlikView Solutions Architect. Having worked throughout many successful QlikView implementations, from inception through implementation, and performed across a wide variety of roles on each project; his experience and skills range from presales to applications development and design, technical architecture, system administration, functional analysis, and overall project execution.
He is the co-author of QlikView 11 for Developers, Packt Publishing, which was published in November 2012, and its corresponding translation into Spanish QlikView 11 para Desarrolladores
, published in December 2013. He also worked as a technical reviewer for several other QlikView books.
He runs a QlikView consultancy, AfterSync (www.aftersync.com), through which he helps customers discover the power of the Qlik platform.
He currently holds the QlikView Designer, QlikView Developer, and QlikView System Administrator certifications, issued by Qlik, for Versions 9, 10, and 11.
www.PacktPub.com
Support files, eBooks, discount offers, and more
For support files and downloads related to your book, please visit www.PacktPub.com.
Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at
At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.
https://www2.packtpub.com/books/subscription/packtlib
Do you need instant solutions to your IT questions? PacktLib is Packt's online digital book library. Here, you can search, access, and read Packt's entire library of books.
Why subscribe?
Fully searchable across every book published by Packt
Copy and paste, print, and bookmark content
On demand and accessible via a web browser
Free access for Packt account holders
If you have an account with Packt at www.PacktPub.com, you can use this to access PacktLib today and view 9 entirely free books. Simply use your login credentials for immediate access.
Instant updates on new Packt books
Get notified! Find out when new books are published by following @PacktEnterprise on Twitter or the Packt Enterprise Facebook page.
Preface
Today, a lot of organizations are investing in improving their analytics skills and tools. They know that, by analyzing their data, they can improve the performance of their business process and achieve real value from that.
The objective of this book is to introduce you to predictive analytics and data visualization by developing some example applications. We'll use R and Rattle to create the predictive model and Qlik Sense to create a data application that allows business users to explore their data.
We use Rattle and Qlik Sense to avoid learn programming and focus on predictive analytics and data visualizations concepts.
What this book covers
Chapter 1, Getting Ready with Predictive Analytics, explains the key concepts of predictive analytics and how to install our learning environments, such as Qlik Sense, R, and Rattle.
Chapter 2, Preparing Your Data, covers the basic characteristics of datasets, how to load a dataset into Rattle, and how to transform it. As data is the basic ingredient of analytics, preparing the data to analyze it is the first step.
Chapter 3, Exploring and Understanding Your Data, introduces you to Exploratory Data Analysis (EDA) using Rattle. EDA is a statistical approach to understanding data.
Chapter 4, Creating Your First Qlik Sense Application, discusses how to load a dataset into Qlik Sense, create a data model and basic charts, and explore data using Qlik Sense. Using Exploratory Data Analysis and Rattle to understand our data is a very mathematical approach. Usually, business users prefer a more intuitive approach, such as Qlik Sense
Chapter 5, Clustering and Other Unsupervised Learning Methods, covers machine, supervised, and unsupervised learning but focuses on unsupervised learning We create an example application using K-means, a classic machine learning algorithm. We use Rattle to process the dataset and then we load it into Qlik Sense to present the data to the business user.
Chapter 6, Decision Trees and Other Supervised Learning Methods, focuses on supervised learning. It helps you create an example application using Decision Tree Learning. We use Rattle to process the data and Qlik Sense to communicate with it.
Chapter 7, Model Evaluation, explains how to evaluate the performance of a model. Model evaluation is very useful to improve the performance.
Chapter 8, Visualizations, Data Applications, Dashboards, and Data Storytelling, focuses on data visualization and data storytelling using Qlik Sense.
Chapter 9, Developing a Complete Application, explains how to create a complete application. It covers how to explore the data, create a predictive model, and create a data application.
What you need for this book
To install our learning environment and complete the examples, you need a 64-bit computer:
OS: Windows 7, Windows 8, or 8.1
Processor: Intel Core2 Duo or higher
Memory: 4 GB or more
.NET Framework: 4.0
Security: Local admin privileges to install R, Rattle, and Qlik Sense.
Who this book is for
If you are a business analyst who wants to understand how to improve your data analysis and how to apply predictive analytics, then this book is ideal for you. This book assumes that you to have some basic knowledge of QlikView, but no knowledge of implementing predictive analysis with QlikView. It would also be helpful to be familiar with the basic concepts of statistics and a spreadsheet editor, such as Excel.
Conventions
In this book, you will find a number of text styles that distinguish between different kinds of information. Here are some examples of these styles and an explanation of their meaning.
Code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles are shown as follows: After you have downloaded it, type library(rattle) and R will load the Rattle package into memory, and you will be able to use it.
A block of code is set as follows:
If Purpose = 'Education' AND Sex = 'male' AND Age > 25 Then No Default
If Purpose = 'Education' AND Sex = 'male' AND Age < 25 Then Yes Default
New terms and important words are shown in bold. Words that you see on the screen, for example, in menus or dialog boxes, appear in the text like this: Don't be afraid, we will use two software tools Rattle and Qlik Sense Desktop in order to avoid complex code.
Note
Warnings or important notes appear in a box like this.
Tip
Tips and tricks appear like this.
Reader feedback
Feedback from our readers is always welcome. Let us know what you think about this book—what you liked or disliked. Reader feedback is important for us as it helps us develop titles that you will really get the most out of.
To send us general feedback, simply e-mail <feedback@packtpub.com>, and mention the book's title in the subject of your message.
If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, see our author guide at www.packtpub.com/authors.
Customer support
Now that you are the proud owner of a Packt book, we have a number of things to help you to get the most from your purchase.
Downloading the example code
You can download the example code files from your account at http://www.packtpub.com for all the Packt Publishing books you have purchased. If you purchased this book elsewhere, you can visit