Pentaho Analytics for MongoDB Cookbook
By Joel Latino and Harris Ward
About this ebook
About This Book
- Create reports and stunning dashboards with MongoDB data
- Accelerate data access and maximize productivity with unique features of Pentaho for MongoDB
- A step-by-step recipe-based guide for making full use of Pentaho suite tools with MongoDB
Who This Book Is For
This book is intended for data architects and developers with a basic level of knowledge of MongoDB. Familiarity with Pentaho is not expected.
What You Will Learn
- Extract, load, and transform data from MongoDB collections to other datasources
- Design Pentaho Reports using different types of connections for MongoDB
- Create an OLAP Mondrian schema for MongoDB
- Explore your MongoDB data using Pentaho Analyzer
- Utilize the drag and drop web interface to create dashboards
- Use Kettle Thin JDBC with MongoDB for analysis
- Integrate advanced dashboards with MongoDB using different types of connections
- Publish and run a report on Pentaho BI server using a web interface
In Detail
MongoDB is an open source, schemaless NoSQL database system. Pentaho, a well-known open source analysis tool, provides high performance, high availability, and easy scalability for large sets of data. The various Pentaho features for MongoDB are designed to make organizations more agile and scalable, and to give applications better flexibility, faster performance, and lower costs.
Whether you are brand new to online learning or a seasoned expert, this book will provide you with the skills you need to create turnkey analytic solutions that deliver insight and drive value for your organization.
The book begins by taking you through Pentaho Data Integration and how it works with MongoDB. You will then be taken through the Kettle Thin JDBC Driver, which enables a Java application to interact with a database. This is followed by exploring a MongoDB collection using Pentaho Instaview and creating reports with MongoDB as a datasource using Pentaho Report Designer. The book then teaches you how to explore and visualize your data in Pentaho BI Server using Pentaho Analyzer. You will then learn how to create advanced dashboards with your data. The book concludes by highlighting contributions of the Pentaho Community.
Style and approach
A comprehensive, recipe-based guide to taking full advantage of Pentaho Analytics for MongoDB.
Book preview
Pentaho Analytics for MongoDB Cookbook - Joel Latino
Table of Contents
Pentaho Analytics for MongoDB Cookbook
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Support files, eBooks, discount offers, and more
Why Subscribe?
Free Access for Packt account holders
Preface
Pentaho Installation
What this book covers
What you need for this book
Who this book is for
Sections
Getting ready
How to do it…
How it works…
There's more…
See also
Conventions
Reader feedback
Customer support
Downloading the example code
Errata
Piracy
Questions
1. PDI and MongoDB
Introduction
Learning basic operations with Pentaho Data Integration
Getting ready
How to do it…
How it works…
There's more…
Migrating data from the RDBMS to MongoDB
Getting ready
How to do it…
How it works…
There's more…
How to reuse the properties of a MongoDB connection
Loading data from MongoDB to MySQL
Getting ready
How to do it…
How it works…
Migrating data from files to MongoDB
Getting ready
How to do it…
How it works…
Exporting MongoDB data using the aggregation framework
Getting ready
How to do it…
How it works…
See also
MongoDB Map/Reduce using the User Defined Java Class step and MongoDB Java Driver
Getting ready
How to do it…
How it works…
There's more…
Working with jobs and filtering MongoDB data using parameters and variables
Getting ready
How to do it…
How it works…
2. The Thin Kettle JDBC Driver
Introduction
Using a transformation as a data service
Getting ready
How to do it…
How it works…
See also
Running the Carte server in a single instance
Getting ready
How to do it…
How it works…
There's more…
Running the Pentaho Data Integration server in a single instance
Getting ready
How to do it…
How it works…
Define a connection using a SQL Client (SQuirreL SQL)
Getting ready
How to do it…
How it works…
There's more…
3. Pentaho Instaview
Introduction
Creating an analysis view
Getting ready
How to do it…
How it works…
Modifying Instaview transformations
Getting ready
How to do it…
How it works…
Modifying the Instaview model
Getting ready
How to do it…
How it works…
See also
Exploring, saving, deleting, and opening analysis reports
Getting ready
How to do it…
How it works…
See also
4. A MongoDB OLAP Schema
Introduction
Creating a date dimension
Getting ready
How to do it…
How it works…
There's more…
Creating an Orders cube
Getting ready
How to do it…
How it works…
Creating the customer and product dimensions
Getting ready
How to do it…
How it works…
See also
Saving and publishing a Mondrian schema
Getting ready
How to do it…
How it works…
There's more…
See also
Creating a Mondrian 4 physical schema
Getting ready
How to do it…
How it works…
Creating a Mondrian 4 cube
Getting ready
How to do it…
How it works…
Publishing a Mondrian 4 schema
Getting ready
How to do it…
How it works…
5. Pentaho Reporting
Introduction
Copying the MongoDB JDBC library
Getting ready
How to do it…
How it works…
Connecting to MongoDB using Reporting Wizard
Getting ready
How to do it…
How it works…
Connecting to MongoDB via PDI
Getting ready
How to do it…
How it works…
Adding a chart to a report
Getting ready
How to do it…
How it works…
Adding parameters to a report
Getting ready
How to do it…
How it works…
Adding a formula to a report
Getting ready
How to do it…
How it works…
Grouping data in reports
Getting ready
How to do it…
How it works…
Creating subreports
Getting ready
How to do it…
How it works…
Creating a report with MongoDB via Java
Getting ready
How to do it…
How it works…
Publishing a report to the Pentaho server
Getting ready
How to do it…
How it works…
Running a report in the Pentaho server
Getting ready
How to do it…
How it works…
6. The Pentaho BI Server
Introduction
Importing Foodmart MongoDB sample data
Getting ready
How to do it…
How it works…
There's more…
Creating a new analysis view using Pentaho Analyzer
Getting ready
How to do it…
How it works…
There's more…
Creating a dashboard using Pentaho Dashboard Designer
Getting ready
How to do it…
How it works…
See also
7. Pentaho Dashboards
Introduction
Copying the MongoDB JDBC library
Getting ready
How to do it…
How it works…
Importing a sample repository
Getting ready
How to do it…
How it works…
Using a transformation data source
Getting ready
How to do it…
How it works…
Using a BeanShell data source
Getting ready
How to do it…
How it works…
Using Pentaho Analyzer for MongoDB data source
Getting ready
How to do it…
How it works…
Using a Thin Kettle data source
Getting ready
How to do it…
How it works…
Defining dashboard layouts
Getting ready
How to do it…
How it works…
Creating a Dashboard Table component
Getting ready
How to do it…
How it works…
Creating a Dashboard line chart component
Getting ready
How to do it…
How it works…
8. Pentaho Community Contributions
Introduction
The PDI MongoDB Delete Step
Getting ready
How to do it…
How it works…
The PDI MongoDB GridFS Output Step
Getting ready
How to do it…
How it works…
The PDI MongoDB Map/Reduce Output step
Getting ready
How to do it…
How it works…
See also
The PDI MongoDB Lookup step
Getting ready
How to do it…
How it works…
There's more…
Index
Pentaho Analytics for MongoDB Cookbook
Copyright © 2015 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the authors, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.
First published: December 2015
Production reference: 1181215
Published by Packt Publishing Ltd.
Livery Place
35 Livery Street
Birmingham B3 2PB, UK.
ISBN 978-1-78355-327-3
www.packtpub.com
Credits
Authors
Joel Latino
Harris Ward
Reviewers
Rio Bastian
Mark Kromer
Commissioning Editor
Usha Iyer
Acquisition Editor
Nikhil Karkal
Content Development Editor
Anish Dhurat
Technical Editor
Menza Mathew
Copy Editor
Vikrant Phadke
Project Coordinator
Bijal Patel
Proofreader
Safis Editing
Indexer
Rekha Nair
Production Coordinator
Manu Joseph
Cover Work
Manu Joseph
About the Authors
Joel Latino was born in Ponte de Lima, Portugal, in 1989. He has been working in the IT industry since 2010, mostly as a software developer and BI developer.
He started his career at a Portuguese company and specialized in strategic planning, consulting, implementation, and maintenance of enterprise software that is fully adapted to its customers' needs.
He earned his graduate degree in informatics engineering from the School of Technology and Management of Viana do Castelo Polytechnic Institute.
In 2014, he moved to Edinburgh, Scotland, to work for Ivy Information Systems, a highly specialized open source BI company in the United Kingdom.
Joel mainly focuses on open source web technology, databases, and business intelligence, and is fascinated by mobile technologies. He is responsible for developing some plugins for Pentaho, such as Android and Apple push notification steps, and a lot of other plugins under Ivy Information Systems.
I would like to thank my family for supporting me throughout my career and endeavors.
Harris Ward has been working in the IT sector since 2004, initially developing websites using LAMP and moving on to business intelligence in 2006. His first role was based in Germany on a product called InfoZoom, where he was introduced to the world of business intelligence. He later discovered open source business intelligence tools and dedicated the last 9 years to not only working on developing solutions, but also working to expand the Pentaho community with the help of other committed members.
Harris has worked as a Pentaho consultant over the past 7 years under Ambient BI. Later, he decided to form Ivy Information Systems Scotland, a company focused on delivering more advanced Pentaho solutions as well as developing a wide range of Pentaho plugins that you can find in the marketplace today.
About the Reviewers
Rio Bastian is a happy software engineer. He has worked on various IT projects. He is interested in business intelligence, data integration, web services (using WSO2 API or ESB), and tuning SQL and Java code. He has also been a Pentaho business intelligence trainer for several companies in Indonesia and Malaysia. Currently, Rio is working on developing one of Garuda Indonesia airline's e-commerce channel web service systems at PT. Aero Systems Indonesia.
In his spare time, he tries to share his experience in software development through his personal blog at altanovela.wordpress.com. You can reach him on Skype at rio.bastian or e-mail him at
Mark Kromer has been working in the database, analytics, and business intelligence industry for 20 years, with a focus on big data and NoSQL since 2011. As a product manager, he has been responsible for the Pentaho MongoDB Analytics product road map for Pentaho, the graph database strategy for DataStax, and the business intelligence road map for Microsoft's vertical solutions. Mark is currently a big data cloud architect and is a frequent contributor to the TDWI BI magazine, MSDN Magazine, and SQL Server Magazine. You can keep up with his speaking and writing schedule at http://www.kromerbigdata.com.
www.PacktPub.com
Support files, eBooks, discount offers, and more
For support files and downloads related to your book, please visit www.PacktPub.com.
Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at
At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.
https://www2.packtpub.com/books/subscription/packtlib
Do you need instant solutions to your IT questions? PacktLib is Packt's online digital book library. Here, you can search, access, and read Packt's entire library of books.
Why Subscribe?
Fully searchable across every book published by Packt
Copy and paste, print, and bookmark content
On demand and accessible via a web browser
Free Access for Packt account holders
If you have an account with Packt at www.PacktPub.com, you can use this to access PacktLib today and view 9 entirely free books. Simply use your login credentials for immediate access.
Preface
With increasing interest in big data technologies, Pentaho, a well-known open source analysis tool, and MongoDB, the most famous NoSQL database, have attracted particular attention. The Pentaho features for MongoDB are end-to-end: they span everything from data storage in MongoDB clusters to visualization in a dashboard or in a report sent by e-mail, which is a welcome change for enterprise processes. It's a powerful combination of scalable data storage, data transformation, and analysis.
Pentaho Analytics for MongoDB Cookbook explains the features of Pentaho for MongoDB in detail through clear and practical recipes that you can quickly apply to your solutions. Each chapter guides you through the different components of Pentaho: data integration, OLAP, reporting, dashboards, and analysis. This book is a guide to getting started with Pentaho and provides all of the practical information about the connectivity of Pentaho for MongoDB.
Pentaho Installation
Pentaho is a commercial open source product, which means there are two versions available: Pentaho Community Edition (CE) and Pentaho Enterprise Edition (EE). To be able to cover all the recipes of this book, please choose Pentaho EE. You can download the trial version, available at http://www.pentaho.com. In this