Learning Apache Drill: Query and Analyze Distributed Data Sources with SQL
By Charles Givre and Paul Rogers
()
Currently unavailable
Currently unavailable
About this ebook
Get up to speed with Apache Drill, an extensible distributed SQL query engine that reads massive datasets in many popular file formats such as Parquet, JSON, and CSV. Drill reads data in HDFS or in cloud-native storage such as S3 and works with Hive metastores along with distributed databases such as HBase, MongoDB, and relational databases. Drill works everywhere: on your laptop or in your largest cluster.
In this practical book, Drill committers Charles Givre and Paul Rogers show analysts and data scientists how to query and analyze raw data using this powerful tool. Data scientists today spend about 80% of their time just gathering and cleaning data. With this book, you’ll learn how Drill helps you analyze data more effectively to drive down time to insight.
- Use Drill to clean, prepare, and summarize delimited data for further analysis
- Query file types including logfiles, Parquet, JSON, and other complex formats
- Query Hadoop, relational databases, MongoDB, and Kafka with standard SQL
- Connect to Drill programmatically using a variety of languages
- Use Drill even with challenging or ambiguous file formats
- Perform sophisticated analysis by extending Drill’s functionality with user-defined functions
- Facilitate data analysis for network security, image metadata, and machine learning
Charles Givre
Mr. Charles Givre is an Apache Drill committer and has worked as a Senior Lead Data Scientist for Booz Allen Hamilton for the last six years where he works in the intersection of cyber security and data science. Mr. Givre is passionate about teaching others data science and analytic skills and has taught data science classes all over the world at conferences, universities and for clients. Most recently, Mr. Givre taught a data science class at the BlackHat conference in Las Vegas and the Center for Research in Applied Cryptography and Cyber Security at Bar Ilan University. He is a sought-after speaker and has delivered presentations at major industry conferences such as Strata-Hadoop World, BlackHat, Open Data Science Conference and others.
Related to Learning Apache Drill
Related ebooks
Parallel Computing Rating: 0 out of 5 stars0 ratingsPython for Microcontrollers: Getting Started with MicroPython Rating: 0 out of 5 stars0 ratingsOracle Database 10g RMAN Backup & Recovery Rating: 5 out of 5 stars5/5Moving To The Cloud: Developing Apps in the New World of Cloud Computing Rating: 3 out of 5 stars3/5Programming the Raspberry Pi, Third Edition: Getting Started with Python Rating: 5 out of 5 stars5/5Programming the Propeller with Spin: A Beginner's Guide to Parallel Processing Rating: 0 out of 5 stars0 ratingsProgramming Arduino Next Steps: Going Further with Sketches, Second Edition Rating: 3 out of 5 stars3/5Docker in Practice, Second Edition Rating: 0 out of 5 stars0 ratingsCNET Do-It-Yourself Home Networking Projects Rating: 0 out of 5 stars0 ratingsProgramming the Intel Galileo: Getting Started with the Arduino -Compatible Development Board Rating: 5 out of 5 stars5/5Computing Fundamentals: IC3 Edition Rating: 0 out of 5 stars0 ratingsMac OS X Leopard Portable Genius Rating: 0 out of 5 stars0 ratingsMicrosoft SQL Server 2005: A Beginner''s Guide Rating: 0 out of 5 stars0 ratingsProgramming the Raspberry Pi, Second Edition: Getting Started with Python Rating: 0 out of 5 stars0 ratingsThe Little Book of Sitecore® Tips: Volume 1 Rating: 0 out of 5 stars0 ratingsArduino and Raspberry Pi Sensor Projects for the Evil Genius Rating: 0 out of 5 stars0 ratingsAmphibionics Rating: 0 out of 5 stars0 ratingsProgramming Arduino Next Steps: Going Further with Sketches Rating: 3 out of 5 stars3/5Adobe Acrobat 7.0 QuickSteps Rating: 0 out of 5 stars0 ratings.NET Core in Action Rating: 0 out of 5 stars0 ratingsProgramming the Photon: Getting Started with the Internet of Things Rating: 5 out of 5 stars5/5Dependency Injection: Design patterns using Spring and Guice Rating: 0 out of 5 stars0 ratingsJ2EE Open Source Toolkit: Building an Enterprise Platform with Open Source Tools (Java Open Source Library) Rating: 0 out of 5 stars0 ratingsMicrosoft Virtualization: Master Microsoft Server, Desktop, Application, and Presentation Virtualization Rating: 4 out of 5 stars4/5Programming FPGAs: Getting Started with Verilog Rating: 4 out of 5 stars4/5Core Kubernetes Rating: 0 out of 5 stars0 ratingsOCP Oracle Database 11g Administration II Exam Guide: Exam 1Z0-053 Rating: 0 out of 5 stars0 ratingsPro Java Microservices with Quarkus and Kubernetes: A Hands-on Guide Rating: 0 out of 5 stars0 ratingsDocker Deep Dive: Zero to Docker in a single book Rating: 0 out of 5 stars0 ratings
Computers For You
Mastering ChatGPT: 21 Prompts Templates for Effortless Writing Rating: 5 out of 5 stars5/5The ChatGPT Millionaire Handbook: Make Money Online With the Power of AI Technology Rating: 0 out of 5 stars0 ratingsCreating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates Rating: 4 out of 5 stars4/5Grokking Algorithms: An illustrated guide for programmers and other curious people Rating: 4 out of 5 stars4/5The Insider's Guide to Technical Writing Rating: 0 out of 5 stars0 ratingsCompTIA Security+ Get Certified Get Ahead: SY0-701 Study Guide Rating: 5 out of 5 stars5/5How to Create Cpn Numbers the Right way: A Step by Step Guide to Creating cpn Numbers Legally Rating: 4 out of 5 stars4/5Procreate for Beginners: Introduction to Procreate for Drawing and Illustrating on the iPad Rating: 0 out of 5 stars0 ratingsDeep Search: How to Explore the Internet More Effectively Rating: 5 out of 5 stars5/5SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL Rating: 4 out of 5 stars4/5Mindhacker: 60 Tips, Tricks, and Games to Take Your Mind to the Next Level Rating: 4 out of 5 stars4/5Ultimate Guide to Mastering Command Blocks!: Minecraft Keys to Unlocking Secret Commands Rating: 5 out of 5 stars5/5Artificial Intelligence: The Complete Beginner’s Guide to the Future of A.I. Rating: 4 out of 5 stars4/5CompTIA Security+ Practice Questions Rating: 2 out of 5 stars2/5Remote/WebCam Notarization : Basic Understanding Rating: 3 out of 5 stars3/5Network+ Study Guide & Practice Exams Rating: 4 out of 5 stars4/5Dark Aeon: Transhumanism and the War Against Humanity Rating: 5 out of 5 stars5/5CompTIA IT Fundamentals (ITF+) Study Guide: Exam FC0-U61 Rating: 0 out of 5 stars0 ratingsElon Musk Rating: 4 out of 5 stars4/5The Professional Voiceover Handbook: Voiceover training, #1 Rating: 5 out of 5 stars5/5Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are Rating: 4 out of 5 stars4/5Hacking: Ultimate Beginner's Guide for Computer Hacking in 2018 and Beyond: Hacking in 2018, #1 Rating: 4 out of 5 stars4/5
Reviews for Learning Apache Drill
0 ratings0 reviews