Spark GraphX in Action
By Michael Malak and Robin East
About this ebook
Spark GraphX in Action starts out with an overview of Apache Spark and the GraphX graph processing API. This example-based tutorial then teaches you how to configure GraphX and how to use it interactively. Along the way, you'll collect practical techniques for enhancing applications and applying machine learning algorithms to graph data.
Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.
About the Technology
GraphX is a powerful graph processing API for the Apache Spark analytics engine that lets you draw insights from large datasets. GraphX gives you unprecedented speed and capacity for running massively parallel algorithms and machine learning algorithms.
About the Book
Spark GraphX in Action begins with the big picture of what graphs can be used for. This example-based tutorial teaches you how to use GraphX interactively. You'll start with a crystal-clear introduction to building big data graphs from regular data, and then explore the problems and possibilities of implementing graph algorithms and architecting graph processing pipelines. Along the way, you'll collect practical techniques for enhancing applications and applying machine learning algorithms to graph data.
What's Inside
- Understanding graph technology
- Using the GraphX API
- Developing algorithms for big graphs
- Machine learning with graphs
- Graph visualization
About the Reader
Readers should be comfortable writing code. Experience with Apache Spark and Scala is not required.
About the Authors
Michael Malak has worked on Spark applications for Fortune 500 companies since early 2013. Robin East has worked as a consultant to large organizations for over 15 years and is a data scientist at Worldpay.
Table of Contents
PART 1 SPARK AND GRAPHS
- Two important technologies: Spark and graphs
- GraphX quick start
- Some fundamentals
PART 2 CONNECTING VERTICES
- GraphX Basics
- Built-in algorithms
- Other useful graph algorithms
- Machine learning
PART 3 OVER THE ARC
- The missing algorithms
- Performance and monitoring
- Other languages and tools