Ebook646 pages2 hours

SAS Viya: The Python Perspective

Name: SAS Viya: The Python Perspective
Author: Kevin D. Smith
ISBN: 9781629608839

By Kevin D. Smith and Xiangxiang Meng

Rating: 0 out of 5 stars

()

Read preview

About this ebook

Learn how to access analytics from SAS Cloud Analytic Services (CAS) using Python and the SAS Viya platform. SAS Viya : The Python Perspective is an introduction to using the Python client on the SAS Viya platform. SAS Viya is a high-performance, fault-tolerant analytics architecture that can be deployed on both public and private cloud infrastructures. While SAS Viya can be used by various SAS applications, it also enables you to access analytic methods from SAS, Python, Lua, and Java, as well as through a REST interface using HTTP or HTTPS. This book focuses on the perspective of SAS Viya from Python. SAS Viya is made up of multiple components. The central piece of this ecosystem is SAS Cloud Analytic Services (CAS). CAS is the cloud-based server that all clients communicate with to run analytical methods. The Python client is used to drive the CAS component directly using objects and constructs that are familiar to Python programmers. Some knowledge of Python would be helpful before using this book; however, there is an appendix that covers the features of Python that are used in the CAS Python client. Knowledge of CAS is not required to use this book. However, you will need to have a CAS server set up and running to execute the examples in this book. With this book, you will learn how to:

Install the required components for accessing CAS from Python
Connect to CAS, load data, and run simple analyses
Work with CAS using APIs familiar to Python users
Grasp general CAS workflows and advanced features of the CAS Python client

SAS Viya : The Python Perspective covers topics that will be useful to beginners as well as experienced CAS users. It includes examples from creating connections to CAS all the way to simple statistics and machine learning, but it is also useful as a desktop reference.

Skip carousel

LanguageEnglish

PublisherSAS Institute

Release dateFeb 8, 2018

ISBN9781629608839

Author

Kevin D. Smith

Kevin D. Smith has been a software developer at SAS since 1997. He began his career in the development of PROC TEMPLATE and other underlying ODS technologies, including authoring two books on the subjects. He is now heavily involved in client-side work on the SAS Viya platform. This includes development of the R, Python, and Lua SWAT packages, as well as higher-level packages built on top of the foundation created by SWAT.

Related authors

Skip carousel

Related to SAS Viya

Related ebooks

Skip carousel

SAS Viya: The R Perspective
Ebook
SAS Viya: The R Perspective
byYue Qi
Rating: 0 out of 5 stars
0 ratings
Machine Learning with SAS Viya
Ebook
Machine Learning with SAS Viya
bySAS Institute Inc.
Rating: 0 out of 5 stars
0 ratings
SAS Programming for Enterprise Guide Users, Second Edition
Ebook
SAS Programming for Enterprise Guide Users, Second Edition
byNeil Constable
Rating: 0 out of 5 stars
0 ratings
An Introduction to SAS Visual Analytics: How to Explore Numbers, Design Reports, and Gain Insight into Your Data
Ebook
An Introduction to SAS Visual Analytics: How to Explore Numbers, Design Reports, and Gain Insight into Your Data
byTricia Aanderud
Rating: 5 out of 5 stars
5/5
SAS Text Analytics for Business Applications: Concept Rules for Information Extraction Models
Ebook
SAS Text Analytics for Business Applications: Concept Rules for Information Extraction Models
byTeresa Jade
Rating: 0 out of 5 stars
0 ratings
Segmentation Analytics with SAS Viya: An Approach to Clustering and Visualization
Ebook
Segmentation Analytics with SAS Viya: An Approach to Clustering and Visualization
byRandall S. Collica
Rating: 0 out of 5 stars
0 ratings
PROC DOCUMENT by Example Using SAS
Ebook
PROC DOCUMENT by Example Using SAS
byMichael Tuchman
Rating: 0 out of 5 stars
0 ratings
Predictive Modeling with SAS Enterprise Miner: Practical Solutions for Business Applications, Third Edition
Ebook
Predictive Modeling with SAS Enterprise Miner: Practical Solutions for Business Applications, Third Edition
byKattamuri S. Sarma
Rating: 0 out of 5 stars
0 ratings
End-to-End Data Science with SAS: A Hands-On Programming Guide
Ebook
End-to-End Data Science with SAS: A Hands-On Programming Guide
byJames Gearheart
Rating: 0 out of 5 stars
0 ratings
PROC REPORT by Example: Techniques for Building Professional Reports Using SAS: Techniques for Building Professional Reports Using SAS
Ebook
PROC REPORT by Example: Techniques for Building Professional Reports Using SAS: Techniques for Building Professional Reports Using SAS
byLisa Fine
Rating: 0 out of 5 stars
0 ratings
PROC SQL: Beyond the Basics Using SAS, Third Edition
Ebook
PROC SQL: Beyond the Basics Using SAS, Third Edition
byKirk Paul Lafler
Rating: 0 out of 5 stars
0 ratings
SAS Interview Questions You'll Most Likely Be Asked
Ebook
SAS Interview Questions You'll Most Likely Be Asked
byVibrant Publishers
Rating: 0 out of 5 stars
0 ratings
Business Analytics Using SAS Enterprise Guide and SAS Enterprise Miner: A Beginner's Guide
Ebook
Business Analytics Using SAS Enterprise Guide and SAS Enterprise Miner: A Beginner's Guide
byOlivia Parr-Rud
Rating: 0 out of 5 stars
0 ratings
SAS Visual Analytics for SAS Viya
Ebook
SAS Visual Analytics for SAS Viya
bySAS Institute Inc.
Rating: 0 out of 5 stars
0 ratings
The SAS Programmer's PROC REPORT Handbook: ODS Companion
Ebook
The SAS Programmer's PROC REPORT Handbook: ODS Companion
byJane Eslinger
Rating: 0 out of 5 stars
0 ratings
Smart Data Discovery Using SAS Viya: Powerful Techniques for Deeper Insights
Ebook
Smart Data Discovery Using SAS Viya: Powerful Techniques for Deeper Insights
byFelix Liao
Rating: 0 out of 5 stars
0 ratings
SAS Statistics Data Analysis Certification Questions: Unofficial SAS Data analysis Certification and Interview Questions
Ebook
SAS Statistics Data Analysis Certification Questions: Unofficial SAS Data analysis Certification and Interview Questions
byEquity Press
Rating: 5 out of 5 stars
5/5
The Data Detective's Toolkit: Cutting-Edge Techniques and SAS Macros to Clean, Prepare, and Manage Data
Ebook
The Data Detective's Toolkit: Cutting-Edge Techniques and SAS Macros to Clean, Prepare, and Manage Data
byKim Chantala
Rating: 0 out of 5 stars
0 ratings
Interactive Reports in SAS® Visual Analytics: Advanced Features and Customization
Ebook
Interactive Reports in SAS® Visual Analytics: Advanced Features and Customization
byNicole Ball
Rating: 0 out of 5 stars
0 ratings
Learning Apache Cassandra - Second Edition
Ebook
Learning Apache Cassandra - Second Edition
bySandeep Yarabarla
Rating: 0 out of 5 stars
0 ratings
Fundamentals of Programming in SAS: A Case Studies Approach
Ebook
Fundamentals of Programming in SAS: A Case Studies Approach
byJames Blum
Rating: 0 out of 5 stars
0 ratings
Mastering RStudio – Develop, Communicate, and Collaborate with R
Ebook
Mastering RStudio – Develop, Communicate, and Collaborate with R
byHillebrand Julian
Rating: 0 out of 5 stars
0 ratings
SAS Statistics by Example
Ebook
SAS Statistics by Example
byRon Cody
Rating: 5 out of 5 stars
5/5
Insightful Data Visualization with SAS Viya
Ebook
Insightful Data Visualization with SAS Viya
byFalko Schulz
Rating: 0 out of 5 stars
0 ratings
SAS Certification Prep Guide: Statistical Business Analysis Using SAS9
Ebook
SAS Certification Prep Guide: Statistical Business Analysis Using SAS9
byJoni N. Shreve, PhD
Rating: 0 out of 5 stars
0 ratings
Cody's Data Cleaning Techniques Using SAS, Third Edition
Ebook
Cody's Data Cleaning Techniques Using SAS, Third Edition
byRon Cody
Rating: 5 out of 5 stars
5/5
Deep Learning for Numerical Applications with SAS
Ebook
Deep Learning for Numerical Applications with SAS
byHenry Bequet
Rating: 0 out of 5 stars
0 ratings
Practical and Efficient SAS Programming: The Insider's Guide
Ebook
Practical and Efficient SAS Programming: The Insider's Guide
byMartha Messineo
Rating: 0 out of 5 stars
0 ratings
Unstructured Data Analysis: Entity Resolution and Regular Expressions in SAS
Ebook
Unstructured Data Analysis: Entity Resolution and Regular Expressions in SAS
byMatthew Windham
Rating: 0 out of 5 stars
0 ratings
Elementary Statistics Using SAS
Ebook
Elementary Statistics Using SAS
bySandra D. Schlotzhauer
Rating: 0 out of 5 stars
0 ratings

Applications & Software For You

Skip carousel

Hacks for TikTok: 150 Tips and Tricks for Editing and Posting Videos, Getting Likes, Keeping Your Fans Happy, and Making Money
Ebook
Hacks for TikTok: 150 Tips and Tricks for Editing and Posting Videos, Getting Likes, Keeping Your Fans Happy, and Making Money
byKyle Brach
Rating: 5 out of 5 stars
5/5
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
Ebook
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
byNigel Tillery
Rating: 0 out of 5 stars
0 ratings
Learn to Code. Get a Job. The Ultimate Guide to Learning and Getting Hired as a Developer.
Ebook
Learn to Code. Get a Job. The Ultimate Guide to Learning and Getting Hired as a Developer.
byGwendolyn Faraday
Rating: 5 out of 5 stars
5/5
Logic Pro X For Dummies
Ebook
Logic Pro X For Dummies
byGraham English
Rating: 0 out of 5 stars
0 ratings
Sound Design for Filmmakers: Film School Sound
Ebook
Sound Design for Filmmakers: Film School Sound
byMurray Stiller
Rating: 5 out of 5 stars
5/5
Excel : The Ultimate Comprehensive Step-By-Step Guide to the Basics of Excel Programming: 1
Ebook
Excel : The Ultimate Comprehensive Step-By-Step Guide to the Basics of Excel Programming: 1
byKevin Clark
Rating: 5 out of 5 stars
5/5
CompTIA Certification: The Ultimate Guide To Discover CompTIA. Certified Quickly And Easily Passing The Certification Exam. Real Practice Test With Detailed Screenshots, Answers And Explanations
Ebook
CompTIA Certification: The Ultimate Guide To Discover CompTIA. Certified Quickly And Easily Passing The Certification Exam. Real Practice Test With Detailed Screenshots, Answers And Explanations
byDavid Mayer
Rating: 0 out of 5 stars
0 ratings
GarageBand For Dummies
Ebook
GarageBand For Dummies
byBob LeVitus
Rating: 5 out of 5 stars
5/5
Synthesizer Cookbook: How to Use Filters: Sound Design for Beginners, #2
Ebook
Synthesizer Cookbook: How to Use Filters: Sound Design for Beginners, #2
byScreech House
Rating: 3 out of 5 stars
3/5
Hilarious Jokes for Minecrafters: Mobs, Creepers, Skeletons, and More
Ebook
Hilarious Jokes for Minecrafters: Mobs, Creepers, Skeletons, and More
byMichele C. Hollow
Rating: 1 out of 5 stars
1/5
How to Create Cpn Numbers the Right way: A Step by Step Guide to Creating cpn Numbers Legally
Ebook
How to Create Cpn Numbers the Right way: A Step by Step Guide to Creating cpn Numbers Legally
byAlex Parkinson
Rating: 4 out of 5 stars
4/5
The Unofficial Guide to Open Broadcaster Software: OBS: The World's Most Popular Free Live-Streaming Application
Ebook
The Unofficial Guide to Open Broadcaster Software: OBS: The World's Most Popular Free Live-Streaming Application
byPaul Richards
Rating: 0 out of 5 stars
0 ratings
Adobe Photoshop: A Complete Course and Compendium of Features
Ebook
Adobe Photoshop: A Complete Course and Compendium of Features
byStephen Laskevitch
Rating: 5 out of 5 stars
5/5
The Little SAS Book: A Primer, Sixth Edition
Ebook
The Little SAS Book: A Primer, Sixth Edition
byLora D. Delwiche
Rating: 5 out of 5 stars
5/5
iPhone Photography For Dummies
Ebook
iPhone Photography For Dummies
byMark Hemmings
Rating: 0 out of 5 stars
0 ratings
Adobe Illustrator: A Complete Course and Compendium of Features
Ebook
Adobe Illustrator: A Complete Course and Compendium of Features
byJason Hoppe
Rating: 0 out of 5 stars
0 ratings
Blender 3D Basics Beginner's Guide Second Edition
Ebook
Blender 3D Basics Beginner's Guide Second Edition
byGordon Fisher
Rating: 5 out of 5 stars
5/5
Mastering QuickBooks 2020: The ultimate guide to bookkeeping and QuickBooks Online
Ebook
Mastering QuickBooks 2020: The ultimate guide to bookkeeping and QuickBooks Online
byCrystalynn Shelton
Rating: 0 out of 5 stars
0 ratings
Start Your Own Podcast Business: Your Step-By-Step Guide to Success
Ebook
Start Your Own Podcast Business: Your Step-By-Step Guide to Success
byThe Staff of Entrepreneur Media
Rating: 5 out of 5 stars
5/5
Experts' Guide to OneNote
Ebook
Experts' Guide to OneNote
byJeremy P. Jones
Rating: 5 out of 5 stars
5/5
GarageBand Basics: The Complete Guide to GarageBand: Music
Ebook
GarageBand Basics: The Complete Guide to GarageBand: Music
byAventuras De Viaje
Rating: 0 out of 5 stars
0 ratings
Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data
Ebook
Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data
byEMC Education Services
Rating: 0 out of 5 stars
0 ratings
Vocal Rescue: Rediscover the Beauty, Power and Freedom in Your Singing
Ebook
Vocal Rescue: Rediscover the Beauty, Power and Freedom in Your Singing
byLois Alba
Rating: 4 out of 5 stars
4/5
Affinity Photo How To
Ebook
Affinity Photo How To
byRobin Whalley
Rating: 0 out of 5 stars
0 ratings
How Do I Do That In InDesign?
Ebook
How Do I Do That In InDesign?
byDave Clayton
Rating: 5 out of 5 stars
5/5
Mastering ChatGPT
Ebook
Mastering ChatGPT
byCharles J. Jones
Rating: 0 out of 5 stars
0 ratings
Six Figure Blogging In 3 Months
Ebook
Six Figure Blogging In 3 Months
byShekhar Mishra
Rating: 4 out of 5 stars
4/5
Adobe InDesign CC: A Complete Course and Compendium of Features
Ebook
Adobe InDesign CC: A Complete Course and Compendium of Features
byStephen Laskevitch
Rating: 0 out of 5 stars
0 ratings
iPhone X Hacks, Tips and Tricks: Discover 101 Awesome Tips and Tricks for iPhone XS, XS Max and iPhone X
Ebook
iPhone X Hacks, Tips and Tricks: Discover 101 Awesome Tips and Tricks for iPhone XS, XS Max and iPhone X
byDavid Cromwell
Rating: 3 out of 5 stars
3/5
FL Studio Cookbook
Ebook
FL Studio Cookbook
byShaun Friedman
Rating: 4 out of 5 stars
4/5

Related podcast episodes

Skip carousel

Ali Ghodsi – The Past, Present, and Future of Big Data – [Founder’s Field Guide, EP.18]: My Guest today is Ali Ghodsi, founder and CEO of Databricks, a data analytics platform for data scientists and developers. He's also the founder of Apache Spark, the open-source project that Databricks is built on, and is an accomplished researcher at...
Podcast episode
Ali Ghodsi – The Past, Present, and Future of Big Data – [Founder’s Field Guide, EP.18]: My Guest today is Ali Ghodsi, founder and CEO of Databricks, a data analytics platform for data scientists and developers. He's also the founder of Apache Spark, the open-source project that Databricks is built on, and is an accomplished researcher at...
byInvest Like the Best with Patrick O'Shaughnessy
0 ratings
0% found this document useful
[DataFramed Careers Series #2] What Makes a Great Data Science Portfolio
Podcast episode
[DataFramed Careers Series #2] What Makes a Great Data Science Portfolio
byDataFramed
0 ratings
0% found this document useful
Simplifying Data Integration Through Eventual Connectivity - Episode 91: An interview about a new pattern for data integration that reduces the amount of effort required to find connections in numerous data sets
Podcast episode
Simplifying Data Integration Through Eventual Connectivity - Episode 91: An interview about a new pattern for data integration that reduces the amount of effort required to find connections in numerous data sets
byData Engineering Podcast
0 ratings
0% found this document useful
Renee M. P. Teate, "SQL for Data Scientists: A Beginner's Guide for Building Datasets for Analysis" (John Wiley & Sons, 2021): An interview with Renee M. P. Teate
Podcast episode
Renee M. P. Teate, "SQL for Data Scientists: A Beginner's Guide for Building Datasets for Analysis" (John Wiley & Sons, 2021): An interview with Renee M. P. Teate
byNew Books in Science, Technology, and Society
0 ratings
0% found this document useful
MLA 020 Kubeflow: Conversation with Dirk-Jan Kubeflow (vs cloud native solutions like SageMaker) - Data Scientist at Dept Agency . (From the website:) The Machine Learning Toolkit for Kubernetes. The Kubeflow project is dedicated to making deployments of...
Podcast episode
MLA 020 Kubeflow: Conversation with Dirk-Jan Kubeflow (vs cloud native solutions like SageMaker) - Data Scientist at Dept Agency . (From the website:) The Machine Learning Toolkit for Kubernetes. The Kubeflow project is dedicated to making deployments of...
byMachine Learning Guide
0 ratings
0% found this document useful
#124 Using AI to Improve Data Quality in Healthcare
Podcast episode
#124 Using AI to Improve Data Quality in Healthcare
byDataFramed
0 ratings
0% found this document useful
#111 The Rise of the Julia Programming Language
Podcast episode
#111 The Rise of the Julia Programming Language
byDataFramed
0 ratings
0% found this document useful
Hasty Treat - Webhooks: In this Hasty Treat, Scott and Wes talk about webhooks — one of those concepts that seems a lot scarier than it actually is. Linode - Sponsor Whether you’re working on a personal project or managing enterprise infrastructure, you deserve simple,...
Podcast episode
Hasty Treat - Webhooks: In this Hasty Treat, Scott and Wes talk about webhooks — one of those concepts that seems a lot scarier than it actually is. Linode - Sponsor Whether you’re working on a personal project or managing enterprise infrastructure, you deserve simple,...
bySyntax - Tasty Web Development Treats
0 ratings
0% found this document useful
Level Up Your Data Platform With Active Metadata: A conversation with Atlan co-founder Prukalpa Sankar about the idea of active metadata and how it can reduce the toil involved in managing a data platform
Podcast episode
Level Up Your Data Platform With Active Metadata: A conversation with Atlan co-founder Prukalpa Sankar about the idea of active metadata and how it can reduce the toil involved in managing a data platform
byData Engineering Podcast
0 ratings
0% found this document useful
Building An Internal Database As A Service Platform At Cloudflare: Data persistence is one of the most challenging aspects of computer systems. In the era of the cloud most developers rely on hosted services to manage their databases, but what if you are a cloud service? In this episode Vignesh Ravichandran explains how his team at Cloudflare provides PostgreSQL as a service to their developers for low latency and high uptime services at global scale. This is an interesting and insightful look at pragmatic engineering for reliability and scale.
Podcast episode
Building An Internal Database As A Service Platform At Cloudflare: Data persistence is one of the most challenging aspects of computer systems. In the era of the cloud most developers rely on hosted services to manage their databases, but what if you are a cloud service? In this episode Vignesh Ravichandran explains how his team at Cloudflare provides PostgreSQL as a service to their developers for low latency and high uptime services at global scale. This is an interesting and insightful look at pragmatic engineering for reliability and scale.
byData Engineering Podcast
0 ratings
0% found this document useful
Find Out About The Technology Behind The Latest PFAD In Analytical Database Development: Building a database engine requires a substantial amount of engineering effort and time investment. Over the decades of research and development into building these software systems there are a number of common components that are shared across implementations. When Paul Dix decided to re-write the InfluxDB engine he found the Apache Arrow ecosystem ready and waiting with useful building blocks to accelerate the process. In this episode he explains how he used the combination of Apache Arrow, Flight, Datafusion, and Parquet to lay the foundation of the newest version of his time-series database.
Podcast episode
Find Out About The Technology Behind The Latest PFAD In Analytical Database Development: Building a database engine requires a substantial amount of engineering effort and time investment. Over the decades of research and development into building these software systems there are a number of common components that are shared across implementations. When Paul Dix decided to re-write the InfluxDB engine he found the Apache Arrow ecosystem ready and waiting with useful building blocks to accelerate the process. In this episode he explains how he used the combination of Apache Arrow, Flight, Datafusion, and Parquet to lay the foundation of the newest version of his time-series database.
byData Engineering Podcast
0 ratings
0% found this document useful
Building Linked Data Products With JSON-LD: A significant amount of time in data engineering is dedicated to building connections and semantic meaning around pieces of information. Linked data technologies provide a means of tightly coupling metadata with raw information. In this episode Brian Platz explains how JSON-LD can be used as a shared representation of linked data for building semantic data products.
Podcast episode
Building Linked Data Products With JSON-LD: A significant amount of time in data engineering is dedicated to building connections and semantic meaning around pieces of information. Linked data technologies provide a means of tightly coupling metadata with raw information. In this episode Brian Platz explains how JSON-LD can be used as a shared representation of linked data for building semantic data products.
byData Engineering Podcast
0 ratings
0% found this document useful
Addressing The Challenges Of Component Integration In Data Platform Architectures: Building a data platform that is enjoyable and accessible for all of its end users is a substantial challenge. One of the core complexities that needs to be addressed is the fractal set of integrations that need to be managed across the individual components. In this episode Tobias Macey shares his thoughts on the challenges that he is facing as he prepares to build the next set of architectural layers for his data platform to enable a larger audience to start accessing the data being managed by his team.
$Addressing The Challenges Of Component Integration In Data Platform Architectures: Building a data platform that is enjoyable and accessible for all of its end users is a substantial challenge. One of the core complexities that needs to be addressed is the fractal set of integrations that need to be managed across the individual components. In this episode Tobias Macey shares his thoughts on the challenges that he is facing as he prepares to build the next set of architectural layers for his data platform to enable a larger audience to start accessing the data being managed by his team.$
$Addressing The Challenges Of Component Integration In Data Platform Architectures: Building a data platform that is enjoyable and accessible for all of its end users is a substantial challenge. One of the core complexities that needs to be addressed is the fractal set of integrations that need to be managed across the individual components. In this episode Tobias Macey shares his thoughts on the challenges that he is facing as he prepares to build the next set of architectural layers for his data platform to enable a larger audience to start accessing the data being managed by his team.$
Podcast episode
Addressing The Challenges Of Component Integration In Data Platform Architectures: Building a data platform that is enjoyable and accessible for all of its end users is a substantial challenge. One of the core complexities that needs to be addressed is the fractal set of integrations that need to be managed across the individual components. In this episode Tobias Macey shares his thoughts on the challenges that he is facing as he prepares to build the next set of architectural layers for his data platform to enable a larger audience to start accessing the data being managed by his team.
byData Engineering Podcast
0 ratings
0% found this document useful
Doing DevRel on Easy Mode with Matty Stratton: Corey’s good friend Matt “Matty” Stratton, now a Staff Developer Advocate at Pulumi, is back for another round of “Screaming!” Now, with a job title that sits at the top of a “very strange career trajectory.” With beginnings at Chef, to IMB, and now Pulum
Podcast episode
Doing DevRel on Easy Mode with Matty Stratton: Corey’s good friend Matt “Matty” Stratton, now a Staff Developer Advocate at Pulumi, is back for another round of “Screaming!” Now, with a job title that sits at the top of a “very strange career trajectory.” With beginnings at Chef, to IMB, and now Pulum
byScreaming in the Cloud
0 ratings
0% found this document useful
How to Build a Website — The Show For Beginners: In this episode of Syntax, Scott and Wes talk about the basics of building a website — how to get started for beginners! Freshbooks - Sponsor Get a 30 day free trial of Freshbooks at and put SYNTAX in the “How did you hear about us?”...
Podcast episode
How to Build a Website — The Show For Beginners: In this episode of Syntax, Scott and Wes talk about the basics of building a website — how to get started for beginners! Freshbooks - Sponsor Get a 30 day free trial of Freshbooks at and put SYNTAX in the “How did you hear about us?”...
bySyntax - Tasty Web Development Treats
0 ratings
0% found this document useful
Eliminate The Overhead In Your Data Integration With The Open Source dlt Library: Cloud data warehouses and the introduction of the ELT paradigm has led to the creation of multiple options for flexible data integration, with a roughly equal distribution of commercial and open source options. The challenge is that most of those options are complex to operate and exist in their own silo. The dlt project was created to eliminate overhead and bring data integration into your full control as a library component of your overall data system. In this episode Adrian Brudaru explains how it works, the benefits that it provides over other data integration solutions, and how you can start building pipelines today.
Podcast episode
Eliminate The Overhead In Your Data Integration With The Open Source dlt Library: Cloud data warehouses and the introduction of the ELT paradigm has led to the creation of multiple options for flexible data integration, with a roughly equal distribution of commercial and open source options. The challenge is that most of those options are complex to operate and exist in their own silo. The dlt project was created to eliminate overhead and bring data integration into your full control as a library component of your overall data system. In this episode Adrian Brudaru explains how it works, the benefits that it provides over other data integration solutions, and how you can start building pipelines today.
byData Engineering Podcast
0 ratings
0% found this document useful
#08 - Tech stack: Metabase, Superset, Redash, Grafana
Podcast episode
#08 - Tech stack: Metabase, Superset, Redash, Grafana
byTOPP - The Open Podcast Podcast
0 ratings
0% found this document useful
Episode 464 - Azure Deployment Environments: Cale and Russell talk to the Microsoft Program Manager for DevBox and Azure Deployment Environments, Sagar Chandra Reddy Lankala, about how Azure Deployment Environments can enable rapid deployment of on-demand dev/test environments while providing governance, security and cost management - plus some more updates from Microsoft Build 2023! Media File: https://azpodcast.blob.core.windows.net/episodes/Episode464.mp3 Sagar's links: GA blog - https://aka.ms/ade-ga-blog Sign up for Terraform support - https://aka.ms/ade-terraform-signup LinkedIn profile - https://www.linkedin.com/in/sagarchandrareddy Other updates mentioned in this episode: Public preview: Introducing NGads V620-series VMs optimized for cloud gaming | Azure updates | Microsoft Azure Generally available: Azure Data Explorer Kusto Emulator on Linux | Azure updates | Microsoft Azure Explore the latest features for Datadog—An Azure Native ISV Service Microsoft Cost Management updates
Podcast episode
Episode 464 - Azure Deployment Environments: Cale and Russell talk to the Microsoft Program Manager for DevBox and Azure Deployment Environments, Sagar Chandra Reddy Lankala, about how Azure Deployment Environments can enable rapid deployment of on-demand dev/test environments while providing governance, security and cost management - plus some more updates from Microsoft Build 2023! Media File: https://azpodcast.blob.core.windows.net/episodes/Episode464.mp3 Sagar's links: GA blog - https://aka.ms/ade-ga-blog Sign up for Terraform support - https://aka.ms/ade-terraform-signup LinkedIn profile - https://www.linkedin.com/in/sagarchandrareddy Other updates mentioned in this episode: Public preview: Introducing NGads V620-series VMs optimized for cloud gaming | Azure updates | Microsoft Azure Generally available: Azure Data Explorer Kusto Emulator on Linux | Azure updates | Microsoft Azure Explore the latest features for Datadog—An Azure Native ISV Service Microsoft Cost Management updates
byThe Azure Podcast
0 ratings
0% found this document useful
Simple And Scalable Encryption Of Data In Use For Analytics And Machine Learning With Opaque Systems: Encryption and security are critical elements in data analytics and machine learning applications. We have well developed protocols and practices around data that is at rest and in motion, but security around data in use is still severely lacking. Recognizing this shortcoming and the capabilities that could be unlocked by a robust solution Rishabh Poddar helped to create Opaque Systems as an outgrowth of his PhD studies. In this episode he shares the work that he and his team have done to simplify integration of secure enclaves and trusted computing environments into analytical workflows and how you can start using it without re-engineering your existing systems.
Podcast episode
Simple And Scalable Encryption Of Data In Use For Analytics And Machine Learning With Opaque Systems: Encryption and security are critical elements in data analytics and machine learning applications. We have well developed protocols and practices around data that is at rest and in motion, but security around data in use is still severely lacking. Recognizing this shortcoming and the capabilities that could be unlocked by a robust solution Rishabh Poddar helped to create Opaque Systems as an outgrowth of his PhD studies. In this episode he shares the work that he and his team have done to simplify integration of secure enclaves and trusted computing environments into analytical workflows and how you can start using it without re-engineering your existing systems.
byData Engineering Podcast
0 ratings
0% found this document useful
Cloud Dataflow with Frances Perry: Cloud Dataflow and its OSS counterpart Apache Beam are amazing tools for Big Data so we asked Frances Perry, the Tech Lead and PMC for those projects, to join us and tell us more about it.
Podcast episode
Cloud Dataflow with Frances Perry: Cloud Dataflow and its OSS counterpart Apache Beam are amazing tools for Big Data so we asked Frances Perry, the Tech Lead and PMC for those projects, to join us and tell us more about it.
byGoogle Cloud Platform Podcast
0 ratings
0% found this document useful
That Datadog Will Hunt with Dann Berg: Dann Berg, Senior Cloud Analyst at Datadog, was an early guest on "Screaming" is back again! Now with the title “senior” attached to the front end of his job. Dann and Datadog are also steeped in the mires of AWS billing, so naturally he and Corey have a
Podcast episode
That Datadog Will Hunt with Dann Berg: Dann Berg, Senior Cloud Analyst at Datadog, was an early guest on "Screaming" is back again! Now with the title “senior” attached to the front end of his job. Dann and Datadog are also steeped in the mires of AWS billing, so naturally he and Corey have a
byScreaming in the Cloud
0 ratings
0% found this document useful
Powering Vector Search With Real Time And Incremental Vector Indexes: The rapid growth of machine learning, especially large language models, have led to a commensurate growth in the need to store and compare vectors. In this episode Louis Brandy discusses the applications for vector search capabilities both in and outside of AI, as well as the challenges of maintaining real-time indexes of vector data.
Podcast episode
Powering Vector Search With Real Time And Incremental Vector Indexes: The rapid growth of machine learning, especially large language models, have led to a commensurate growth in the need to store and compare vectors. In this episode Louis Brandy discusses the applications for vector search capabilities both in and outside of AI, as well as the challenges of maintaining real-time indexes of vector data.
byData Engineering Podcast
0 ratings
0% found this document useful
Composable Data Analytics
Podcast episode
Composable Data Analytics
byThe Cloudcast
0 ratings
0% found this document useful
Column by your name: The analytics database that skips the rows: On this sponsored episode of the podcast, we chat with Rohit (Ro) Amarnath, the CTO at Vertica, to find out how your analytics engine can speed up your workflow. After a humble beginning with a ZX Spectrum 128, he’s now in charge of Vertica Accelerator, a SaaS version of the Vertica database.
Podcast episode
Column by your name: The analytics database that skips the rows: On this sponsored episode of the podcast, we chat with Rohit (Ro) Amarnath, the CTO at Vertica, to find out how your analytics engine can speed up your workflow. After a humble beginning with a ZX Spectrum 128, he’s now in charge of Vertica Accelerator, a SaaS version of the Vertica database.
byThe Stack Overflow Podcast
0 ratings
0% found this document useful
Enhancing The Abilities Of Software Engineers With Generative AI At Tabnine: Software development involves an interesting balance of creativity and repetition of patterns. Generative AI has accelerated the ability of developer tools to provide useful suggestions that speed up the work of engineers. Tabnine is one of the main platforms offering an AI powered assistant for software engineers. In this episode Eran Yahav shares the journey that he has taken in building this product and the ways that it enhances the ability of humans to get their work done, and when the humans have to adapt to the tool.
Podcast episode
Enhancing The Abilities Of Software Engineers With Generative AI At Tabnine: Software development involves an interesting balance of creativity and repetition of patterns. Generative AI has accelerated the ability of developer tools to provide useful suggestions that speed up the work of engineers. Tabnine is one of the main platforms offering an AI powered assistant for software engineers. In this episode Eran Yahav shares the journey that he has taken in building this product and the ways that it enhances the ability of humans to get their work done, and when the humans have to adapt to the tool.
byData Engineering Podcast
0 ratings
0% found this document useful
#302: March 2019 Update Show Part One: Simon is joined by new update co-host Nicki Klein (Senior Technical Evangelist, AWS) to discuss all
Podcast episode
#302: March 2019 Update Show Part One: Simon is joined by new update co-host Nicki Klein (Senior Technical Evangelist, AWS) to discuss all
byAWS Podcast
100%
100% found this document useful
#608: Generative AI Roundup - August 2023: Simon takes you on a tour of your GenAI options. From software development, to AI policy, to trialli
Podcast episode
#608: Generative AI Roundup - August 2023: Simon takes you on a tour of your GenAI options. From software development, to AI policy, to trialli
byAWS Podcast
0 ratings
0% found this document useful
Version Your Data Lakehouse Like Your Software With Nessie: Data lakehouse architectures are gaining popularity due to the flexibility and cost effectiveness that they offer. The link that bridges the gap between data lake and warehouse capabilities is the catalog. The primary purpose of the catalog is to inform the query engine of what data exists and where, but the Nessie project aims to go beyond that simple utility. In this episode Alex Merced explains how the branching and merging functionality in Nessie allows you to use the same versioning semantics for your data lakehouse that you are used to from Git.
Podcast episode
Version Your Data Lakehouse Like Your Software With Nessie: Data lakehouse architectures are gaining popularity due to the flexibility and cost effectiveness that they offer. The link that bridges the gap between data lake and warehouse capabilities is the catalog. The primary purpose of the catalog is to inform the query engine of what data exists and where, but the Nessie project aims to go beyond that simple utility. In this episode Alex Merced explains how the branching and merging functionality in Nessie allows you to use the same versioning semantics for your data lakehouse that you are used to from Git.
byData Engineering Podcast
0 ratings
0% found this document useful
Using Product Driven Development To Improve The Productivity And Effectiveness Of Your Data Teams: With all of the messaging about treating data as a product it is becoming difficult to know what that even means. Vishal Singh is the head of products at Starburst which means that he has to spend all of his time thinking and talking about the details of product thinking and its application to data. In this episode he shares his thoughts on the strategic and tactical elements of moving your work as a data professional from being task-oriented to being product-oriented and the long term improvements in your productivity that it provides.
Podcast episode
Using Product Driven Development To Improve The Productivity And Effectiveness Of Your Data Teams: With all of the messaging about treating data as a product it is becoming difficult to know what that even means. Vishal Singh is the head of products at Starburst which means that he has to spend all of his time thinking and talking about the details of product thinking and its application to data. In this episode he shares his thoughts on the strategic and tactical elements of moving your work as a data professional from being task-oriented to being product-oriented and the long term improvements in your productivity that it provides.
byData Engineering Podcast
0 ratings
0% found this document useful
Establish A Single Source Of Truth For Your Data Consumers With A Semantic Layer: Maintaining a single source of truth for your data is the biggest challenge in data engineering. Different roles and tasks in the business need their own ways to access and analyze the data in the organization. In order to enable this use case, while maintaining a single point of access, the semantic layer has evolved as a technological solution to the problem. In this episode Artyom Keydunov, creator of Cube, discusses the evolution and applications of the semantic layer as a component of your data platform, and how Cube provides speed and cost optimization for your data consumers.
Podcast episode
Establish A Single Source Of Truth For Your Data Consumers With A Semantic Layer: Maintaining a single source of truth for your data is the biggest challenge in data engineering. Different roles and tasks in the business need their own ways to access and analyze the data in the organization. In order to enable this use case, while maintaining a single point of access, the semantic layer has evolved as a technological solution to the problem. In this episode Artyom Keydunov, creator of Cube, discusses the evolution and applications of the semantic layer as a component of your data platform, and how Cube provides speed and cost optimization for your data consumers.
byData Engineering Podcast
0 ratings
0% found this document useful

Skip carousel

Manipulate Data Like A Pro With Pandas
Linux Format
Article
Manipulate Data Like A Pro With Pandas
Jul 27, 2021
7 min read
01 Giving Data Collectors—and Donors—a Real-Time Rush
Fast Company
Article
01 Giving Data Collectors—and Donors—a Real-Time Rush
Mar 20, 2017
7 min read
MapReduce: The ‘Big Data’ Idea Inside Your Android Phone
APC
Article
MapReduce: The ‘Big Data’ Idea Inside Your Android Phone
Dec 2, 2019
4 min read
Rolling The Database As A Service
Linux Format
Article
Rolling The Database As A Service
Aug 27, 2019
A couple of times during our conversation, Robin alluded to the fact that DataStax has now set its eyes on helping users eradicate some of the day-to-day operational complexity from their workflow. The DataStax Apache Cassandra as a Service is one of
2 min read
How To Build The Linux Format Server
Linux Format
Article
How To Build The Linux Format Server
Oct 19, 2021
10 min read
Rediscover Speed With The Redis Revolution
Linux Format
Article
Rediscover Speed With The Redis Revolution
Jul 25, 2023
Credit: https://redis.io Redis is an open-source, in-memory data structure store that has gained popularity R as a highly efficient caching and messaging system. It prioritises speed, efficiency and versatility, making it a top choice for various ap
8 min read
Business NAS appliances 2022
PC Pro Magazine
Article
Business NAS appliances 2022
Apr 10, 2022
4 min read
Business NAS appliances 2021
PC Pro Magazine
Article
Business NAS appliances 2021
May 13, 2021
4 min read
Enterprise Soaring Success
Linux Format
Article
Enterprise Soaring Success
Aug 27, 2019
7 min read
FLASK Web Frameworks
Linux Format
Article
FLASK Web Frameworks
Jun 4, 2019
The main focus of Python has always been to get you cracking on with your coding – the language was never made for web programming. However, this has just made it more interesting to extend the language for the web, or to create an interface to web-b
9 min read
Enterprise-grade Monitoring Made Easy
Linux Format
Article
Enterprise-grade Monitoring Made Easy
Mar 10, 2020
9 min read
The Network NAS appliances 2024
PC Pro Magazine
Article
The Network NAS appliances 2024
Apr 4, 2024
4 min read
Business NAS appliances 2023
PC Pro Magazine
Article
Business NAS appliances 2023
Apr 6, 2023
4 min read
AWS Vs Azure What’s The Difference?
PC Pro Magazine
Article
AWS Vs Azure What’s The Difference?
Sep 11, 2022
7 min read
The Big Cloud Question: How Can You Protect Your Assets On Someone Else’s Servers?
PC Pro Magazine
Article
The Big Cloud Question: How Can You Protect Your Assets On Someone Else’s Servers?
Aug 10, 2023
I write this as an old fool who remembers sitting in endless meetings and presentations back when the whole concept of the cloud was starting up. So I can tell you that, right from the beginning, while vendors were pitching the all-encompassing busin
6 min read
KeePassXC: The Friendlier Free Offline Password Manager
PCWorld
Article
KeePassXC: The Friendlier Free Offline Password Manager
Sep 5, 2023
7 min read
NEXTCLOUD Host Nextcloud 20 on a Raspberry Pi
Linux Format
Article
NEXTCLOUD Host Nextcloud 20 on a Raspberry Pi
Jan 12, 2021
8 min read
Building A Better File Server With The Pi
Linux Format
Article
Building A Better File Server With The Pi
Sep 21, 2021
Running your own cloud storage server saves money, allows you to expand storage as necessary, and can be done with a device as small as a Raspberry Pi. Our previous guide to setting up a Nextcloud server on the Raspberry Pi (LXF280) covered everythin
4 min read
AI As A Service
PC Pro Magazine
Article
AI As A Service
Jul 9, 2020
2 min read
DataStax The Real-time Data Company, Unveiled “Change Data Capture” (CDC) for Astra DB
Techfastly
Article
DataStax The Real-time Data Company, Unveiled “Change Data Capture” (CDC) for Astra DB
May 1, 2022
3 min read
Building A Better File Server With The Pi
APC
Article
Building A Better File Server With The Pi
Dec 27, 2021
4 min read
AWS vs Azure
Linux Format
Article
AWS vs Azure
Aug 22, 2023
9 min read
Lag Is Killing Games
Linux Format
Article
Lag Is Killing Games
Jan 11, 2022
8 min read
Accurate, Open Source IP-based Localisation
Linux Format
Article
Accurate, Open Source IP-based Localisation
Dec 14, 2021
8 min read
Asuswrt Merlin vs Netgear Nighthawk
Maximum PC
Article
Asuswrt Merlin vs Netgear Nighthawk
Nov 9, 2021
ROUTERS HAVE COME a long way in the past 20 years, each year brings better features, new designs, and bigger wireless ranges. But for the enthusiast, a good router stretches beyond just “how quick is it?” And quickly moves on to “how easy is it to co
4 min read
Jonathan Ellis INTERVIEW
Linux Format
Article
Jonathan Ellis INTERVIEW
Oct 22, 2019
6 min read
Build A Pi-powered Network Storage Device
Linux Format
Article
Build A Pi-powered Network Storage Device
Dec 14, 2021
10 min read
Doctor
Maximum PC
Article
Doctor
Oct 11, 2022
6 min read
Supercomputer On A Platter
Business Today
Article
Supercomputer On A Platter
Apr 1, 2022
CHENNAI-HEADQUARTERED automobile major TVS Motor Company uses high-performance computing (HPC) for running R&D simulations and testing the aero-dynamics of two-wheelers, which allows it to make the vehicles stable at speed and more efficient, cool en
7 min read
It’s A Virtual Serverworld
Linux Format
Article
It’s A Virtual Serverworld
Sep 21, 2021
9 min read

Related categories

Skip carousel

Reviews for SAS Viya

Rating: 0 out of 5 stars

0 ratings

0 ratings0 reviews

Book preview

SAS Viya - Kevin D. Smith

Chapter 1: Installing Python, SAS SWAT, and CAS

Installing Python

Installing SAS SWAT

Installing CAS

Making Your First Connection

Conclusion

There are three primary pieces of software that must be installed in order to use SAS Cloud Analytic Services (CAS) from Python:

● Python 2.7 if you use Python 2, or a minimum of Python 3.4 if you use Python 3

● the SAS SWAT Python package

● the CAS server

We cover the recommended ways to install each piece of software in this chapter.

Installing Python

The Python packages that are used to connect to CAS have a minimum requirement of Python 2.7. If you are using version 3 of Python, you need a minimum of Python 3.4. There are some significant differences between Python 2 and Python 3, which are only touched on in this book. We recommend that you conduct your own research about the two primary versions of Python and choose the version that is appropriate for your needs. If you are not familiar with Python or if you don’t have a version preference, we recommend that you use the most recent release of Python 3. If you have an installation of Python 2 that you are using for existing work, then you can continue to use it. The Python package that is used to connect to CAS is compatible with both Python 2 and Python 3.

If you plan to use Microsoft Windows as your client operating system, you might not have an existing Python installation. If you use the Linux operating system or the Macintosh operating system, you probably have a Python installation already. In either case, you might need to install some prerequisite packages. We recommend that you start with a Python distribution such as Anaconda from Continuum Analytics at www.continuum.io which contains all of the prerequisites.

The Anaconda Python distribution includes dozens of the most popular Python packages, which can be installed easily on Windows, Linux, and Macintosh platforms. It also enables you to install a complete Python installation at any location on your system, including your home directory, so that you don’t need administrator privileges. Even if you do have administrator privileges and you have an existing Python installation on the Linux or Macintosh platforms, installing Anaconda as a separate Python is a good idea in order to prevent any mishaps that might occur while installing packages in the existing Python installation.

After you have installed Python, the next step is to install the SWAT package.

Installing SAS SWAT

The SAS SWAT package is the Python package created by SAS which is used to connect to CAS. SWAT stands for SAS Scripting Wrapper for Analytics Transfer. It includes two interfaces to CAS: 1) natively compiled client for binary communication, and 2) a pure Python REST client for HTTP-based connections. Support for the different protocols varies based on the platform that is used. So, you’ll have to check the downloads on the GitHub project to find out what is available for your platform.

To install SWAT, you use the standard Python installation tool pip. On Linux and Macintosh, the pip command is in the bin directory of your Anaconda installation. On Windows, it is in the Scripts directory of the Anaconda distribution. The SWAT installers are located at GitHub in the python-swat project of the sassoftware account. The available releases are listed at the following link:

https://github.com/sassoftware/python-swat/releases

You can install SWAT directly from the download link using pip as follows.

pip install https://github.com/sassoftware/python-

swat/releases/download/vX.X.X/python-swat-X.X.X-platform.tar.gz

Where X.X.X is the version number, and platform is the platform that you are installing on. If your platform isn’t available, you can install using the source code URL on the releases page instead, but you are restricted to using the REST interface over HTTP or HTTPS. The source code release is pure Python, so it will run wherever Python and the prerequisite packages are supported.

Note that if you have both Python 2 and Python 3 installed on your system (or even multiple installations of a particular Python version), you need to be careful to run the pip command from the installation where SWAT is installed. In any case, the same SWAT package works for both Python 2 and Python 3.

After SWAT is installed, you should be able to run the following command in Python in order to load the SWAT package:

>>> import swat

With Anaconda, you can submit the preceding code in several ways. You can use the python command at the command line. However, if you are going to use the command line, we’d recommend that you at least use the ipython command, which is preferred for interactive use. You also have the option of using the Spyder IDE that comes bundled with Anaconda. The Spyder IDE is useful for debugging as well as for development and interactive use. You can also use the popular Jupyter notebook, which was previously known as the IPython notebook. Jupyter is most commonly used within a web browser. It can be launched with the jupyter notebook command at the command line, or you can launch it from the Anaconda Launcher application.

In this book, we primarily show plain text output using the IPython interpreter. However, all of the code from this book is also available in the form of Jupyter notebooks here,

https://github.com/sassoftware/sas-viya-the-python-perspective

Now that we have installed Python and SWAT, the last thing we need is a CAS server.

Installing CAS

The installation of CAS is beyond the scope of this book. Installation on your own server requires a CAS software license and system administrator privileges. You need to contact your system administrator about installing, configuring, and running CAS.

Making Your First Connection

With all of the pieces in place, we can make a test connection just to verify that everything is working. From Python, you should be able to run the following commands:

>>> import swat

>>> conn = swat.CAS('server-name.mycompany.com', port-number,

'userid', 'password')

>>> conn.serverstatus()

>>> conn.close()

Where server-name.mycompany.com is the name or IP address of your CAS server, port-number is the port number that CAS is listening to, userid is your CAS user ID, and password is your CAS password. The serverstatus method should return information about the CAS grid that you are connected to, and the close method closes the connection. If the commands run successfully, then you are ready to move on. If not, you’ll have to do some troubleshooting before you continue.

Conclusion

At this point, you should have Python and the SWAT package installed, and you should have a running CAS server. In the next chapter, we’ll give a brief summary of what it’s like to use CAS from Python. Then, we’ll dig into the chapters that go into the details of each aspect of SWAT.

Chapter 2: The Ten-Minute Guide to Using CAS from Python

Importing SWAT and Getting Connected

Running CAS Actions

Loading Data

Executing Actions on CAS Tables

Data Visualization

Closing the Connection

Conclusion

If you are already familiar with Python, have a running CAS server, and just can’t wait to get started, we’ve written this chapter just for you. This chapter is a very quick summary of what you can do with CAS from Python. We don’t provide a lot of explanation of the examples; that comes in the later chapters. This chapter is here for those who want to dive in and work through the details in the rest of the book as needed.

In all of the sample code in this chapter, we are using the IPython interface to Python.

Importing SWAT and Getting Connected

The only thing you need to know about the CAS server in order to get connected is the host name, the port number, your user name, and your password. The SWAT package contains the CAS class that is used to communicate with the server. The arguments to the CAS class are hostname, port, username, and password1, in that order. Note that you can use the REST interface by specifying the HTTP port that is used by the CAS server. The CAS class can autodetect the port type for the standard CAS port and HTTP. However, if you use HTTPS, you must specify protocol=’https’ as a keyword argument to the CAS constructor. You can also specify ‘cas’ or ‘http’ to explicitly override autodetection.

In [1]: import swat

In [2]: conn = swat.CAS('server-name.mycompany.com', 5570,

...: 'username', 'password')

When you connect to CAS, it creates a session on the server. By default, all resources (CAS actions, data tables, options, and so on) are available only to that session. Some resources can be promoted to a global scope, which we discuss later in the book.

To see what CAS actions are available, use the help method on the CAS connection object, which calls the help action on the CAS server.

In [3]: out = conn.help()

NOTE: Available Action Sets and Actions:

NOTE: accessControl

NOTE: assumeRole - Assumes a role

NOTE: dropRole - Relinquishes a role

NOTE: showRolesIn - Shows the currently active role

NOTE: showRolesAllowed - Shows the roles that a user

is a member of

NOTE: isInRole - Shows whether a role is assumed

NOTE: isAuthorized - Shows whether access is authorized

NOTE: isAuthorizedActions - Shows whether access is

authorized to actions

NOTE: isAuthorizedTables - Shows whether access is authorized

to tables

NOTE: isAuthorizedColumns - Shows whether access is authorized

to columns

NOTE: listAllPrincipals - Lists all principals that have

explicit access controls

NOTE: whatIsEffective - Lists effective access and

explanations (Origins)

NOTE: partition - Partitions a table

NOTE: recordCount - Shows the number of rows in a Cloud

Analytic Services table

NOTE: loadDataSource - Loads one or more data source interfaces

NOTE: update - Updates rows in a table

The printed notes describe all of the CAS action sets and the actions in those action sets. The help action also returns the action set and action information as a return value. The return values from all actions are in the form of CASResults objects, which are a subclass of the Python collections.OrderedDict class. To see a list of all of the keys, use the keys method just as you would with any Python dictionary. In this case, the keys correspond to the names of the CAS action sets.

In [4]: list(out.keys())

Out[4]:

['accessControl',

'builtins',

'configuration',

'dataPreprocess',

'dataStep',

'percentile',

'search',

'session',

'sessionProp',

'simple',

'table']

Printing the contents of the return value shows all of the top-level keys as sections. In the case of the help action, the information about each action set is returned in a table in each section. These tables are stored in the dictionary as Pandas DataFrames.

In [5]: out

Out[5]:

[accessControl]

name description

0 assumeRole Assumes a role

1 dropRole Relinquishes a role

2 showRolesIn Shows the currently active role

3 showRolesAllowed Shows the roles that a user is a mem...

4 isInRole Shows whether a role is assumed

5 isAuthorized Shows whether access is authorized

6 isAuthorizedActions Shows whether access is authorized t...

7 isAuthorizedTables Shows whether access is authorized t...

8 isAuthorizedColumns Shows whether access is authorized t...

9 listAllPrincipals Lists all principals that have expli...

10 whatIsEffective Lists effective access and explanati...

11 listAcsData Lists access controls for caslibs, t...

12 listAcsActionSet Lists access controls for an action ...

13 repAllAcsCaslib Replaces all access controls for a c...

14 repAllAcsTable Replaces all access controls for a t...

15 repAllAcsColumn Replaces all access controls for a c...

16 repAllAcsActionSet Replaces all access controls for an ...

17 repAllAcsAction Replaces all access controls for an ...

18 updSomeAcsCaslib Adds, deletes, and modifies some acc...

19 updSomeAcsTable Adds, deletes, and modifies some acc...

... truncated ...

+ Elapsed: 0.0034s, user: 0.003s, mem: 0.164mb

Since the output is based on the dictionary object, you can access each key individually as well.

In [6]: out['builtins']

Out[6]:

name description

0 addNode Adds a machine to the server

1 removeNode Remove one or more machines from the...

2 help Shows the parameters for an action o...

3 listNodes Shows the host names used by the server

4 loadActionSet Loads an action set for use in this ...

5 installActionSet Loads an action set in new sessions ...

6 log Shows and modifies logging levels

7 queryActionSet Shows whether an action set is loaded

8 queryName Checks whether a name is an action o...

9 reflect Shows detailed parameter information...

10 serverStatus Shows the status of the server

11 about Shows the status of the server

12 shutdown Shuts down the server

13 userInfo Shows the user information for your ...

14 actionSetInfo Shows the build information from loa...

15 history Shows the actions that were run in t...

16 casCommon Provides parameters that are common ...

17 ping Sends a single request to the server...

18 echo Prints the supplied parameters to th...

19 modifyQueue Modifies the action response queue s...

20 getLicenseInfo Shows the license information for a ...

21 refreshLicense Refresh SAS license information from...

22 httpAddress Shows the HTTP address for the serve...

The keys are commonly alphanumeric, so the CASResults object was extended to enable you to access keys as attributes as well. This just keeps your code a bit cleaner. However, you should be aware that if a result key has the same name as a Python dictionary method, the dictionary method takes precedence. In the following code, we access the builtins key again, but this time we access it as if it were an attribute.

In [7]: out.builtins

Out[7]:

name description

0 addNode Adds a machine to the server

1 removeNode Remove one or more machines from the...

2 help Shows the parameters for an action o...

3 listNodes Shows the host names used by the server

4 loadActionSet Loads an action set for use in this ...

5 installActionSet Loads an action set in new sessions ...

6 log Shows and modifies logging levels

7 queryActionSet Shows whether an action set is loaded

8 queryName Checks whether a name is an action o...

9 reflect Shows detailed parameter information...

10 serverStatus Shows the status of the server

11 about Shows the status of the server

12 shutdown Shuts down the server

13 userInfo Shows the user information for your ...

14 actionSetInfo Shows the build information from loa...

15 history Shows the actions that were run in t...

16 casCommon Provides parameters that are common ...

17 ping Sends a single request to the server...

18 echo Prints the supplied parameters to th...

19 modifyQueue Modifies the action response queue s...

20 getLicenseInfo Shows the license information for a ...

21 refreshLicense Refresh SAS license information from...

22 httpAddress Shows the HTTP address for the serve...

Running CAS Actions

Just like the help action, all of the action sets and actions are available as attributes and methods on the CAS connection object. For example, the userinfo action is called as follows.

In [8]: conn.userinfo()

Out[8]:

[userInfo]

{'anonymous': False,

'groups': ['users'],

'hostAccount': True,

'providedName': 'username',

'providerName': 'Active Directory',

'uniqueId': 'username',

'userId': 'username'}

+ Elapsed: 0.000291s, mem: 0.0826mb

The result this time is a CASResults object, the contents of which is a dictionary under a single key (userInfo) that contains information about your user account. Although all actions return a CASResults object, there are no strict rules about what keys and values are in that object. The returned values are determined by the action and vary depending on the type of information returned. Analytic actions typically return one or more DataFrames. If you aren’t using IPython to format your results automatically, you can cast the result to a dictionary and then print it using pprint for a nicer representation.

In [9]: from pprint import pprint

In [10]: pprint(dict(conn.userinfo()))

{'userInfo': {'anonymous': False,

'groups': ['users'],

'hostAccount': True,

'providedName': 'username',

'providerName': 'Active Directory',

'uniqueId': 'username',

'userId': 'username'}}

When calling the help and userinfo actions, we actually used a shortcut. In some cases, you might need to specify the fully qualified name of the action, which includes the action set name. This can happen if two action sets have an action of the same name, or an action name collides with an existing method or attribute name on the CAS object. The userinfo action is contained in the builtins action set. To call it using the fully qualified name, you use builtins.userinfo rather than userinfo on the CAS object. The builtins level in this call corresponds to a CASActionSet object that contains all of the actions in the builtins action set.

In [11]: conn.builtins.userinfo()

The preceding code provides you with the same result as the previous example does.

Loading Data

The easiest way to load data into a CAS server is by using the upload method on the CAS connection object. This method uses a file path or URL that points to a file in various possible formats including CSV, Excel, and SAS data sets. You can also pass a Pandas DataFrame object to the upload method in order to upload the data from that DataFrame to a CAS table. We use the classic Iris data set in the following data loading example.

In [12]: out = conn.upload('https://raw.githubusercontent.com/' +

....: 'pydata/pandas/master/pandas/tests/' +

....: 'data/iris.csv')

In [13]: out

Out[13]:

[caslib]

'CASUSER(username)'

[tableName]

'IRIS'

[casTable]

CASTable('IRIS', caslib='CASUSER(username)')

+ Elapsed: 0.0629s, user: 0.037s, sys: 0.021s, mem: 48.4mb

The output from the upload method is, again, a CASResults object. The output contains the name of the created table, the CASLib that the table was created in, and a CASTable object that can be used to interact with the table on the server. CASTable objects have all of the same CAS action set and action methods of the connection that created it. They also include many of the methods that are defined by Pandas DataFrames so that you can operate on them as if they were local DataFrames. However, until you explicitly fetch the data or call a method that returns data from the table (such as head or tail), all operations are simply combined on the client side (essentially creating a client-side view) until data is actually retrieved from the server.

We can use actions such as tableinfo and columninfo to access general information about the table itself and its columns.

# Store CASTable object in its own variable.

In [14]: iris = out.casTable

# Call the tableinfo action on the CASTable object.

In [15]: iris.tableinfo()

Out[15]:

[TableInfo]

Name Rows Columns Encoding CreateTimeFormatted \

0 IRIS 150 5 utf-8 01Nov2016:16:38:59

ModTimeFormatted JavaCharSet CreateTime ModTime \

0 01Nov2016:16:38:59 UTF8 1.793638e+09 1.793638e+09

Global Repeated View SourceName SourceCaslib Compressed \

0 0 0 0 0

Creator Modifier

0 username

+ Elapsed: 0.000856s, mem: 0.104mb

# Call the columninfo action on the CASTable.

In [16]: iris.columninfo()

Out[16]:

[ColumnInfo]

Column ID Type RawLength FormattedLength NFL NFD

0 SepalLength 1 double 8 12 0 0

1 SepalWidth 2 double 8 12 0 0

2 PetalLength 3 double 8 12 0 0

3 PetalWidth 4 double 8 12 0 0

4 Name 5 varchar 15 15 0 0

+ Elapsed: 0.000727s, mem: 0.175mb

Now that we have some data, let’s run some more interesting CAS actions on it.

Executing Actions on CAS Tables

The simple action set that comes with CAS contains some basic analytic actions. You can use either the help action or the IPython ? operator to view the available actions.

In [17]: conn.simple?

Type: Simple

String form:

File: swat/cas/actions.py

Definition: conn.simple(self, *args, **kwargs)

Docstring:

Analytics

Actions

-------

simple.correlation : Generates a matrix of Pearson product-moment

correlation coefficients

simple.crosstab : Performs one-way or two-way tabulations

simple.distinct : Computes the distinct number of values of the

variables in the variable list

simple.freq : Generates a frequency distribution for one or

more variables

simple.groupby : Builds BY groups in terms of the variable value

combinations given the variables in the variable

list

simple.mdsummary : Calculates multidimensional summaries of numeric

variables

simple.numrows : Shows the number of rows in a Cloud Analytic

Services table

simple.paracoord : Generates a parallel coordinates plot of the

variables in the variable list

simple.regression : Performs a linear regression up to 3rd-order

polynomials

simple.summary : Generates descriptive statistics of numeric

variables such as the sample mean, sample

variance, sample size, sum of squares, and so on

simple.topk : Returns the top-K and bottom-K distinct values of

each variable included in the variable list based

on a user-specified ranking order

Let’s run the summary action on our CAS table.

In [18]: summ = iris.summary()

In [19]: summ

Out[19]:

[Summary]

Descriptive Statistics for IRIS

Column Min Max N NMiss Mean Sum Std \

0 SepalLength 4.3 7.9 150.0 0.0 5.843333 876.5 0.828066

1 SepalWidth 2.0 4.4 150.0 0.0 3.054000 458.1 0.433594

2 PetalLength 1.0 6.9 150.0 0.0 3.758667 563.8 1.764420

3 PetalWidth 0.1 2.5 150.0 0.0 1.198667 179.8 0.763161

StdErr Var USS CSS CV TValue \

0 0.067611 0.685694 5223.85 102.168333 14.171126 86.425375

1 0.035403 0.188004 1427.05 28.012600 14.197587 86.264297

2 0.144064 3.113179 2583.00 463.863733 46.942721 26.090198

3 0.062312 0.582414 302.30 86.779733 63.667470 19.236588

ProbT

0 3.331256e-129

1 4.374977e-129

2 1.994305e-57

3 3.209704e-42

+ Elapsed: 0.0256s, user: 0.019s, sys: 0.009s, mem: 1.74mb

The summary action displays summary statistics in a form that is familiar to SAS users. If you want them in a form similar to what Pandas users are used to, you can use the describe method (just like on DataFrames).

In [20]: iris.describe()

Out[20]:

SepalLength SepalWidth PetalLength PetalWidth

count 150.000000 150.000000 150.000000 150.000000

mean 5.843333 3.054000 3.758667 1.198667

std 0.828066 0.433594 1.764420 0.763161

min 4.300000 2.000000 1.000000 0.100000

25% 5.100000 2.800000 1.600000 0.300000

50% 5.800000 3.000000 4.350000 1.300000

75% 6.400000 3.300000 5.100000 1.800000

max 7.900000 4.400000 6.900000 2.500000

Note that when you call the describe method on a CASTable object, it calls various CAS actions in the background to do the calculations. This includes the summary, percentile, and topk actions. The output of those actions is combined into a DataFrame in the same form that the real Pandas DataFrame describe method returns. This enables you to use CASTable objects and DataFrame objects interchangeably in your workflow for this method and many other methods.

Data Visualization

Since the tables that come back from the CAS server are subclasses of Pandas DataFrames, you can do anything to them that works on DataFrames. You can plot the results of your actions using the plot method or use them as input to more advanced packages such as Matplotlib and Bokeh, which are covered in more detail in a later section.

The following example uses the plot method to download the entire data set and plot it using the default options.

In [21]: iris.plot()

Out[21]:

If the plot doesn’t show up automatically, you might have to tell Matplotlib to display it.

In [22]: import matplotlib.pyplot as plt

In [23]: plt.show()

The output that is created by the plot method follows.

Even if you loaded the same data set that we have used in this example, your plot might look different since CAS stores data in a distributed manner. Because of this, the ordering of data from the server is not deterministic unless you sort it when it is fetched. If you run the following commands, you plot the data sorted by SepalLength and SepalWidth.

In [24]: iris.sort_values(['SepalLength', 'SepalWidth']).plot()

Enjoying the preview?

Page 1 of 1

SAS Viya: The Python Perspective

About this ebook

Kevin D. Smith

Related authors

Related to SAS Viya

Related ebooks

Applications & Software For You

Related podcast episodes

Related articles

Related categories

Reviews for SAS Viya

What did you think?

Book preview

SAS Viya - Kevin D. Smith

Chapter 1: Installing Python, SAS SWAT, and CAS

Installing Python

Installing SAS SWAT

Installing CAS

Making Your First Connection

Conclusion

Chapter 2: The Ten-Minute Guide to Using CAS from Python

Importing SWAT and Getting Connected

Running CAS Actions

Loading Data

Executing Actions on CAS Tables

Data Visualization