Capitalizing Data Science: A Guide to Unlocking the Power of Data for Your Business and Products (English Edition)
Ebook · 505 pages · 4 hours

About this ebook

Can you foresee how your company and its products will benefit from data science? How can the results of using AI and ML in business be tracked and questioned? Do questions like ‘how do you build a data science team?’ keep popping into your head?
All these strategic concerns and challenges are addressed in this book.

Firstly, the book explores the evolution of decision-making based on empirical evidence. It then compares the data-supported era with the current data-led era. It also discusses how to successfully run a data science project and what its lifecycle looks like. The book dives fairly deep into many of today's data-led applications, highlights example datasets, discusses obstacles, and explains machine learning models and algorithms intuitively.

The book also covers structural and organizational considerations for building a data science team, and recommends an optimal data science organization structure based on the company's stage of development. Finally, it helps technology leaders understand data science's effects on their businesses.
Language: English
Release date: Dec 3, 2022
ISBN: 9789355511591

    Book preview

    Capitalizing Data Science - Mathangi Sri Ramachandran

    CHAPTER 1

    Data-Driven Decisions from Beginning to Now

    Introduction

    Data and data science are ubiquitous. From driving cars to finding the nearest restaurant, data science is the backbone of technology today. In order to appreciate the current state better, we need to turn to the pages of history. This chapter traces the genesis of data science and its evolution by highlighting some of the key applications over a period of time. Broadly, there are two phases in the history of data-driven decisioning in organizations: one in which data plays a supporting role and another in which data plays the central and pivotal role. In this chapter, we will discuss the use cases in both phases and also what led to the current boom in the adoption of data science across different organizations. Toward the end of the chapter, we discuss the current challenges that could be addressed to further improve the impact that AI and data science can together create.

    Data-driven decisions and their phases

    If I were to ask you whether you would like to have coffee "with sugar or without", your response would be a function of your preferences, the numerous articles you have read on the impact of sugar, your exercise routine that week, existing health conditions, and so forth. All these data are processed in your brain before you provide an answer. Human beings seem, by nature, to be data-driven. Using data to make decisions has been prevalent for centuries now; it is not a recent concept. For example, there is an early mention of data in ancient India, in both the Rig Veda and the Arthashastra, where it referred to governance with the help of data (https://www.drishtiias.com/to-the-points/Paper2/census-in-india).

    However, what we have witnessed in the recent past has been an increase in the intensity and penetration of data in decision-making processes for commercial purposes. We could possibly trace the history of using data for day-to-day commercial decisioning to a company called the Manchester Guardian Society in 1826. This company used to publish the creditworthiness of customers as a newsletter every week. Banks could then use these newsletters to make decisions about their customers. This company later became Experian, one of the pioneering credit bureau companies in the world. Credit bureaus provide financial risk information about consumers to financial institutions. As we can see, banks have been the forerunners in using data for decisions and have been making large-scale data-driven decisions for more than a century now. Once banks started using data for decisioning, other industries soon followed suit.

    We can divide data-driven decisioning in organizations into two distinct phases: one being "human-led and data-supported" decisions, the other being "data-led and human-guided" decisions. (For ease of reading, let us refer to data-driven decisioning as D3 henceforth.) From the 1950s till about the early 2000s, the former type of decisioning was the most prevalent in industry, and in the last 10+ years, we have seen the prevalence of the latter.

    Human-led and data-supported decisions

    In the first phase of D3, data was restricted to a support function. Data was used for testing hypotheses and validating human decisions. Analysis was guided more by domain expertise and understanding of business processes. This was also a phase of instrumenting and storing large-scale data. There were advances in business intelligence and in tools that support dashboarding and reporting. Toward the beginning of 2000, there was a push toward deriving actionable results from this large store of data. However, data remained a back-office function and did not come into mainstream decision-making. This was an era of statistical analysis. Organizations were hiring statisticians who could help them with experimenting on and analyzing data. Some of the key industries that benefited from the use of statistics were banking, health care (especially clinical trials), manufacturing, and retail. Let us discuss some of the use cases in detail here:

    Risk scores in banking: Predictive models like regression, CHAID, and CART were used to understand the risk profile of customers. Given the transaction history of the user and the data from the credit bureau, the problem is to predict the default probability of the user. Statisticians built decision trees that could predict the risk score of a user, which were then used to approve or decline loans. A decision tree is a popular tool that tries to classify the target/dependent variable (in this case, default probability) by using the relationship of independent variables (such as age, income, and so on) to the dependent variable. An example could be to explain the default rate with age and income ranges. A decision tree is created with default rate as the dependent variable and age and income as the independent variables. In this case, shown in figure 1.1, the default probability is higher in the lower age bucket than in the higher age bucket. After the split on the age bucket, the default probability is explained by income buckets. Please refer to the following figure:

    Figure 1.1: Decision tree for default rates

    If the bank wants to reduce its default rate to 2%, then it needs to target users with age > 30 and "medium" and "high" income ranges. This tree could be arrived at using algorithms that find the best split at each node, or it could be built using what a risk manager thinks is the right split to go with. As the default rate gets better, the set of eligible users also decreases. The risk manager arrives at a trade-off between risk and user coverage to come up with a suitable set of rules. These trees are very easy to explain and to use for making a decision. Such techniques were widely used to make credit approval decisions in the banking industry.
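    The tree in figure 1.1 is hand-built, but such a tree can also be learned from data. The following is a minimal sketch using scikit-learn on synthetic age/income data; the feature names, default rates, and tree settings are made up for illustration and are not from the book.

```python
# A minimal sketch: learning a small default-risk decision tree on synthetic data.
import numpy as np
from sklearn.tree import DecisionTreeClassifier, export_text

rng = np.random.default_rng(0)
n = 5000
age = rng.integers(21, 65, size=n)
income = rng.choice(["low", "medium", "high"], size=n, p=[0.4, 0.4, 0.2])

# Illustrative assumption: younger, lower-income users default more often.
default_prob = 0.02 + 0.06 * (age < 30) + 0.04 * (income == "low")
defaulted = (rng.random(n) < default_prob).astype(int)

income_level = np.select([income == "low", income == "medium"], [0, 1], default=2)
X = np.column_stack([age, income_level])

tree = DecisionTreeClassifier(max_depth=2, min_samples_leaf=200, random_state=0)
tree.fit(X, defaulted)

# Print the learned splits; a risk manager could instead hand-pick them.
print(export_text(tree, feature_names=["age", "income_level"]))
```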

    Clinical trials in health care: Before a new drug is launched in the market, it is tested on a set of patients to determine whether it acts on the underlying condition and produces the desired result. Patients are divided into two groups: test and control. The test set of patients is administered the medicine in question, and the control set of patients is administered a placebo. A placebo is an interesting idea where you give a drug with no therapeutic value to patients. The results are then compared between these two groups to know whether the drug produces a significant difference in the test group.
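    As a rough illustration of how the two groups could be compared, the sketch below runs a chi-squared test on made-up recovery counts; the book does not prescribe a specific test, and all the numbers here are hypothetical.

```python
# A minimal sketch: comparing recovery rates between a test group (drug)
# and a control group (placebo) with a chi-squared test on made-up counts.
from scipy.stats import chi2_contingency

# Rows are groups, columns are (recovered, not recovered).
counts = [
    [120, 80],   # test group
    [90, 110],   # control group
]

chi2, p_value, dof, expected = chi2_contingency(counts)
print(f"chi2 = {chi2:.2f}, p-value = {p_value:.4f}")
# A small p-value suggests the difference between the groups is unlikely
# to be due to chance alone.
```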

    Interestingly, the idea of a "placebo" as a control-group treatment originated as early as 1800, when a ship captain administered different treatments to his crew to cure them of a disease they caught on board. Formal experimentation in trials started taking a definitive shape in the 1940s.

    The 1962 drug amendment act in the US (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5299804/) made it mandatory for all drugs to show "substantial evidence" of drug safety using clinical trials. This gave rise to a new field called biostatistics: the application of quantitative methods in the field of biology. Some common areas of application are clinical trials, genomics, epidemiology, and so on.

    Survival analysis, another area that developed as part of biostatistics, is used in the medical field to predict the mortality rate of subjects under treatment. Given a set of factors and historic data, the statistical tool can provide the "duration of survival". This technique then also started to be used in fields other than medicine, for instance, to predict how long users will take to unsubscribe in a subscription-based business like telecom.
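    A minimal sketch of the subscription example follows, using the third-party lifelines library (not mentioned in the book) to fit a Kaplan-Meier survival curve on hypothetical churn data.

```python
# A minimal sketch: estimating subscriber "survival" (time until cancellation).
import numpy as np
from lifelines import KaplanMeierFitter

rng = np.random.default_rng(0)
# Hypothetical data: months until a subscriber cancels, and whether the
# cancellation was observed (True) or the subscriber is still active (False).
durations = rng.exponential(scale=12, size=500)
observed = rng.random(500) < 0.7

kmf = KaplanMeierFitter()
kmf.fit(durations, event_observed=observed)

print(kmf.median_survival_time_)  # typical months until cancellation
print(float(kmf.predict(6)))      # probability of still being subscribed at month 6
```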

    Design of experiments in manufacturing: Design of experiments (DOE) is a branch of statistics that helps in setting up and studying unbiased experiments. Manufacturing (where this field primarily evolved) has a lot of process parameters that need to be tuned to get the best outcome. Process parameters could be the setting of temperature, pressure, or any other variable that controls the quality of the output. DOE helps in designing such experiments and studying the impact of input parameters on the output. Let us say that process A is dependent on pressure and temperature. Say each of these factors operates at two settings: T1 and T2 for temperature, and P1 and P2 for pressure. We want to understand the quality of the output based on these settings and choose the best setting. Quality could be measured in terms of acceptance rates of the product after a quality check. The design of experiments table would look as shown in Table 1.1:

    Table 1.1: Design of experiments

    The experiment trials are randomized and repeated for replication and to bring down the errors in the experimentation. Once the results are measured, we analyze them to understand the best combination of temperature and pressure. This may sound trivial when the factors (for example, temperature and pressure) and their levels (T1, T2, P1, and P2) are limited. The table could become very large as we increase the number of factors and their levels. In such scenarios, there are experimental designs called "fractional factorial" designs that help us test a subset of combinations, which in turn reduces the overall cost. DOE is extensively used in industrial processes and has now started being used in consumer studies as well. Some e-commerce organizations use DOE to design and test banner ads.
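    A minimal sketch of the 2x2 full factorial design behind Table 1.1 is shown below; the acceptance rates are made up for illustration, and in practice each run would be randomized and replicated before comparing settings.

```python
# A minimal sketch: enumerating a 2x2 full factorial design for temperature
# and pressure and picking the best setting by (hypothetical) acceptance rate.
from itertools import product

temperatures = ["T1", "T2"]
pressures = ["P1", "P2"]

# Hypothetical acceptance rates measured after each combination of settings.
acceptance = {
    ("T1", "P1"): 0.91,
    ("T1", "P2"): 0.86,
    ("T2", "P1"): 0.95,
    ("T2", "P2"): 0.88,
}

for t, p in product(temperatures, pressures):
    print(f"temperature={t}, pressure={p}, acceptance={acceptance[(t, p)]:.2f}")

best_setting = max(acceptance, key=acceptance.get)
print("Best setting:", best_setting)
```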

    Providing actionable insights: One of the key use cases of data in phase 1 of D3 was to provide actionable insights to businesses. Examples would be to provide answers to questions like "Why are sales lower this month?", "What are the estimated sales of the intended product?", "Why are customers attriting?", "Are customers liking our ads on TV?", and so on. The responses to these questions would involve generating a set of hypotheses and validating whether they are statistically sound. Such questions are critical to many businesses, and they continue to be solved in today's businesses as well. In the earlier era, insights were produced by generating a set of hypotheses and validating each of those. Take, for example, the question "Why are customers attriting?". A list of hypotheses would be generated between the analyst/statistician and the domain expert. The domain expert could be a business manager, marketing manager, or customer support head. These hypotheses would typically involve two variables: the attrition rate of the customer, and the variable in question (for example, increase in price, tenure of the customer, % customer complaints, and so on). Hypotheses are investigated one at a time to conclude which could be causing customer attrition. Such techniques are nowadays driven by multivariate machine learning models, which help us investigate the impact of many different variables in a single mathematical formulation. However, understanding the cause of an event is not straightforward. Correlation does not imply causation, and this is an important principle to keep in mind while mining data for causal impacts. One way to understand causation is to simulate the causal effect (suggested by correlations) through experiments rather than by using inferential methods on observational data. In our example, to understand the reasons for attrition, the company could decrease/increase the price (keeping other variables constant) or other such correlated variables and study whether it impacts attrition rates.
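    To illustrate the shift from one-variable-at-a-time hypotheses to a single multivariate formulation, here is a minimal sketch that fits a logistic regression relating attrition to several variables at once. The variable names and effect sizes are invented, and, as noted above, the fitted coefficients capture correlation, not causation.

```python
# A minimal sketch: a multivariate model of attrition on synthetic data.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 10_000
price_increase = rng.random(n)            # relative price increase seen by the user
tenure_months = rng.integers(1, 60, n)    # how long the user has been a customer
complaint_rate = rng.random(n) * 0.2      # share of orders with a complaint

# Invented relationship: price increases and complaints raise attrition,
# longer tenure lowers it.
logit = -2 + 3 * price_increase + 5 * complaint_rate - 0.03 * tenure_months
attrited = rng.random(n) < 1 / (1 + np.exp(-logit))

X = np.column_stack([price_increase, tenure_months, complaint_rate])
model = LogisticRegression(max_iter=1000).fit(X, attrited)
print(dict(zip(["price_increase", "tenure_months", "complaint_rate"],
               model.coef_[0].round(2))))
```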

    The preceding use cases played a vital role during phase 1 of D3. As you can see, the use cases are more static and focused on providing insights to decision-makers. Human intuition and domain knowledge played a greater role in shaping the data strategy of organizations. Decision-makers consumed these insights and acted on them as they deemed fit. This phase is, hence, insight-driven rather than impact-driven. It was difficult to quantify the dollars impacted by these insights or analyses. Organizations could not pinpoint the value the analytics teams were driving. Any analytical project that only provides insights suffers from one of two possible states: either the provided insights are very intuitive and hence already known to the decision-makers, or they are too unexpected and hence difficult to believe and act upon. Also, generating new insights on a periodic basis is not possible if the trends do not change significantly. Hence, data could not add incremental value.

    Data-led and human-guided

    This phase marks the beginning of the widespread use of "data science". The term data science can be traced to Peter Naur, who used it freely in his 1974 work Concise Survey of Computer Methods (https://www.forbes.com/sites/gilpress/2013/05/28/a-very-short-history-of-data-science/?sh=2df2f2b055cf). Three things needed to happen for D3 to shift from human-led to data-led: the immense growth in data, accelerated by advances in computing power; the availability of machine learning algorithms; and suitable use cases that prove the value of a data-first approach. Let us see each of these factors in detail:

    Growth of data: The growth of the internet in the 1990s resulted in a tremendous growth in the volume of data collected. The chart from Michael Lesk's article on the growth of data from 1995 to 1998 (http://www.lesk.com/mlesk/ksg97/ksg.html), shown in figure 1.2, illustrates the point of "data explosion". Please refer to the following figure:

    Figure 1.2: Growth of data

    We can also see the explosion of growth in data in the 2000s. The chart in figure 1.3 is from IDC’s study (https://www.seagate.com/files/www-content/our-story/trends/files/idc-seagate-dataage-whitepaper.pdf). Please refer to the following figure:

    Figure 1.3: Growth of data from 2000

    With this scale of growth in data, organizations started placing data at the forefront. The data growth was also accelerated by an increase in computing power. Cheaper computing resources enabled data ingestion, storage, and retrieval at scale. The high volume and variety of data (text, audio, and so on) enabled predictive models to make better decisions that had a higher business impact.

    Availability of machine learning algorithms: To churn this ocean of data, you need more efficient tools than a purely statistical approach. It is humanly impossible to make sense of this data by analyzing one dimension at a time. When we have just 10 variables to predict with, it is easy to come up with and guide the hypotheses. When there are thousands of variables, we need sophisticated techniques. Hence, some of the hypothesis-led approaches of the earlier era got replaced by large-scale data mining techniques. Interestingly, machine learning algorithms had themselves been around since the 1960s, and the data explosion of the 1990s and early 2000s provided the right canvas and the right use cases. Machine learning is a stochastic process that learns from the errors of one iteration of prediction and improves the prediction of the next iteration based on those errors. A large number of instances or data points works in its favor to increase prediction accuracy. Heuristic processes, in contrast, are static and cannot learn as dynamically as machine learning models do.
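    As a toy illustration of this error-driven, iterative learning, the sketch below fits a one-variable linear model by gradient descent; the data, learning rate, and number of steps are made up for illustration.

```python
# A minimal sketch: each iteration's errors drive the next iteration's update.
import numpy as np

rng = np.random.default_rng(0)
x = rng.random(1_000)
y = 3.0 * x + 1.0 + rng.normal(scale=0.1, size=1_000)  # true slope 3, intercept 1

w, b, lr = 0.0, 0.0, 0.5
for step in range(200):
    pred = w * x + b
    error = pred - y                    # errors of this iteration...
    w -= lr * 2 * (error * x).mean()    # ...drive the update for the next one
    b -= lr * 2 * error.mean()

print(round(w, 2), round(b, 2))  # approaches the true slope and intercept
```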

    The need to solve the right use cases: The earliest adopters of machine learning algorithms happened to be the "digital first" organizations like Google, Amazon, Netflix, and so on. Take the example of the search and recommendations problem in an eCommerce company: it is impossible to solve with uni-dimensional or bi-dimensional analysis and hypotheses. The Netflix challenge of 2006 (https://en.wikipedia.org/wiki/Netflix_Prize) is an example of the initial use cases that needed to be solved by machine learning models. The dataset itself was fairly huge even by today's standards: over 100 million ratings of 17,770 movies from 480,189 customers (https://www.thrillist.com/entertainment/nation/the-netflix-prize). Solving the Netflix challenge involved applying plenty of machine learning methods. In a lot of these use cases, a 1% improvement in accuracy could improve the top line by multiple millions of dollars. Once the dollar impact got established by the early adopters, machine learning was soon picked up for traditional use cases as well, which were earlier solved using statistical techniques.
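    The following is a minimal sketch of one family of techniques popularized by the Netflix Prize, matrix factorization trained by stochastic gradient descent on a tiny, made-up ratings set; it is only an illustration, not the prize-winning solution.

```python
# A minimal sketch: matrix factorization for recommendations on toy ratings.
import numpy as np

rng = np.random.default_rng(0)
# Observed ratings as (user, movie, rating) triples for 5 users and 5 movies.
ratings = [(0, 0, 5), (0, 1, 4), (1, 0, 4), (1, 2, 1), (2, 2, 5),
           (2, 3, 4), (3, 1, 2), (3, 3, 5), (4, 0, 3), (4, 4, 4)]

k = 2                                    # number of latent factors
U = rng.normal(scale=0.1, size=(5, k))   # user factors
V = rng.normal(scale=0.1, size=(5, k))   # movie factors
lr, reg = 0.05, 0.02

for epoch in range(200):
    for u, m, r in ratings:
        err = r - U[u] @ V[m]                   # prediction error for this rating
        U[u] += lr * (err * V[m] - reg * U[u])  # nudge factors to reduce the error
        V[m] += lr * (err * U[u] - reg * V[m])

# Predict an unseen rating: how user 0 might rate movie 2.
print(round(float(U[0] @ V[2]), 2))
```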

    "Data Science" became a separate department in many organizations starting in the late 2000s. Data science teams started building real-world applications that impacted the top-line and bottom-line of various organizations. Today, every industry uses data science in various ways—right from credit decisioning of banks to recommending products in an eCommerce website to inventory management in manufacturing industries. The uptake of data science solutions across all industries is phenomenal.

    Applications of data science

    Let us see the applications of data science across the customer lifecycle for the eCommerce industry. The customer lifecycle is divided into seven stages (https://www.sciencedirect.com/science/article/pii/S2212567115000313?via%3Dihub): Initiation, Acquisition, Regain, Maintenance, Expansion, Retention, and Exit. Following are some of the key use cases for the eCommerce industry in each of these phases. We have combined the maintenance and expansion phases into one. So, we have listed six phases here:

    Initiation

    This is a phase of seeking new users for the product. Digital ads optimization for new user acquisition, as well as ads budget optimization, could be examples of cases where data science is very useful in the eCommerce industry. The focus is more on awareness about the product or the company.

    Acquisition

    In this stage, the user is targeted through various channels and offers. The targeted customers are on-boarded with the right messaging and value proposition. In a transaction platform like eCommerce, this phase extends till the first transaction. Some organizations consider customers as "new users" till the first 30 days from on-boarding or the first transaction. Some of the relevant data science use cases here could be as follows:

    Optimizing targeting campaigns to provide a better return on investment.

    Once the user onboards, understanding the drivers of early drop-offs. Early drop-offs are customers who drop off after on-boarding without making their first transaction.

    Predicting early drop-offs using signals like the channel of acquisition, device details, temporal variables (time of day, day of the week, and so on), and early browsing behavior. At this stage, we have very little data about the user, and this poses some challenges for the machine learning algorithms used for such problems (see the sketch after this list).

    Understanding early indicators of loyal users. This understanding is critical for retaining users at a later stage.
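    A minimal sketch of such an early drop-off model is shown below; the feature names, the synthetic labels, and the choice of a simple logistic regression are all illustrative assumptions, not the book's recipe.

```python
# A minimal sketch: predicting early drop-offs from sign-up signals.
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import OneHotEncoder
from sklearn.pipeline import make_pipeline
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 5_000
df = pd.DataFrame({
    "channel": rng.choice(["paid_search", "social", "referral"], n),
    "device": rng.choice(["android", "ios", "web"], n),
    "signup_hour": rng.integers(0, 24, n),
    "pages_viewed_day1": rng.poisson(3, n),
})
# Illustrative label: users who barely browse on day one tend to drop off.
dropped_off = (df["pages_viewed_day1"] < 2) & (rng.random(n) < 0.8)

pre = ColumnTransformer(
    [("cat", OneHotEncoder(), ["channel", "device"])],
    remainder="passthrough",
)
model = make_pipeline(pre, LogisticRegression(max_iter=1000))
model.fit(df, dropped_off)
print(model.predict_proba(df.head(3))[:, 1].round(2))  # drop-off probabilities
```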

    Maintenance and expansion

    These two phases have a lot in common and are practically treated the same in eCommerce and similar industries. Hence, we will treat this as a single stage/phase of the customer lifecycle. This is an active customer management phase. It involves actively servicing customers, making sure the customer is active and transacting. Generally, the customers in this phase contribute to the profitability of the organization. Hence, growing customers involves providing the right user experience and also shaping some of their behavior toward purchasing profitable products. Key use cases of data science here could be as follows:

    Optimizing search and recommendations in eCommerce sites. This produces relevant results for the user and helps her to purchase faster. The search results today are powered by machine learning algorithms and are optimized based on user, transaction history, and query attributes rather than being restricted to the keyword the user searched.

    Identifying delivery or logistics-related problems so that they can be solved to improve customer satisfaction. Higher satisfaction drives better repeat behavior from the user. Examples here include predicting the "Expected time of arrival" of the product based on product attributes and delivery location, assigning the right logistics partner depending on serviceable areas, penalizing or closing suppliers based on poor serviceability in the past, and so on.

    Targeting existing customers with better offers so that they engage better and transact more.

    Summarizing product reviews so that users can easily make a purchase decision.

    Cross-sell and upsell using machine learning models.

    Retention

    Before customers decide to exit the product or service, companies try to retain them. Hence, predicting the customers who would potentially attrite and intervening with the right mechanism are the key use cases that data science solves in this phase. If a company chooses to use discount offers to retain existing customers, one of the key things that machine learning models need to solve is improving the efficiency of the campaigns. A campaign is said to have high efficiency when the users targeted by the campaign purchase much more when they are given the offer than when they are not.
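    As a small worked example of this efficiency measure, the sketch below computes the uplift of a hypothetical retention campaign: the purchase rate of targeted users minus that of a held-out control group. All the counts are made up.

```python
# A minimal sketch: campaign efficiency as uplift over a held-out control group.
targeted_purchases, targeted_users = 1_800, 10_000   # users who got the offer
control_purchases, control_users = 1_200, 10_000     # held out, no offer

targeted_rate = targeted_purchases / targeted_users
control_rate = control_purchases / control_users
uplift = targeted_rate - control_rate

print(f"targeted: {targeted_rate:.1%}, control: {control_rate:.1%}, "
      f"uplift: {uplift:.1%}")
```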

    Exit

    Growing and focusing on profitable "good" customers involves removing or penalizing bad customers. Customers are not always right. We need to be able to stop the transactions of the wrong customers to provide a better experience to the right customers. Take the eCommerce industry, for example, where fraudsters abuse the system and take undue advantage of returns and "cash on delivery" options. Penalizing fraudsters helps provide the right user experience to the right users. Some of the use cases here are listed as follows:

    Building machine learning models that identify fraudulent customers and fraudulent transactions.

    Identifying the right penalty for the right set of customers.

    Identifying "communities" or groups of customers who are related to each other and do fraudulent transactions that benefit each other.

    Identity fraud: customers provide a false identity to make use of offers meant for "first-time users".

    Identifying and removing fake reviews so that genuine users are not impacted by the fake reviews of the product.

    Regain

    Regain is the stage when the existing customers
