57 min listen
SE4ML - Software Engineering for Machine Learning - Nadia Nahar
From DataTalks.Club
Length:
54 minutes
Released:
Mar 24, 2023
Format:
Podcast episode
Description
We talked about:
Nadia’s background
Academic research in software engineering
Design patterns
Software engineering for ML systems
Problems that people in industry have with software engineering and ML
Communication issues and setting requirements
Artifact research in open source products
Product vs model
Nadia’s open source product dataset
Failure points in machine learning projects
Finding solutions to issues using Nadia’s dataset and experience
The problem of siloing data scientists and other structure issues
The importance of documentation and checklists
Responsible AI
How data scientists and software engineers can work in an Agile way
Links:
Model Card: https://arxiv.org/abs/1810.03993
Datasheets: https://arxiv.org/abs/1803.09010
Factsheets: https://arxiv.org/abs/1808.07261
Research Paper: https://www.cs.cmu.edu/~ckaestne/pdf/icse22_seai.pdf
Arxiv version: https://arxiv.org/pdf/2110.
Free data engineering course: https://github.com/DataTalksClub/data-engineering-zoomcamp
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html