cover of episode scikit-learn & data science you own

scikit-learn & data science you own

2024/11/19
logo of podcast Practical AI: Machine Learning, Data Science, LLM

Practical AI: Machine Learning, Data Science, LLM

AI Deep Dive AI Chapters Transcript
People
G
Guillaume Lemaitre
Y
Yann Lechelle
Topics
Yann Lechelle作为Probabl公司的CEO,介绍了公司的起源、使命以及对scikit-learn的未来规划。Probabl公司源于法国研究中心Inria,致力于构建包括scikit-learn在内的一系列开源数据科学工具。公司以维护开源数据科学为使命,并将其写入公司章程。Yann Lechelle强调,Probabl的目标是成为一家类似Red Hat的开源公司,并最终通过IPO实现可持续发展,从而更好地服务于全球数据科学家,并促进数据科学领域的多样化发展。 Guillaume Lemaitre作为Probabl的开源工程师,详细阐述了scikit-learn的技术特点、应用场景以及社区贡献方式。scikit-learn是一个机器学习库,专注于预测建模,相比深度学习,它更加简单、成本更低,并适用于多种数据类型。Guillaume Lemaitre还介绍了Probabl正在开发的其他库,例如Scrub和Scribes,旨在改进数据持久化、数据库集成、可视化和模型评估等方面。他鼓励开发者积极参与scikit-learn社区,并提供了具体的参与途径和指导。

Deep Dive

Chapters
Yann Lechelle explains the origins of Probabl, a company spun off from a French research center, and its mission to steward open-source technologies like scikit-learn.
  • Probabl is a spin-off from a French research center called INRIA.
  • The company's mission is to build a suite of open-source technologies for data science, with scikit-learn at its core.
  • Scikit-learn is used by nearly every data scientist globally and has been downloaded over 1.5 billion times.

Shownotes Transcript

We are at GenAI saturation, so let’s talk about scikit-learn, a long time favorite for data scientists building classifiers, time series analyzers, dimensionality reducers, and more! Scikit-learn is deployed across industry and driving a significant portion of the “AI” that is actually in production. :probabl is a new kind of company that is stewarding this project along with a variety of other open source projects. Yann Lechelle and Guillaume Lemaitre share some of the vision behind the company and talk about the future of scikit-learn!

Join the discussion)

Changelog++) members save 9 minutes on this episode because they made the ads disappear. Join today!

Sponsors:

  • Timescale) – Purpose-built performance for AI Build RAG, search, and AI agents on the cloud and with PostgreSQL and purpose-built extensions for AI: pgvector, pgvectorscale, and pgai.

  • WorkOS) – A platform that gives developers a set of building blocks for quickly adding enterprise-ready features to their application. Add Single Sign-On (Okta, Azure, Google, Microsoft OAuth), sync users from any SCIM directory, HRIS integration, audit trails (SIEM), free magic link sign-in. WorkOS is designed for developers and offers a single, elegant interface that abstracts dozens of enterprise integrations. Learn more and get started at WorkOS.com)

  • Shopify) – Sign up for a $1/month trial period at shopify.com/practicalai)

Featuring:

Show Notes:

Something missing or broken? PRs welcome!)