2 minute read

During the week of June 19 to 23, 2023, the scikit-learn team held its first developers sprint since 2019! The sprint took place in Paris, France at the Dataiku office. The sprint event was an in-person event and had 32 participants.

The following scikit-learn team members joined the sprint:

  1. Adrin Jalali
  2. Arturo Amor Quiroz
  3. François Goupil (@francoisgoupil)
  4. Frank Charras (@fcharras)
  5. Gael Varoquaux (@GaelVaroquaux)
  6. Guillaume Lemaitre (@glemaitre)
  7. Jérémie du Boisberranger (@jeremiedbb)
  8. Joris Van den Bossche
  9. Julien Jerphanion (@jjerphan)
  10. Loïc Estève
  11. Maren Westermann
  12. Olivier Grisel (@ogrisel)
  13. Roman Yurchak
  14. Thomas Fan
  15. Tim Head (@betatim)

The following community members joined the sprint:

  1. Alexandre Landeau
  2. Alexandre Vigny
  3. Chaine San Buenaventura
  4. Camille Troillard
  5. Denis Engemann
  6. Franck Charras
  7. Harizo Rajaona
  8. Ines (intern at Dataiku)
  9. Jovan Stojanovic
  10. Leo Dreyfus-Schmidt
  11. Léo Grinsztajn
  12. Lilian Boulard
  13. Louis Fouquet
  14. Riccardo Cappuzzo
  15. Samuel Ronsin
  16. Vincent Maladière
  17. Yann Lechelle
group of people who participated in the sprint
scikit-learn Developer Sprint, Paris, June 2023; Photo credit: Copyright: Inria / Photo B. Fourrier, June 2023; (from left to right, back to front): Last Row: Denis Engemann, Riccardo Cappuzzo, François Goupil, Tim Head, Guillaume Lemaitre, Louis Fouquet, Jérémie du Boisberranger, Frank Charras, Léo Grinsztajn, Arturo Amor Quiroz. Middle Row: Thomas Fan, Lilian Boulard, Gaël Varoquaux, Ines, Jovan Stojanovic, Chaine San Buenaventura. First Row: Olivier Grisel, Harizo Rajaona, Vincent Maladière.

Sponsors

  • Dataiku provided the space and some of the food, as well as all of the coffee.
  • The scikit-learn consortium organized the sprint, paid for the lunch, the travel and accommodation expenses.

Topics covered at the sprint

  • PR #13649: Monotonic constraints for Tree-based models
  • Discussed the vision/future directions for the project. What is important to keep the project relevant in the future.
  • Should we share some points beyond the vision statement?
  • Thomas F will try and create a vision statement
  • Discussed what people are keeping an eye on with a two year time scale in mind in terms of technology and developments that are relevant.
  • Tim: keep improving our documentation (not just expanding it but also “gardening” to keep it readable)
  • Tim: increase active outreach and communication about new features/improvements and other changes. A lot of cool things in scikit-learn are virtually unknown to the wider public (e.g. Hist grad boosting being on par with lightgbm in terms of performance, …)

What is next?

We are discussing co-locating with OpenML in 2024 in Berlin, Germany to organize another developers’ sprint.

group of people who participated in the sprint
scikit-learn Developer Sprint, Paris, June 2023; Photo credit: Copyright Inria / Photo B. Fourrier, June 2023; (from left to right): Thomas Fan, Olivier Grisel