Skip to main content
SearchLoginLogin or Signup

Doing Data Science on the Shoulders of Giants: The Value of Open Source Software for the Data Science Community

Published onMay 31, 2020
Doing Data Science on the Shoulders of Giants: The Value of Open Source Software for the Data Science Community
·
history

You're viewing an older Release (#1) of this Pub.

  • This Release (#1) was created on Apr 30, 2020 ()
  • The latest Release (#6) was created on May 23, 2022 ().

Abstract

Open source software is ubiquitous throughout data science and enables the work of nearly every data scientist in some way or another. Open source projects, however, are disproportionately maintained by a small number of individuals, some of whom are institutionally supported but many of whom do this maintenance on a purely volunteer basis. The health of the data science ecosystem depends on the support of open source projects, on an individual and institutional level.

Keywords: open source software, data science community, software licenses, computing

Comments
1
Daniel S. Katz:

This article has a lot of good content, but I want to point out that it is missing the concept of Research Software Engineers (see https://society-rse.org and https://us-rse.org) as a job family and career path for non-faculty software developers in universities (and beyond).