About Data Umbrella
Data Umbrella is a non-profit global community for underrepresented persons in data science. We organize online speaker series on data science, open source libraries, and open source sprints (hackathons). We also curate resources for the community on data science and inclusive practices. All levels are welcome, beginners and experts. Our Code of Conduct applies to all of our spaces.
Announcements
Call for Speakers
We are looking for speakers on the following topics. If you or someone you know can speak on these topics, please email us: info@dataumbrella.org
Open Source Literacy (history, challenges, education or other related topics)
How to Debug in Python
Community Blogs
Check out our blogs on tutorials, scikit-learn reports and more.
March is Women’s History Month
We are having a special series of events, one every week (yay!) for the month of March. Watch our 1-minute video on what is coming up and Register on Meetup for the events. All these events are free and open to all.
Upcoming Events
Geospatial Data and Maps with Python
Maps are powerful tools that we use every day, and Python is well-equipped to handle geospatial data, perform geospatial analysis, and create insightful maps. In this talk, we'll go over some common Python mapping libraries, considerations when working with geospatial data, and use cases.
Creating a Python Plotly Dashboard
In this workshop we will use pandas, plotly and Dash to create a dashboard that explores changes in the average housing price in various provinces in Canada for the last 5 years. We will start by generating interactive visualizations using plotly and turn exploratory code into reusable functions. We will then work together to bring our functions into a script. Participants will be introduced into dashboarding, layout options, and will work together to generate and test a local dashboard.
Arrays, Linked Lists and Graphs
It is easy and convenient to treat all data as an array. They are the basis of much of Python and a simple data structure to deal with. But there are times that arrays fail us, such as on element insertion and the pre-allocation of memory. There is power and efficiency in linking data through data structures such as linked lists. We will see how using linked lists can reduce Big-O complexity and solve a variety of problems. Then we will explore how graph data structures take this a step further and open up a world of new options and opportunities for efficient computation.
Setting up a Personal Website with Jekyll & GitHub Pages
Jekyll is a static site generator that can be used to create a custom website simply, efficiently, and for free of charge. In this session, you will learn to set up a Jekyll-based website and blog, install a basic theme, add customization, and host it via GitHub Pages.
Introduction to Holoviz
Learn how to use HoloViz tools to visualize tabular and array data in Python.
After attending this, you should be able to:
Quickly inspect your data using `hvplot`.
Aggregate large data to the pixel level using `datashader`.
Produce dashboards that allow others to easily view your outputs using `panel`.
Recent Events: Subscribe to Data Umbrella YouTube
Oriol Abril Pla: Contributing to PyMC Documentation
The best software libraries of the world are severely restrictive and inaccessible if they are not properly documented, resulting in few or no users. PyMC has an extensive documentation corpus with examples of all kinds which has been built since its initial release back in 2015! In this webinar we’ll explain how to contribute to PyMC documentation to update it to our new infrastructure built during the last 8 months thanks to Outreachy and GSoD (Google Season of Docs). You'll learn how to use MyST (a superset of Markdown) to write technical documentation, using multiple types of cross-references, adding beautiful styling elements, figures with captions... all while using executable Jupyter notebooks!
Ricardo Vieira: Contributing to PyMC
In this talk, Ricardo: (a) Illustrates the Open Source collaborator experience; (b) Shares his biased views; (c) Invites you to give it a try. A viewer described this presentation as: "Indeed this talk is inspiring and authentic."
Reshama Shaikh: An Example Pull Request to PyMC
In this video, Reshama covers how to set up a virtual environment, install the dependencies, build PyMC from source, make a small documentation change, and submit a pull request.
Featured Resources
Video Playlists
Highlighted Resource
Data Umbrella Team
In this section, we share updates from our team.
Blog
Blog with step-by-step example: Example of `check_scalar` Function in a Contribution to scikit-learn.
Video
Supporting Data Umbrella
Data Umbrella is now on Benevity. If your company uses Benevity, which is a donation platform for employer-matching contributions to non-profits, please consider making a contribution to Data Umbrella. Note: this link is active for registered users of Benevity: Data Umbrella on Benevity
For users not on Benevity, donations can be made directly to the Open Collective.
Data Umbrella Resources
Visit our blog site: blog.dataumbrella.org, and see articles written by our community members on their experience in recent sprints.
We have a Job Board. You can: post jobs (for free), search jobs, subscribe to a weekly update to see postings.
Our Data Umbrella YouTube is growing! Subscribe to our channel to receive notifications of when our event videos are posted.
Accessibility Corner
Accessibility Update: Closed Captioning
Our webinars have closed captioning available! This feature makes our live events more accessible to those with hearing needs and for folks in general who like to see the transcript live during presentation to fully process information.
Connect with Us
dataumbrella.org (*resources*)
Meetup: Data Umbrella & Data Umbrella Africa (*upcoming events*)
YouTube (*past recorded talks*)
Twitter: @DataUmbrella