In the tradition of previous Africa Data Science workshops, a summer school on machine learning and data science was held prior to the main workshop. This summer school targeted graduate students, researchers, and professionals working with large amounts of data or unique datasets.
The summer school focused on introductory and advanced lectures in data science and machine learning, as well as moderate to advanced practical and tutorial sessions where participants got hands-on experience wrangling and processing datasets and applying cutting-edge machine learning techniques to derive insights from data. Lectures were delivered by distinguished, world-renowned researchers and practitioners, including experts from Sheffield University, Amazon, Swansea University Medical School, Facebook, Pulse Lab Kampala, the AI and Data Science (AIR) Lab–Makerere University, ARM, and Dedan Kimathi University of Technology (DeKUT).
The school also involved end-to-end tutorial sessions led by professionals who guided participants through real data analytics problems, from data acquisition to data presentation.
Lecture Schedule
Pre-workshop
Software Installation
To ensure participants were well-prepared, they were required to install the prerequisite software and test it beforehand to confirm it was working on their computers. Although the summer school venue provided some computers with the software preinstalled, participants were advised to bring their own laptops with the software ready.
Anaconda
All the required software had been prepackaged in a bundle called Anaconda. Participants downloaded the appropriate versions for their laptop operating systems and architectures from the Anaconda website, specifically the Python 3.6 version. Detailed installation instructions were provided alongside the download links on the website.
Preliminary Exercises
To ensure everything worked correctly and to get participants familiar with the tools, they were required to download a Jupyter notebook file (via “right click and save as”) and complete the exercises provided. To access the notebook, participants launched Jupyter Notebook following the provided instructions.
Troubleshooting and Comments
Participants used the comment section to ask questions that had not already been addressed and to help their peers by providing answers whenever possible.
Summer School Day 1
The first day of the summer school introduced participants to Jupyter Notebook and provided an overview of using Python for data analysis. The day also introduced the machine learning technique of classification and included lab practicals exploring these methods, along with an introduction to the fundamentals of the Internet of Things (IoT).
Time |
Activity |
Material |
08:00-08:30 |
Arrival and Registration |
|
08:30-09:00 |
Opening Remarks |
|
09:00-10:00 |
Introduction to Machine Learning |
|
10:00-10:30 |
Tea Break |
|
10:30-11:30 |
Python, Pandas and Jupyter Tutorial - with Practical Session |
|
11:30-12:30 |
Data Visualisation with Practical Session |
|
12:30-13:30 |
Lunch |
|
13:30-15:30 |
Fundamentals of IoT - with Practical Session |
|
15:30-16:00 |
Tea Break |
|
16:00-17:30 |
ML at the edge |
|
17:30-19:00 |
Labs |
Summer School Day 2
The second day featured two parallel tracks focusing on deep learning methods, mechanism design techniques, and the DisARM Project. Participants explored advanced neural network architectures, learned how mechanism design principles could be applied to real-world data problems, and examined the DisARM Project as a case study in the practical use of data science for societal impact.
Time |
Activity |
Material |
09:00-10:30 |
Introduction to Deep Learning - with Practical Session on Computer Vision |
|
10:30-11:00 |
Tea Break |
|
11:00-12:00 |
Image Representation and fine-grained recognition - with practical session |
|
12:00-13:00 |
Lunch Break |
|
13:00-14:00 |
Mechanism Design |
|
14:00-15:30 |
Natural Language Processing |
|
15:30-16:00 |
Tea Break |
|
16:00-17:00 |
Cyber Security |
|
17:00-18:30 |
Labs |
Summer School Day 3
Time |
Activity |
Material |
09:00-10:30 |
Introduction to Non parametric modelling with Gaussian Processes |
|
10:30-11:00 |
Tea Break |
|
11:00-12:30 |
Introduction to Reinforcement Learning |
|
11:00-12:00 |
Lunch Break |
|
12:00-13:30 |
Data for good presentation |
|
13:30-14:30 |
Electrical grid mapping tutorial |
|
14:30-15:00 |
Tea Break |
|
15:00-16:30 |
Research Clinic (Q&A for researchers), Feedback |