Publication Archive
Presentations
2025
Grace, Njogu
Empowering Africa’s Health: AI-Driven Solutions for Universal Access Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{GraceNjogu2025,
title = {Empowering Africa’s Health: AI-Driven Solutions for Universal Access},
author = {Njogu Grace},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Njogu_Grace.pdf?generation=1755026546227930&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {In many parts of Africa, access to quality healthcare remains a major challenge due to infrastructural gaps, workforce shortages, and affordability issues. mHealth4everyone is a mobile health initiative that leverages artificial intelligence (AI) to bridge this gap by providing accessible, low-cost, and data-driven healthcare solutions via mobile phones. Our AI-powered platform delivers early diagnostic support, health education, telemedicine, and predictive analytics to underserved communities. This poster presents our methodology, results from pilot implementations, and how our scalable model contributes to achieving Universal Health Coverage (UHC) and SDG 3 in Africa.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Mutembesa, Daniel
MSense: Dynamic Route Planning and Adaptive Incentive Mechanisms for Mobile Sensors Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{MutembesaDaniel2025,
title = {MSense: Dynamic Route Planning and Adaptive Incentive Mechanisms for Mobile Sensors},
author = {Daniel Mutembesa},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Daniel_Mutembesa.pdf?generation=1755026600833055&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {In dynamic route planning for mobile sensors used in air quality monitoring, incorporating heterogeneity among sensors is crucial for optimizing coverage, reducing costs, and enhancing data quality. Heterogeneity considers different sensor capabilities, priorities, roles, and requirements, making the model more realistic and effective. This paper presents an optimal dynamic route planning model designed for mobile sensors, incorporating diversity guarantees for data collection and accommodating sensor heterogeneity under resource-constrained conditions. Our approach introduces heterogeneity by accounting for varying capabilities, priorities, roles, and requirements of different mobile sensors. Specifically, we model the sensors’ different speeds, coverage radii, and battery lives, allowing for a comprehensive optimization of routes based on these heterogeneous characteristics. The algorithm also integrates priority weights to reflect the differing importance of nodes and areas for each sensor, and includes incentive structures to encourage efficient coverage. Additionally, we introduce constraints to ensure collocation requirements and diversity thresholds are met, promoting effective coordination and reducing redundancy among sensors. The objective function maximizes overall coverage and efficiency while minimizing travel time and overlap. This dynamic route planning model is crucial for applications where multiple mobile sensors operate under diverse conditions, providing a robust solution for optimizing sensor deployment and resource utilization. Simulation experiments on an urban road network graph with two mobile sensors deployed across various source-destination pairs revealed key differences between three routing models. Model 1, though straightforward and efficient, failed to account for sensor path redundancies, resulting in less effective coverage. Model 2 addressed this by penalizing overlapping routes, improving path diversity at the cost of added complexity. Model 3, the most complex, considered both path diversity and sensor heterogeneity, offering the best performance with minimal costs across all scenarios. While Model 1 is suitable for small networks, Model 2 balances efficiency and complexity for moderate networks, and Model 3 is optimal for large-scale, critical applications. Future work should focus on incorporating real-time data to enhance adaptability in dynamic environments.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
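The overlap penalty that distinguishes "Model 2" in the abstract above can be illustrated with a toy scoring function: distinct nodes covered, minus a penalty on nodes visited redundantly across sensors. This is a hypothetical sketch, not the authors' actual objective; `route_score` and the penalty weight `lam` are illustrative names.

```python
from collections import Counter

def route_score(routes, lam=0.5):
    """Score a set of sensor routes: distinct nodes covered, minus a
    penalty on nodes visited more than once across sensors."""
    counts = Counter(node for route in routes for node in route)
    coverage = len(counts)                          # distinct nodes visited
    overlap = sum(c - 1 for c in counts.values())   # redundant visits
    return coverage - lam * overlap

# Two sensors sharing node 3: coverage 4, one redundant visit.
score = route_score([[1, 2, 3], [3, 4]])  # 4 - 0.5 * 1 = 3.5
```

Raising `lam` trades raw coverage for path diversity, which is the tension the simulation results describe.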
Serrhini, Mohamed
Enhancing XSS Detection with LLM-Generated Obfuscation and Graph Neural Networks Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{serrhinimohamed2025b,
title = {Enhancing XSS Detection with LLM-Generated Obfuscation and Graph Neural Networks},
author = {Mohamed Serrhini},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/mohamed_serrhini.pdf?generation=1755026580657340&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Cross-Site Scripting (XSS) remains one of the most persistent and dangerous web vulnerabilities, allowing attackers to inject and execute malicious JavaScript within trusted web pages.
Despite years of research, detection remains challenging due to:
Advanced Obfuscation Techniques: Attackers use encoding, control flow distortion, and variable renaming to hide malicious behavior.
Limitations of Traditional Detection: Signature-based and token-based methods fail to generalize against unseen or obfuscated payloads.
Lack of Diverse Training Data: Models trained on simple or synthetic data often fail in real-world scenarios.
To address these challenges, we propose a robust detection pipeline that combines the code generation capabilities of Large Language Models (LLMs) with the structural learning power of Graph Neural Networks (GNNs).},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Mwaibale, Upendo
Enhancing Detection of Common Bean Diseases Via Fast Gradient Sign Method – Trained Vision Transformers Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{MwaibaleUpendo2025,
title = {Enhancing Detection of Common Bean Diseases Via Fast Gradient Sign Method – Trained Vision Transformers},
author = {Upendo Mwaibale},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Upendo_Mwaibale.pdf?generation=1755026605432672&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Common bean farming is vital to food security in Tanzania, but it is often threatened by diseases such as bean rust and anthracnose. Traditional detection methods are limited in accuracy and speed, especially in rural settings. This study developed a deep learning model enhanced with adversarial training to improve early disease detection under real-world conditions. A farm-collected dataset of 59,072 images from Njombe, Iringa, and Mbeya was expanded to 100,000 through augmentation across four classes: healthy, rust, anthracnose, and other (images not related to common bean leaves). Three models were evaluated: the standard ViT, an adversarially trained CNN, and an adversarially trained ViT, with the latter achieving an accuracy of 99.4%. The model, robust to image noise via FGSM-based training, was deployed in a bilingual mobile app validated by farmers and extension officers. This work presents a practical and scalable tool to support smallholder farmers in managing bean diseases.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
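The FGSM-based training mentioned in the abstract above rests on the fast gradient sign method, x_adv = x + eps * sign(grad_x L). A minimal sketch on a toy logistic-regression classifier (the poster's ViT and CNN models are not reproduced here; all names and values are illustrative):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fgsm(x, y, w, b, eps):
    """One FGSM step against a logistic model p = sigmoid(w @ x + b):
    x_adv = x + eps * sign(d(cross-entropy)/dx)."""
    p = sigmoid(w @ x + b)
    grad_x = (p - y) * w  # gradient of the cross-entropy loss w.r.t. x
    return x + eps * np.sign(grad_x)

# Toy example: a correctly classified positive input.
w, b = np.array([2.0, -1.0]), 0.0
x, y = np.array([0.5, 0.5]), 1.0
x_adv = fgsm(x, y, w, b, eps=0.1)  # pushed toward the decision boundary
```

Adversarial training then mixes such perturbed inputs into the training batches, which is what gives the reported robustness to image noise.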
Aliyu, Mahi Aminu
Towards Robust Generalization in African AI: Causal Inference and Domain Shift Mitigation Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{AliyuMahiAminu2025,
title = {Towards Robust Generalization in African AI: Causal Inference and Domain Shift Mitigation},
author = {Mahi Aminu Aliyu},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Mahi%20Aminu_Aliyu.pdf?generation=1755026595157904&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {This study explores the effectiveness of causal learning for domain generalization in low-resource African NLP. We introduce Afri-SemEval, a multilingual dataset translated into 17 African languages, and evaluate two causal paradigms: (i) representation learning via the DINER framework and (ii) causal data augmentation using GPT-4o-mini. Experiments on transformer models (XLM-R, Afro-XLMR-Large, Afro-XLMR-Large-76L) show that causal models converge faster and achieve comparable accuracy with fewer training steps. While counterfactual models demonstrate efficiency, their out-of-distribution (OOD) performance is mixed, with notable gains in languages like Yoruba and Igbo but limited generalization in others such as Amharic and Hausa. Results underscore the promise and limitations of causal approaches in low-resource African sentiment classification and highlight the importance of data quality and language-specific alignment.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
BADRI, Nabil
A Multilingual and Multidialect Deep Learning Approach for Hate and Abusive Speech Classification Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{BADRINabil2025,
title = {A Multilingual and Multidialect Deep Learning Approach for Hate and Abusive Speech Classification},
author = {Nabil BADRI},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Nabil_BADRI.pdf?generation=1755026576229896&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {This poster presents a deep learning approach for the classification of hate and abusive speech in a multilingual and multidialectal context. Our work addresses the challenges posed by linguistic diversity, especially in low-resource languages and dialects, by leveraging transformer-based models and transfer learning techniques. We evaluate our system across multiple datasets containing hate speech in different languages and dialects, demonstrating promising results in cross-lingual generalization and dialect robustness. This research contributes to the development of inclusive, language-aware AI systems capable of supporting safer online communication spaces.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Abdelkader, Mahmoud
Multi-Objective Route Optimization Using Graph Neural Networks Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{AbdelkaderMahmoud2025,
title = {Multi-Objective Route Optimization Using Graph Neural Networks},
author = {Mahmoud Abdelkader},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Mahmoud_Abdelkader.pdf?generation=1755026547629289&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Naira, Abdou Mohamed
DVoice: Open-Source Voice AI for Africa and Beyond Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{NairaAbdouMohamed2025,
title = {DVoice: Open-Source Voice AI for Africa and Beyond},
author = {Abdou Mohamed Naira},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Abdou%20Mohamed_Naira.pdf?generation=1755026553163966&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {DVoice is an open-source, community-driven platform dedicated to addressing the underrepresentation of over 2,000 African languages in the digital world. By collecting, annotating, and sharing voice data, DVoice empowers local communities to contribute to speech technology development while preserving cultural heritage. The platform supports researchers and developers with free datasets, APIs, and models tailored to African linguistic diversity, including speech-to-text (STT), and text-to-speech (TTS) systems. With milestones like deploying 50+ models for 25+ languages by 2030, DVoice aims to foster inclusive innovation, preserve endangered oral traditions, and position Africa as a leader in ethical AI. The initiative seeks partnerships, funding, and community engagement to scale its impact globally.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Awak, Mbuotidem
EfikNLP: Parallel Corpora and Machine Translation System for Digital Inclusion Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{AwakMbuotidem2025,
title = {EfikNLP: Parallel Corpora and Machine Translation System for Digital Inclusion},
author = {Mbuotidem Awak},
url = {https://drive.google.com/file/d/1npGCtIW9dINzvH9vKoPMkUUzWc2zrdgh/view?usp=sharing},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Low-resource languages serve as invaluable repositories of human history, preserving cultural and intellectual diversity. Yet, they remain vastly underrepresented in the development of natural language processing (NLP) tools, particularly in machine translation (MT). Over the past decade, major advancements in MT have significantly improved translation for high-resource languages. However, these advances have not been equitably distributed. Many smaller indigenous languages, such as Efik, remain overlooked in both research and real-world applications, leaving their speakers excluded from the growing benefits of language technologies. This study seeks to change that by developing a comprehensive neural machine translation (NMT) system for the Efik-English language pair. Collaborating with native speakers, we built a culturally grounded parallel corpus of 1,040 sentence pairs. By fine-tuning the M2M100 model, we achieved a BLEU score of 8.437, indicating potential for improvement with more data.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
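The BLEU score reported in the abstract above (8.437) is the standard machine-translation metric: a brevity penalty times the geometric mean of clipped n-gram precisions. A minimal unsmoothed sentence-level sketch (real evaluations typically use a smoothed corpus-level implementation such as sacreBLEU, so this will not reproduce the reported number):

```python
import math
from collections import Counter

def ngrams(tokens, n):
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def bleu(ref, hyp, max_n=4):
    """Sentence-level BLEU: brevity penalty times the geometric mean of
    clipped n-gram precisions (no smoothing: any zero precision gives 0)."""
    r, h = ref.split(), hyp.split()
    log_p = 0.0
    for n in range(1, max_n + 1):
        hyp_counts, ref_counts = ngrams(h, n), ngrams(r, n)
        clipped = sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
        total = max(sum(hyp_counts.values()), 1)
        if clipped == 0:
            return 0.0
        log_p += math.log(clipped / total) / max_n
    bp = 1.0 if len(h) > len(r) else math.exp(1 - len(r) / max(len(h), 1))
    return bp * math.exp(log_p)
```

A perfect match scores 1.0; a too-short hypothesis is discounted by the brevity penalty even when every n-gram is correct.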
Nakiranda, Proscovia
Detection of Stationary Pollution Sources and Profiling Using Satellite Imagery and Machine Learning Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{NakirandaProscovia2025,
title = {Detection of Stationary Pollution Sources and Profiling Using Satellite Imagery and Machine Learning},
author = {Proscovia Nakiranda},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Proscovia_Nakiranda.pdf?generation=1755026572247295&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Air pollution from stationary sources such as factories significantly impacts cities. Detecting and understanding these sources is challenging due to undocumented, emerging pollution sources and a lack of consolidated data. This research aims to create a comprehensive pollution-source dataset, supplemented with satellite data. The dataset is specifically developed for machine learning tasks such as detecting and profiling stationary point sources in satellite imagery. We trained a U-Net model, achieving 80% accuracy on the manually labelled dataset, to automatically identify potential stationary sources of pollution. To facilitate integration with other datasets, a post-processing step converts the model's predictions into a geospatial format (GeoJSON) with location information. By combining data on stationary sources of pollution with ground-based monitoring, we enable stakeholders to gain a deeper understanding of a city's air pollution profile. To support this, we have developed a visualization tool that makes the information easily accessible and actionable for decision-makers.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
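The GeoJSON post-processing step described in the abstract above can be sketched as follows. This is a hypothetical helper, not the authors' pipeline: it exports only the bounding box of a binary detection mask as a GeoJSON Polygon feature, and the georeferencing convention (a top-left `origin` with latitude decreasing down the image, one-unit `pixel_size`) is an assumption.

```python
import numpy as np

def mask_bbox_to_geojson(mask, origin=(0.0, 0.0), pixel_size=1.0):
    """Export the bounding box of a binary detection mask as a GeoJSON
    Polygon feature. Assumes `origin` is the top-left corner and that
    latitude decreases with row index (an illustrative convention)."""
    ys, xs = np.nonzero(mask)
    x0, x1 = xs.min(), xs.max() + 1
    y0, y1 = ys.min(), ys.max() + 1
    lon0 = float(origin[0] + x0 * pixel_size)
    lon1 = float(origin[0] + x1 * pixel_size)
    lat0 = float(origin[1] - y0 * pixel_size)
    lat1 = float(origin[1] - y1 * pixel_size)
    ring = [[lon0, lat0], [lon1, lat0], [lon1, lat1], [lon0, lat1], [lon0, lat0]]
    return {"type": "Feature",
            "geometry": {"type": "Polygon", "coordinates": [ring]},
            "properties": {}}
```

A production pipeline would trace actual mask contours and use the raster's affine transform rather than a fixed origin, but the Feature/Polygon structure is the same.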
Mohamed, Benayad
Generative AI for Urban Sustainability: Enhanced Basemaps from High-Resolution Satellite Imagery Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{MohamedBenayad2025,
title = {Generative AI for Urban Sustainability: Enhanced Basemaps from High-Resolution Satellite Imagery},
author = {Benayad Mohamed},
url = {https://docs.google.com/presentation/d/182r9Ch2G9f9dwHIn3nKV8hhkQYKMAy48/edit?usp=drive_link&ouid=112038764170136351774&rtpof=true&sd=true},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {The growing adoption of electric vehicles (EVs) demands the deployment of robust and well-planned charging infrastructure, particularly in emerging regions like Africa, where urban development is rapidly evolving, and spatial data is often underutilized. This study introduces a deep learning-based geospatial analysis framework to support the strategic planning of EV charging stations using high-resolution RGB satellite imagery and land cover/land use (LCLU) data. Our approach utilizes semantic segmentation models to extract detailed urban features from RGB imagery, enabling the identification of suitable charging station sites based on real-world land use characteristics. We constructed a land cover dataset comprising 13 urban classes: roads, buildings, vegetation, trees, bare land, solar PV, sidewalks, tracks, playgrounds, cars, grass, brown soil, and water. Multiple state-of-the-art deep learning models were evaluated, including SegFormer, UNet, PSPNet, and DeepLabV3. Among them, SegFormer achieved the highest performance with 97.3% accuracy, 0.938 F1-score, and 0.898 Intersection over Union (IoU), clearly outperforming the other models. The trained model was deployed on high-resolution satellite imagery of a Moroccan city to generate precise land cover maps. These maps were then analyzed to detect optimal locations for EV charging infrastructure, considering accessibility, available space, and urban activity patterns. The prototype serves as a proof of concept for a scalable, automated planning tool tailored to the African context. By combining remote sensing, land use analysis, and deep learning, this work provides a reproducible and adaptable method for improving EV infrastructure planning across African cities, contributing to smarter, data-driven urban development and more sustainable mobility transitions.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
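The Intersection over Union reported for SegFormer in the abstract above (0.898) is, per mask, the intersection of prediction and ground truth divided by their union. A minimal sketch for binary masks (multi-class evaluation would average this over the 13 classes):

```python
import numpy as np

def iou(pred, target):
    """Intersection over Union for binary segmentation masks."""
    inter = np.logical_and(pred, target).sum()
    union = np.logical_or(pred, target).sum()
    return inter / union if union else 1.0  # two empty masks agree perfectly
```

Unlike pixel accuracy, IoU is insensitive to the large background regions common in land cover maps, which is why both are usually reported together.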
Mwaba, Natasha
Empowering communities in Zambia through innovation Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{MwabaNatasha2025,
title = {Empowering communities in Zambia through innovation},
author = {Natasha Mwaba},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Natasha_Mwaba.pdf?generation=1755026616510062&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Artificial Intelligence (AI) is emerging as a powerful tool for social good in Zambia, offering innovative solutions to some of the country’s pressing development challenges. In healthcare, AI-driven mobile applications like image-based diagnostic tools are helping rural clinics detect diseases such as malaria and tuberculosis more accurately and efficiently. In agriculture, AI-powered platforms provide small-scale farmers with real-time weather forecasts, crop disease alerts, and personalized farming advice, improving food security and productivity. In education, AI chatbots and language processing tools are being used to deliver interactive learning content in local languages, bridging the digital divide for students in remote areas. Additionally, AI is supporting data analysis for disaster response and public policy planning, enabling quicker, evidence-based decisions. These applications highlight how AI, when guided by ethical standards and inclusive policies, can accelerate Zambia’s progress toward sustainable development and improved quality of life for all.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Weya, Melissah
Building African Stereotypes Datasets For Responsible AI Evaluation Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{WeyaMelissah2025,
title = {Building African Stereotypes Datasets For Responsible AI Evaluation},
author = {Melissah Weya},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Melissah_Weya.pdf?generation=1755026589852217&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {AI risk assessments often overlook local socio-cultural perspectives, especially in underrepresented African regions (1–2% of NLP data). This results in biased AI outputs, reinforcing harmful stereotypes with real-world consequences in health, finance, and education, like misdiagnoses or loan denials. To bridge this critical gap, we are introducing an open-source, socio-culturally grounded extension of existing stereotype evaluation resources. Building on prior work (Dev et al., 2023; Davani et al., 2025), we surveyed participants in Senegal, Kenya, and Nigeria to capture top-of-mind societal associations and cultural stereotypes. While this method enabled authentic responses, it occasionally yielded superficial or overtly biased data, revealing both the richness and challenges of the format. For this pilot, we collected 1164 stereotypes from 107 respondents across the three countries, classifying responses by gender, religion, and ethnicity. We will continuously refine and share this dataset, incorporating local languages and voice-based responses for more diverse and culturally relevant data. We'll leverage on-the-ground surveyors and community collaborations for scalable data collection, allowing us to rapidly respond to evolving biases and expand to new countries. Our initial evaluation assesses language models' tendency to reflect these societal biases using Stereo Anti-Stereo (S-AS) pairs (Nangia et al., 2020). We will also explore complementary methods like the NLI-based framework (Dev et al., 2019) for stereotypical inferences. Beyond identification, future efforts will classify the types and degrees of harm these biases inflict across various sectors, aiming for a more nuanced and inclusive understanding of cultural stereotypes in Africa.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Simon, Nebiyu
Large Vocabulary Read-Mode Speech Corpora for Low-Resourced Ometo Languages: Gamo, Gofa, Dawuro and Wolaita Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{SimonNebiyu2025,
title = {Large Vocabulary Read-Mode Speech Corpora for Low-Resourced Ometo Languages: Gamo, Gofa, Dawuro and Wolaita},
author = {Nebiyu Simon},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Nebiyu_Simon.pdf?generation=1755026557089972&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Speech is a fundamental mode of human communication and a key interface for interacting with machines. Automatic Speech Recognition (ASR) enables the conversion of spoken language into text, but developing ASR systems requires large, high-quality speech corpora. For low-resource languages like the Ometo languages (Gamo, Gofa, Dawuro, and Wolaita), such datasets are scarce due to the high cost of data collection and limited technological resources. In this study, we developed a 24.35-hour multilingual speech corpus with corresponding transcriptions for four Ometo languages. Using deep learning techniques, we built baseline ASR systems for each language, achieving word error rates (WER) of 72.00% (Gamo), 57.94% (Gofa), 62.22% (Dawuro), and 64.71% (Wolaita). The results confirm the usability of the corpus and its potential for further research and development of ASR systems for underrepresented languages.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
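The word error rates (WER) reported in the abstract above are the standard ASR metric: Levenshtein edit distance between the word sequences, divided by the number of reference words. A minimal sketch:

```python
def wer(ref, hyp):
    """Word error rate: Levenshtein distance between the word sequences,
    divided by the number of reference words."""
    r, h = ref.split(), hyp.split()
    # d[i][j] = minimum edits turning the first i ref words into the first j hyp words
    d = [[0] * (len(h) + 1) for _ in range(len(r) + 1)]
    for i in range(len(r) + 1):
        d[i][0] = i
    for j in range(len(h) + 1):
        d[0][j] = j
    for i in range(1, len(r) + 1):
        for j in range(1, len(h) + 1):
            sub = d[i - 1][j - 1] + (r[i - 1] != h[j - 1])        # substitution (or match)
            d[i][j] = min(sub, d[i - 1][j] + 1, d[i][j - 1] + 1)  # deletion, insertion
    return d[len(r)][len(h)] / len(r)
```

Because insertions also count as errors, WER can exceed 100%, which is worth keeping in mind when reading scores in the 57-72% range.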
NINYIM, Astride Melvin FOKAM
Federated Learning for Respiratory Disease Forecasting in Africa Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{FOKAMNINYIMAstrideMelvin2025,
title = {Federated Learning for Respiratory Disease Forecasting in Africa},
author = {Astride Melvin FOKAM NINYIM},
url = {https://drive.google.com/drive/folders/1udB7WMF5Te6Rn36vaX3Sj3fjjwVTvqnY?usp=drive_link},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Respiratory diseases, including COPD, asthma, and pneumonia, claim over 6 million lives annually in Africa, representing 80% of the continent's infectious disease burden and causing $450 billion in losses from premature deaths plus $800 billion in productivity losses yearly. Driven by climate change and air pollution, the increasing frequency of extreme weather events underscores the urgent need for accurate outbreak prediction systems.
This study proposes a federated learning framework combined with deep learning to forecast climate-driven respiratory disease outbreaks across Africa. The approach addresses key challenges including data availability and quality issues, spatial and temporal variability in disease patterns, real-time prediction requirements, and forecasting uncertainty. The framework enables privacy-preserving model training on decentralized health and environmental data, addressing data sensitivity and infrastructure constraints.
The methodology integrates environmental data from ERA5, NASA POWER, and OpenAQ with health data from Synthea, OpenMRS, and WHO sources. A predictive LSTM model incorporates environmental variables (air quality, pollution) and health data (asthma, COPD diagnoses) using federated learning. Results demonstrate strong correlations between environmental factors and health outcomes, with PM2.5 showing 0.87 correlation and temperature 0.74 correlation with respiratory cases. Model optimization revealed that a 20-day input window maximizes early warning performance, achieving consistent predictive accuracy across both urban and rural settings with 1-week lead time.
This scalable, privacy-preserving solution supports multiple UN Sustainable Development Goals (SDGs 3, 9, 11, 13, and 17) and provides a foundation for timely interventions to reduce respiratory disease burden across Africa.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
KABORE, Nematou
Virtual reality and anatomy learning with voice recognition Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{KABORENematou2025,
title = {Virtual reality and anatomy learning with voice recognition},
author = {Nematou KABORE},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Nematou_KABORE.pdf?generation=1755026576451325&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {This project aims to revolutionize the teaching of anatomy by developing a virtual reality (VR) educational application with voice recognition. It is designed for use with VR headsets such as the Oculus Quest 2 and the HP Reverb G2. The application incorporates interactive 3D anatomical models. A key dimension is the integration of AI voice recognition to navigate and interact with the anatomical models, allowing students to formulate voice commands to examine different body structures. The aim is to assess the impact of this VR technology on anatomy learning in terms of engagement and understanding. To do this, we used Unity, a powerful and flexible game engine, to create immersive 3D environments and intuitive interactions. Voice recognition is integrated using Wit.AI, allowing users to make voice requests to view anatomical models. An interactive avatar, developed with ConvAI, is also included to provide detailed explanations of different body parts, making learning more engaging and interactive.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Lucas, Mgasa
Video Understanding Using LLMs for Intelligent CCTV Surveillance Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{LucasMgasa2025,
title = {Video Understanding Using LLMs for Intelligent CCTV Surveillance},
author = {Mgasa Lucas},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Mgasa_Lucas.pdf?generation=1755026609502337&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {The rapid expansion of surveillance systems has led to an increased demand
for intelligent video analysis. This research focuses on developing an AI-driven video
understanding system for CCTV cameras, leveraging machine learning (ML) and large
language models (LLMs) to enhance real-time security, anomaly detection, and
behavioral analysis. Our contribution involves creating an end-to-end deep learning
pipeline that integrates object detection, action recognition, and event summarization to
improve security monitoring efficiency. We employ state-of-the-art vision transformers
and open-source large language models to extract meaningful insights from live CCTV
footage. This research aims to refine our models further and explore their deployment in
smart city infrastructure, enterprise security, and home and public safety applications.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
NIBIGIRA, Nadine
Real time cardiac monitoring system based on Artificial Intelligence Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{NIBIGIRANadine2025,
title = {Real time cardiac monitoring system based on Artificial Intelligence},
author = {Nadine NIBIGIRA},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Nadine_NIBIGIRA.pdf?generation=1755026568915876&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Cardiovascular diseases (CVDs) remain the leading cause of mortality globally, disproportionately affecting populations in low- and middle-income countries due to limited access to timely diagnosis and care.
This study presents UMUTIMA, a real-time cardiac monitoring system that leverages the Internet of Things (IoT) and Artificial Intelligence (AI) to provide an end-to-end framework for remote cardiovascular healthcare. The system architecture combines wearable IoT sensors to continuously collect vital signs such as heart rate, blood pressure, and cardiac rhythm and transmits the data securely via MQTT with TLS encryption to a cloud-based server. In the cloud, lightweight AI models perform real-time analysis to detect anomalies and trigger alerts.
A hybrid AI model combining Random Forest and CNN+LSTM achieved a training accuracy of 98.6% and test accuracy of 93.8%, demonstrating strong predictive performance. Another model, the Bagging Classifier, reached 89.7% on the training set but only 78.5% on the test set, indicating overfitting. The system integrates explainable AI techniques to identify and visualize the features contributing to each alert, thereby enhancing clinical transparency and decision-making.
UMUTIMA supports early detection of cardiac anomalies, enables continuous out-of-hospital monitoring, and reduces healthcare costs. It features both mobile and web interfaces to deliver notifications via SMS, email, and push messages, and is designed to interoperate with existing telemedicine platforms and Electronic Health Records (EHRs). Despite its promise, challenges remain, including ensuring data privacy, regulatory compliance, stable wireless connectivity, and energy-efficient operation.
Overall, UMUTIMA illustrates the transformative potential of integrating AI and IoT in cardiovascular care. With continued development and clinical validation, it could significantly improve cardiac health outcomes, particularly in resource-limited settings.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Omer, Muhammad Abdulghaffar Muhammad
Rule-Based Reward Modeling for Large Reasoning Models Post-Training Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{MuhammadOmerMuhammadAbdulghaffar2025,
title = {Rule-Based Reward Modeling for Large Reasoning Models Post-Training},
author = {Muhammad Abdulghaffar Muhammad Omer},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Muhammad%20Abdulghaffar%20_Muhammad%20Omer%20.pdf?generation=1755026567278976&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Recent developments in the field of LLMs and the emergence of reasoning capabilities in foundational models have sparked a wave of specialized models for reasoning and math tasks that employ novel and sophisticated prompting, training, and finetuning techniques; these models are often referred to as Large Reasoning Models (LRMs). Some of the most prominent models tailored for such tasks are OpenAI's o-series models, Llama Nemotron, DeepSeek R1, and the Qwen-Math model series. At the center of these models' development are ideas such as Chain of Thought (CoT) and Tree of Thought (ToT) prompting as well as RLHF post-training. This work investigates the potential of simple rule-based reward modeling to enhance finetuning without labeled examples. Our work builds on a recent method called "Test-Time Reinforcement Learning (TTRL)", which finetunes LLMs using RL on unlabeled data. TTRL takes a finite set of samples from the model during inference and uses majority voting to construct binary rewards that are fed to the RL pipeline for finetuning. Our work develops an additional intermediate step that adds or subtracts reward signals based on a set of rules such as prompt-to-response length ratio, compression ratio, and the presence of code and other patterns in the response. Two of our methods show significant improvement in reward and reward accuracy on two math tasks, AIME 2024 and AMC.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
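The TTRL-based reward construction described in the abstract above (majority voting over sampled answers to form binary rewards, then a rule-based adjustment step) can be sketched as follows. This is an illustrative sketch only: the function names, the bonus magnitude, and the length-ratio threshold are assumptions, and only one of the paper's rules (prompt-to-response length ratio) is shown.

```python
from collections import Counter

def majority_vote_rewards(sampled_answers):
    """TTRL-style binary rewards from unlabeled data: the most frequent
    answer among the sampled responses becomes the pseudo-label, and
    each sample is rewarded 1.0 if it matches, else 0.0."""
    pseudo_label, _ = Counter(sampled_answers).most_common(1)[0]
    return [1.0 if a == pseudo_label else 0.0 for a in sampled_answers]

def rule_adjusted_rewards(rewards, responses, prompt_len, bonus=0.1, max_ratio=8.0):
    """Illustrative intermediate step: add or subtract a small signal
    based on a simple rule (here, prompt-to-response length ratio;
    the abstract also mentions compression ratio and code patterns)."""
    adjusted = []
    for r, resp in zip(rewards, responses):
        ratio = len(resp) / max(prompt_len, 1)
        adjusted.append(r + bonus if ratio <= max_ratio else r - bonus)
    return adjusted

# Four sampled answers to one prompt; "42" wins the majority vote.
answers = ["42", "42", "41", "42"]
print(majority_vote_rewards(answers))  # [1.0, 1.0, 0.0, 1.0]
```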
Iliya, Nasiru
Targeting the Right Farmer: Predictive Analytics for Agricultural Input Subsidy Allocation in Sub-Saharan Africa Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{IliyaNasiru2025,
title = {Targeting the Right Farmer: Predictive Analytics for Agricultural Input Subsidy Allocation in Sub-Saharan Africa},
author = {Nasiru Iliya},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Nasiru_Iliya.pdf?generation=1755026565083581&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Policymakers in Sub-Saharan Africa (SSA) are renewing interest in agricultural input subsidy programs (ISPs) for their potential to reduce poverty, enhance food security, and promote gender empowerment. However, existing ISPs often use blanket distribution methods lacking targeting precision and graduation mechanisms, leading to inefficiencies and unsustainable resource use. This study proposes a machine learning–driven model to optimize subsidy allocation by identifying eligible beneficiaries based on features such as prior input use, demographics, and household vulnerability. Using farm-level data from 422 maize farmers in Taita Taveta, Kenya, we develop a smallholder index and train four machine learning models. Using SHAP value analysis, this study highlights key predictors, such as reliance on rain-fed farming, landholdings under one acre, and large household size. This data-driven approach supports more equitable, efficient, and scalable ISP targeting in resource-constrained settings.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
MKUMBO, HAPPINESS
Multilingual Natural Language Processing Conversational Platform for Promoting Blood Donation in Tanzania Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{MKUMBOHAPPINESS2025,
title = {Multilingual Natural Language Processing Conversational Platform for Promoting Blood Donation in Tanzania},
author = {HAPPINESS MKUMBO},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/HAPPINESS_MKUMBO.pdf?generation=1755026597782770&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Blood donation remains a public health challenge worldwide, including in Tanzania, due to low donor retention and limited public awareness. To address this, this study presents a multilingual conversational AI platform that encourages and promotes voluntary blood donation, developed with a hybrid architecture integrating Retrieval-Augmented Generation (RAG), LaBSE embeddings, Normalized Compression Distance (NCD), and Llama-based response generation. The chatbot provides accurate, bilingual responses (in English and Kiswahili) tailored for Tanzanian users. The system achieves 91% accuracy and a cosine similarity score of 0.93 on standard questions, demonstrating high performance in both information retrieval and natural language generation. This work contributes to digital public health by presenting a scalable, low-cost, and culturally adapted solution for improving donor engagement.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Nimo, Charles
Africa Health Check: Probing Cultural Bias in Medical LLMs Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{NimoCharles2025,
title = {Africa Health Check: Probing Cultural Bias in Medical LLMs},
author = {Charles Nimo},
url = {https://drive.google.com/drive/folders/12qOi6ZJI8ePvTMs9D5zYrB3wDbt245Wr?usp=sharing},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Large language models (LLMs) are increasingly deployed in global healthcare, yet their outputs often reflect Western-centric training data and omit indigenous medical systems and region-specific treatments. This study investigates cultural bias in instruction-tuned medical LLMs using a curated dataset of African Traditional Herbal Medicine. We evaluate model behavior across two complementary tasks, multiple-choice questions and fill-in-the-blank completions, designed to capture both treatment preferences and responsiveness to cultural context. To quantify outcome preferences and prompt influences, we apply two complementary metrics: Cultural Bias Score (CBS) and Cultural Bias Attribution (CBA). Our results show that while prompt adaptation can reduce inherent bias and enhance cultural alignment, models vary in how responsive they are to contextual guidance. Persistent default to allopathic (Western) treatments in zero-shot scenarios suggests that many biases remain embedded in model training. These findings underscore the need for culturally informed evaluation strategies to guide the development of AI systems that equitably serve diverse global health contexts. By releasing our dataset and providing a dual-metric evaluation approach, we offer practical tools for developing more culturally aware and clinically grounded AI systems for healthcare settings in the Global South.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Leventhal, Michael
AI for Universal Literacy in the Languages People Speak Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{LeventhalMichael2025,
title = {AI for Universal Literacy in the Languages People Speak},
author = {Michael Leventhal},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Michael_Leventhal.pdf?generation=1755026549545263&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Overview of Malian language NLP projects at RobotsMali, including creative and educational content using generative AI, a reading tutor app using small ASR models, an LLM-based writing assistant, use of generative AI to create a multi-modal digital skills, data science, and AI curriculum emphasizing orality, and related development projects including a national reading competition and development of AI curriculum for Malian universities.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Onyango, Nelson
A Bi-directional Long Short-Term Memory Based Deep Learning Model for Political Hate Speech Detection in Swahili and Code-Switched English-Swahili Textual Data Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{OnyangoNelson2025,
title = {A BI-DIRECTIONAL LONG SHORT-TERM MEMORY BASED DEEP LEARNING MODEL FOR POLITICAL HATE SPEECH DETECTION IN SWAHILI AND CODE-SWITCHED ENGLISH-SWAHILI TEXTUAL DATA},
author = {Nelson Onyango},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Nelson%20_Onyango.pdf?generation=1755026544679430&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {The study curated a novel hate speech dataset of 19,369 tweets drawn from three key datasets: Politikweli, Hate_Speech_Kenya, and AfriSenti. These were cleaned, de-duplicated, language-labeled, and categorized into three primary classes: hate, offensive, and neither. Hate texts were further annotated across eight target categories: politics, ethnicity, gender, religion, social status, nationality, disability, and other. The dataset achieved a Randolph's kappa of 0.566, indicating moderate agreement, and was validated through majority voting and models including Bi-LSTM, SwahBERT, and a non-linear SVM. Using GloVe, FastText, and TF-IDF embeddings, the Bi-LSTM model achieved 92.5% accuracy and a 90.1% F1-score, outperforming the SwahBERT and SVM baselines. The research highlights the model's effectiveness in handling code-switched language and contributes significantly to low-resource NLP.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Mayienga, Marlyn
Predicting Electoral Violence Using Integrated Conflict Data Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{MayiengaMarlyn2025,
title = {Predicting Electoral Violence Using Integrated Conflict Data},
author = {Marlyn Mayienga},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Marlyn_Mayienga.pdf?generation=1755026564366368&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Electoral violence severely undermines democratic stability, with approximately 60% of elections in fragile states experiencing conflict, from voter intimidation to lethal clashes that claim thousands of lives. The 2007 Kenyan election, for instance, resulted in over 1,000 deaths and displaced 300,000 people, highlighting the urgent need for predictive tools [1]. With over 50 countries facing elections in 2025, many in high-risk regions, this issue demands immediate attention. Current models often fail to capture electoral violence's distinct temporal and spatial dynamics, such as clustering around election periods, limiting early warning capabilities. This study integrates institutional, economic, and event-based data from the Electoral Contention and Violence (ECAV) dataset, the Uppsala Deadly Electoral Conflict Dataset (DECO), Varieties of Democracy (V-Dem), and World Bank indicators to forecast violence at the election-year-country level.
Using machine learning models such as Logistic Regression, K-Nearest Neighbors, XGBoost, Long Short-Term Memory (LSTM), and a Bayesian Ensemble, we achieve an Area Under the Curve (AUC) of 0.8124 with the ensemble, outperforming baselines like Logistic Regression (AUC 0.61). Analysis reveals that 67% of incidents occur within 30 days of elections, with institutional factors like electoral intimidation and social group exclusion as key predictors. These findings enable targeted interventions, such as enhanced election monitoring by electoral bodies and international organizations, strengthening violence prevention in high-risk 2025 elections.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Ochieng, Ronnie
INTELLIGENT DIGITAL STETHOSCOPE FOR AUTOMATED LUNG SOUND ANALYSIS AND RESPIRATORY DISEASE CLASSIFICATION Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{OchiengRonnie2025,
title = {INTELLIGENT DIGITAL STETHOSCOPE FOR AUTOMATED LUNG SOUND ANALYSIS AND RESPIRATORY DISEASE CLASSIFICATION},
author = {Ronnie Ochieng},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Ronnie_Ochieng.pdf?generation=1755026603209913&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Respiratory disorders significantly impact global health, accounting for approximately 6.97% of all deaths worldwide. Chronic obstructive pulmonary disease (COPD) alone caused 3.5 million deaths in 2021, representing about 5% of all global deaths. Notably, nearly 90% of COPD deaths in individuals under 70 occur in low-and middle-income countries (LMICs), where limited access to healthcare and diagnostic tools, such as chest X-rays, often necessitates reliance on clinical expertise and auscultation for diagnosis. However, accurate interpretation of lung sounds requires specialized training and traditional stethoscopes usually present challenges due to low signal levels and interference from bodily noises. These factors contribute to potential misdiagnosis and underdiagnosis, exacerbating the burden of respiratory diseases in LMICs.
This project presents an intelligent digital stethoscope integrated with advanced machine-learning algorithms to analyze lung sounds and classify respiratory diseases. Using a Boya M1 microphone, the device captures the patient's respiratory sounds and leverages machine-learning algorithms to classify lung sounds such as crackles or wheezing and identify patterns indicative of common lung diseases like chronic obstructive pulmonary disease (COPD) and pneumonia. The classification results and recorded sounds are displayed in real-time on the device’s screen, aiding healthcare providers, especially in resource-limited settings, to make informed diagnoses, facilitating early diagnosis and management of respiratory conditions.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Mnyawami, Yuda
Predictive System for Characterizing Student Dropout using K-Nearest Oracle with Automated Machine Learning (KNORA-AutoML) Model Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{MnyawamiYuda2025,
title = {Predictive System for Characterizing Student Dropout using K-Nearest Oracle with Automated Machine Learning (KNORA-AutoML) Model},
author = {Yuda Mnyawami},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Yuda_Mnyawami.pdf?generation=1755026602458589&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Students continue to drop out of basic education in developing countries because dropout features are dynamic: the contributing factors keep changing periodically, making dropout difficult to reduce. Conventional machine learning models have been applied to this persistent problem, yet student dropout continues in developing countries, particularly Tanzania, because such models cannot accurately determine the features leading to dropout. This study used the Twaweza information repository to establish a suitable dataset for the KNORA-AutoML prediction model. The KNORA-AutoML model demonstrated 97% accuracy and 87% AUC, outperforming previous studies, and was used to develop the predictive system. Results reveal that in rural areas, students at risk of dropping out come from households with many children and walk long distances to school; in urban areas, students travel more than 11 kilometers to school, which prevents them from completing their homework. Parents' occupations also matter: housewives with more than five children are unlikely to support their children's schooling. Features such as distance, household size, number of household children, mode of transport, and parents' occupation contribute most to student dropout. The proposed predictive system demonstrates its effectiveness in identifying students at risk of dropping out and proposing early interventions.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Okewunmi, Paul
Evaluating Robustness of LLMs to Typographical Noise in Yorùbá QA Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{OkewunmiPaul2025,
title = {Evaluating Robustness of LLMs to Typographical Noise in Yorùbá QA},
author = {Paul Okewunmi},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Paul_Okewunmi%20.pdf?generation=1755026564101424&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Generative AI models are primarily accessed through chat interfaces, where user queries often contain typographical errors. While these models perform well in English, their robustness to noisy inputs in low-resource languages like Yorùbá remains underexplored. This work investigates a Yorùbá question-answering (QA) task by introducing synthetic typographical noise into clean inputs. We design a probabilistic noise injection strategy that simulates realistic human typos. In our experiments, each character in a clean sentence is independently altered, with noise levels ranging from 10% to 40%. We evaluate performance across three strong multilingual models using two complementary metrics: (1) a multilingual BERTScore to assess semantic similarity between outputs on clean and noisy inputs, and (2) an LLM-as-judge approach, where the best Yorùbá-capable model rates fluency, comprehension, and accuracy on a 1–5 scale. Results show that while English QA performance degrades gradually, Yorùbá QA suffers a sharper decline. At 40% noise, GPT-4o experiences over a 50% drop in comprehension ability, with similar declines for Gemini 2.0 Flash and Claude 3.7 Sonnet. We conclude with recommendations for noise-aware training and dedicated noisy Yorùbá benchmarks to enhance LLM robustness in low-resource settings.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Ochieng, Millicent
Benchmarking LLMs: From Standard NLP Tasks to Real-World Multilingual Challenges Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{OchiengMillicent2025,
title = {Benchmarking LLMs: From Standard NLP Tasks to Real-World Multilingual Challenges},
author = {Millicent Ochieng},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Millicent_Ochieng.pdf?generation=1755026562060332&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {The effectiveness of large language models (LLMs) in multicultural, multilingual, and low-resource real-world settings remains largely underexplored. While benchmark metrics like F1 scores provide a superficial measure of performance, they fail to capture critical cultural and contextual understanding. This study evaluates seven leading LLMs, including GPT-4, Mistral-7b, and Llama-2-70b, on a dataset comprising real-world WhatsApp conversations featuring English, Swahili, and Sheng. Through both quantitative (F1 scores) and qualitative (explanation quality) analyses, we reveal that despite strong metric performance, models often misunderstand cultural nuances, code-mixing, and contextual sentiment. Our results suggest that human-centered evaluation is crucial for responsible and effective deployment of LLMs in African and other low-resource settings.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Sowole, Oladimeji Samuel
Integrating Network Curvature into Epidemic Dynamics: A Curvature-Aware SIR Model Framework Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{SowoleOladimejiSamuelSowole2025,
title = {Integrating Network Curvature into Epidemic Dynamics: A Curvature-Aware SIR Model Framework},
author = {Oladimeji Samuel Sowole},
url = {https://drive.google.com/file/d/1fA2dCCbbdgCNUjVSSphTlCH5d3_7ag3d/view?usp=sharing},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Rabothata, Moyahabo
Supervised Machine Learning and Deep Learning Techniques for Legal Text Classification Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{RabothataMoyahabo2025,
title = {Supervised Machine Learning and Deep Learning Techniques for Legal Text Classification},
author = {Moyahabo Rabothata},
url = {https://drive.google.com/file/d/10jWrjh9iuF_5R0zznwy6gPMGFsHLRhdm/view?usp=drivesdk},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Rakotondranisa, Onintsoa Anjara
Constraining the Hubble Tension with Fast Radio Bursts using Machine Learning Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{AnjaraRakotondranisaOnintsoa2025,
title = {Constraining the Hubble Tension with Fast Radio Bursts using Machine Learning},
author = {Onintsoa Anjara Rakotondranisa},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Onintsoa_Anjara%20Rakotondranisa.pdf?generation=1755026592117036&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Fast Radio Bursts (FRBs) are transient events used as valuable tools to probe the cosmic expansion of the Universe. The main property containing the cosmological information is in the dispersion measure from the intergalactic medium DMIGM. In this work, we aim to constrain the Hubble tension by estimating the Hubble constant H0, using FRB data, without assuming any specific cosmological model. As a baseline, we used three machine learning models to predict〈DMIGM〉. Then, the Hubble parameter H(z) was subsequently reconstructed. We found that the ANN gives the most reliable H0 estimation, which is consistent with late-time measurements. We also found a non-physical offset of 100 pc cm−3 at z = 0. Additional experiments were conducted to refine our results. Our findings suggest that Physics-Informed Neural Networks could potentially provide more reliable estimations.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Rabothata, Moyahabo Muriel
Network Analysis and Topic Modelling to Identify Influential Authors Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{RabothataMoyahaboMuriel2025,
title = {Network Analysis and Topic Modelling to Identify Influential Authors},
author = {Moyahabo Muriel Rabothata},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Moyahabo%20Muriel_Rabothata.pdf?generation=1755026588075740&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {To assist the University of Pretoria Faculty of Private Law in understanding the Fontes Juris data, which consists of law sources cited in South African court cases from 1825 to 2015, this project aimed to use machine learning (ML) techniques to assess the impact of legal research in South African (SA) courts. The project employed topic modelling to identify topics in the corpus and used network analysis to model author-judgment and author-topic pairs using various centrality measures. The degree centrality measure revealed interesting patterns in the data, indicating the most influential authors. Topic modelling revealed a decrease in publications cited in court, although the citations are not topic specific, and showed the large influence of Old Laws, Roman-Dutch law and English law.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Ooko, Samson
TinyML-Based Acoustic Bird Detection for Crop Protection in African Farms Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{OokoSamson2025,
title = {TinyML-Based Acoustic Bird Detection for Crop Protection in African Farms},
author = {Samson Ooko},
url = {https://drive.google.com/file/d/1dPdlA3xJ0PswwMp9_2DhygJgZ1-xiiYS/view?usp=sharing},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Musinguzi, Denis
PaliGemma-CXR: Multitask Multimodal for Chest X-ray Interpretation Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{MusinguziDenis2025,
title = {PaliGemma-CXR: Multitask Multimodal for Chest X-ray Interpretation},
author = {Denis Musinguzi},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Denis_Musinguzi.pdf?generation=1755026591481830&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Tuberculosis is a significant global health challenge. Chest X-rays play a critical role in TB screening, yet many countries face a shortage of radiologists capable of interpreting these images. In addition, the interpretations are prone to variability. Machine learning has shown promise as an alternative. However, traditional machine learning approaches rely on task-specific models, which fail to utilize the interdependence between medical image interpretation tasks. Moreover, learning multiple tasks within the same model improves generalization. We propose PaliGemma-CXR, a multi-task multimodal model that jointly learns TB diagnosis, object detection, segmentation, report generation, and VQA in a supervised manner. Starting with a dataset of chest X-ray images annotated with diagnosis labels and segmentation masks, we curated multimodal datasets for detection, report generation and VQA. Our experiments with the dataset confirm the effectiveness of PaliGemma-CXR in performing all five tasks compared to training individual models for each task.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Oyelaja, Iremide
SECURE AND SCALABLE HORIZONTAL FEDERATED LEARNING FOR BANK FRAUD DETECTION Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{OyelajaIremide2025,
title = {SECURE AND SCALABLE HORIZONTAL FEDERATED LEARNING FOR BANK FRAUD DETECTION},
author = {Iremide Oyelaja},
url = {https://drive.google.com/file/d/1AgEXywzgjijyIwFTglqqxiDeaFyepTBb/view?usp=sharing},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Akanni, Comfort
AI-BASED EARLY DETECTION OF DEVELOPMENTAL DELAYS IN CHILDREN WITH SICKLE CELL DISEASE (AGES 0–5) Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{AkanniComfort2025,
title = {AI-BASED EARLY DETECTION OF DEVELOPMENTAL DELAYS IN CHILDREN WITH SICKLE CELL DISEASE (AGES 0–5)},
author = {Comfort Akanni},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Comfort_Akanni.pdf?generation=1755026596315997&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Children under the age of 5 years with Sickle Cell Disease (SCD) are at high risk of developmental delay. This is a major concern because the delay stems from the effects of frequent episodes of chronic anemia, crises, and cerebrovascular complications on cognitive and motor development. Early detection is essential for timely intervention, but current screening methods lack predictive accuracy and accessibility.
This study aims to develop and evaluate DevSickleNet, a multimodal deep learning model designed to predict developmental delays in children with Sickle Cell Disease (SCD) using clinical, caregiver, and developmental milestone data.
DevSickleNet integrates clinical, caregiver, and milestone time-series data to predict developmental delays in children with SCD. A synthetic dataset of 300 samples was generated, incorporating clinical features (hemoglobin levels, pain crisis history, stroke history), caregiver factors (educational level, socioeconomic status), and developmental milestone progression over a span of 12 months. These features were encoded and normalized using standard preprocessing pipelines. The multimodal data was input into DevSickleNet, which combines an LSTM network for the milestone time series, fully connected layers for static inputs, and an attention-based fusion layer. The model was trained using cross-entropy loss and evaluated using accuracy, precision, recall, F1-score, and ROC-AUC.
DevSickleNet achieved an accuracy of 85.4%, a precision of 82.9%, a recall of 79.5%, an F1-score of 81.1% and a ROC-AUC of 0.87, outperforming traditional models such as Logistic Regression, Random Forest and XGBoost. Feature importance analysis identified pain crisis frequency, hemoglobin levels and caregiver educational status as the key predictors of developmental delays. The model's performance is attributed to its ability to process sequential milestone data through the LSTM and combine it with static clinical and caregiver features using attention-based fusion, allowing DevSickleNet to capture both temporal progression and static risk factors effectively.
These results highlight the potential of AI-driven multimodal learning for early developmental delay screening in SCD patients. Although the findings are promising, the model still needs to be validated on a clinical dataset obtained from tertiary healthcare facilities to confirm its clinical applicability. DevSickleNet thus provides a basis for AI-powered early detection of developmental delay in children with SCD and for intervention strategies aimed at improving their developmental outcomes.
Keywords: Sickle Cell Disease, Developmental delays, Deep learning, DevSickleNet, Pediatric AI, Early detection, Multimodal AI.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Mba, Patience
Estimation of physico-chemical properties of soil using machine learning Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{MbaPatience2025,
title = {Estimation of physico-chemical properties of soil using machine learning},
author = {Patience Mba},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Patience_Mba.pdf?generation=1755026584622161&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Soil quality assessments are essential for agricultural productivity, yet traditional methods are often slow and expensive. This study presents a machine learning-based approach for predicting the physical and chemical properties of soil using RGB images. A dataset of 1388 soil samples collected from Benue State, Nigeria, was analysed using texture features extracted via GLCM and Gabor filters. Multiple models, including SVR, CNN, an optimized CNN, and a hybrid ML-CNN stack, were evaluated. The ML-CNN stack showed superior performance, achieving high predictive accuracy with reduced error rates. These findings demonstrate the feasibility of using ML techniques for efficient, low-cost soil analysis.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Talotsing, Gaelle Patricia
A stacking Ensemble Machine Learning Model for Emergency Call Forecasting Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{TalotsingGaellePatricia2025,
title = {A stacking Ensemble Machine Learning Model for Emergency Call Forecasting},
author = {Gaelle Patricia Talotsing},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Gaelle%20Patricia_Talotsing.pdf?generation=1755026579961972&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {One of the greatest challenges for emergency medical service (EMS) providers is handling the large number of EMS calls coming from the population. Accurate forecasting of EMS calls feeds into ambulance fleet dispatching and routing, minimizing response times to emergency calls and enhancing the efficacy of assistance. Yet demand for emergency services exhibits significant variability, making it challenging to accurately predict the future occurrence of emergency calls and their spatio-temporal distribution. Here, we propose a stacking ensemble machine learning model to forecast EMS calls, combining different base learners to enhance overall generalization performance. Additionally, we conducted experiments using the Boruta, Lasso, RFFI and SHAP feature selection methods to identify the most informative attributes of the EMS dataset. The proposed ensemble model integrates a base layer and a meta layer. In the base layer, we applied four base learners: Decision Tree, Gradient Boosting Regression Tree, Light Gradient Boosting Machine and Random Forest. In the meta layer, we used an optimized Random Forest model to integrate the outputs of the base learners. We evaluate the performance of our proposed model using the R²-score and four different error metrics. Based on a real dataset including spatial, temporal and weather features, the findings of this study demonstrate that the proposed stacking-based ensemble model achieved a better score and smaller errors than traditional single algorithms, online machine learning methods and voting ensemble methods. We achieved an R²-score of 0.9954, an MSE of 0.8938, an RMSE of 0.9454, an MAE of 0.2923 and a MAPE of 0.0724, outperforming state-of-the-art models. This work aids emergency managers in making well-informed decisions, improving ambulance dispatch and routing outcomes, and enhancing ambulance response time.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Falola, Peace
From Play to Preservation: KọÈdè – Where Tech Meets Language Learning Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{FalolaPeace2025,
title = {From Play to Preservation: KọÈdè – Where Tech Meets Language Learning},
author = {Peace Falola},
url = {https://drive.google.com/file/d/1JQQtxSgl_lUyJrHdHD1XzusaGDWYpIje/view?usp=drive_link},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Falola, Peace
A Multi-Domain Annotated Yoruba NER Dataset: Expanding NLP Resources for Low-Resource African Languages Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{FalolaPeace2025b,
title = {A Multi-Domain Annotated Yoruba NER Dataset: Expanding NLP Resources for Low-Resource African Languages},
author = {Peace Falola},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Peace%20_Falola.pdf?generation=1755026581717566&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {This research introduces a high-quality, multi-domain Named Entity Recognition (NER) dataset for the Yoruba language, aimed at expanding NLP resources for low-resource African languages. The dataset consists of 4,767 sentences and over 100,000 tokens, annotated across five domains: blogs, Bible, movies, radio broadcasts, and Wikipedia. Each sentence was labeled by three native Yoruba speakers following consistent guidelines. The dataset supports three entity types: Person, Location, and Organization. Inter-annotator agreement scores (Fleiss’ Kappa) were consistently high across all domains, with no disagreements from entity type mismatch. This dataset fills a critical gap in Yoruba NLP and enables research in cross-domain evaluation, domain-adaptive modeling, and the development of baseline and unified NER systems.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Silima, Walter
Machine Learning Approaches to Study Star Formation and Black Hole Accretion in the MeerKAT/MIGHTEE survey Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{SilimaWalter2025,
title = {Machine Learning Approaches to Study Star Formation and Black Hole Accretion in the MeerKAT/MIGHTEE survey},
author = {Walter Silima},
url = {https://docs.google.com/presentation/d/1uHVLsoGUuRCFyOT1W5qXfNJHsSgBUIHkoz3iYdfkMn8/edit?usp=sharing},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Radio synchrotron emission originates from both massive star formation and black hole accretion, two processes that drive galaxy evolution. Therefore, current high-sensitivity and wide-field extragalactic radio continuum surveys require efficient and reliable classification of radio sources dominated by star formation or black hole accretion before utilizing radio continuum for exploring cosmic evolution. In this study, we implement, optimize, and compare five widely used supervised machine-learning (ML) algorithms to classify radio sources detected in the MeerKAT International GHz Tiered Extragalactic Exploration (MIGHTEE)–COSMOS survey as star-forming galaxies (SFGs) and active galactic nuclei (AGN). We utilize conventionally classified SFGs and AGN from MIGHTEE-COSMOS to construct training and test sets for evaluating the ML algorithms' performance. To select input features for our ML analyses, we incorporate 18 physical parameters of MIGHTEE-detected radio sources. As anticipated, our feature analyses rank the six parameters used in conventional classification as the most effective: the infrared-radio correlation parameter (qIR), the optical compactness morphology parameter (class_star), three combined mid-infrared colors, and stellar mass. By optimizing the ML models with these selected features and testing classifiers across various feature combinations, we find that model performance generally improves as additional features are incorporated. Overall, all five algorithms yield an F1-score (the harmonic mean of precision and recall) above 90% even when only 20% of the data is used for training. These findings highlight the strong potential of ML techniques for classifying radio sources in upcoming large radio continuum surveys.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Ssempeebwa, Phillip
Hierarchical CXR-Net: A Two-Stage Framework for Efficient and Interpretable Chest X-Ray Diagnosis Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{SsempeebwaPhillip2025,
title = {Hierarchical CXR-Net: A Two-Stage Framework for Efficient and Interpretable Chest X-Ray Diagnosis},
author = {Phillip Ssempeebwa},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Phillip%20_Ssempeebwa%20.pdf?generation=1755026612009720&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Chest radiography is one of the most common and vital diagnostic imaging tools globally. However, the interpretation of chest X-rays can be challenging, time-consuming, and subject to inter-observer variability, particularly in resource-limited settings where there is a shortage of expert radiologists. To address this, we present an end-to-end deep learning model designed to provide comprehensive radiological assistance by not only identifying abnormalities but also localizing them and generating structured reports.
Our methodology leverages the large-scale NIH Chest X-ray dataset, comprising over 112,000 images. We trained a DenseNet-121 model for multi-label classification across 14 common thoracic pathologies. Crucially, to overcome the scarcity of location-specific annotations, we employ a weakly-supervised learning approach. Gradient-weighted Class Activation Mapping (Grad-CAM) is used on the trained classification model to generate visual heatmaps that highlight the regions indicative of predicted diseases, providing effective abnormality localization without requiring explicit bounding box labels during training.
The classification model achieved a strong mean Area Under the ROC Curve (AUROC) of 0.796 across all pathologies on a held-out test set, demonstrating robust diagnostic performance. Qualitative results show that the Grad-CAM heatmaps successfully and plausibly highlight relevant pathological regions. The system's final output is a structured, human-readable report that synthesizes the classification and localization findings, mimicking a preliminary radiological summary.
This work demonstrates a viable and effective pipeline for AI-powered radiological assistance. By combining classification, localization, and automated reporting, our system has the potential to enhance diagnostic accuracy, improve workflow efficiency, and serve as a valuable educational and decision-support tool for healthcare professionals.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Pidy, Pidy
Enhancing Urban Mobility in Developing Countries: A VANET Architecture with Secure mRSU Routing and Machine Learning Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{PidyPidyleoncetherese2025,
title = {Enhancing Urban Mobility in Developing Countries: A VANET Architecture with Secure mRSU Routing and Machine Learning},
author = {Pidy Pidy},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/leonce%20therese_Pidy%20Pidy.pdf?generation=1755026586816721&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Developing countries struggle to ensure efficient and safe mobility due to urban congestion, inadequate infrastructure, and limited technical resources. This leads to lost productivity, increased pollution, limited access to essential services, and heightened vulnerability to road accidents and safety hazards. This research proposes a comprehensive, context-aware solution leveraging Vehicular Ad-Hoc Networks (VANETs), integrating: mobile roadside units (mRSUs) deployed on motorcycle-taxis to extend network coverage; an adaptive ACO-based routing protocol with dynamic clustering to optimize traffic efficiency; and a machine learning-powered intrusion detection system to secure communications against Blackhole, Grayhole, and Sybil attacks.
Experimental results demonstrate significant improvements in throughput, latency reduction, and resilience against cyber threats. This work represents a major step toward intelligent, context-adapted urban transport systems, specifically designed for the challenges of African cities.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
John, Ngeta
Music Information Retrieval: Teaching Machines to Listen Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{JohnNgeta2025,
title = {Music Information Retrieval: Teaching Machines to Listen},
author = {Ngeta John},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Ngeta_John.pdf?generation=1755026598516963&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {
Music Information Retrieval (MIR) represents a fundamental challenge in teaching machines to understand music the way humans do. This poster explores the evolution of deep learning approaches to automatic chord recognition, from early CNN architectures achieving 77% accuracy to modern foundation models like MERT reaching 86.9% performance on standard benchmarks. We examine the complete technical pipeline from raw audio to musical understanding: audio preprocessing, chroma feature extraction, and the architectural evolution from CNNs (2012) through LSTM networks (2018) to Transformer-based models (2019) and foundation models (2023). Current systems achieve real-time processing with <100ms latency, enabling applications in music therapy, personalized education, and adaptive entertainment. However, a critical challenge remains: existing MIR systems exhibit significant Western bias, achieving 88% accuracy on Western pop music but <60% on traditional African music. We discuss how foundation models like MERT, with their self-supervised learning capabilities and scaling from 95M to 330M parameters, offer potential pathways to culturally-inclusive music AI. The poster demonstrates how techniques familiar to deep learning practitioners—CNNs for pattern recognition, LSTMs for sequence modeling, and Transformers for attention-based understanding—apply directly to music, creating systems that not only recognize chords but generate new musical content. We conclude with a vision for building MIR systems that celebrate musical diversity and serve all cultures, highlighting opportunities for African researchers to contribute to more inclusive AI development.
Keywords: Music Information Retrieval, Deep Learning, Chord Recognition, Foundation Models, Cultural AI, MERT, Transformers},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
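The chroma (pitch-class) features mentioned in the abstract above are the standard input for chord recognition; as a simplified numpy-only sketch of how a magnitude spectrum is folded into 12 pitch classes (the function name and sample rate are illustrative, and production systems would use a tuned filter bank instead):

```python
import numpy as np

def chroma_from_frame(frame, sr=22050, fmin=55.0):
    """Fold the magnitude spectrum of one audio frame into a normalized
    12-bin pitch-class (chroma) vector."""
    spectrum = np.abs(np.fft.rfft(frame))
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sr)
    chroma = np.zeros(12)
    for f, mag in zip(freqs, spectrum):
        if f < fmin:  # skip DC and sub-bass bins
            continue
        midi = 69 + 12 * np.log2(f / 440.0)   # frequency -> MIDI pitch number
        chroma[int(round(midi)) % 12] += mag  # accumulate into its pitch class
    s = chroma.sum()
    return chroma / s if s > 0 else chroma
```

A pure A4 tone (440 Hz) concentrates its energy in chroma bin 9 (pitch class A); sequences of such vectors are what the CNN/LSTM/Transformer models described above consume.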
Idakwo, Patricia Ojonoka
Road Traffic Crash Severity Prediction in Low-Resource Contexts: Ensemble Machine Learning and Deep Learning Approaches Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{IdakwoPatriciaOjonoka2025,
title = {Road Traffic Crash Severity Prediction in Low-Resource Contexts: Ensemble Machine Learning and Deep Learning Approaches},
author = {Patricia Ojonoka Idakwo},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Patricia%20Ojonoka%20_Idakwo.pdf?generation=1755026551811226&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {Road traffic crash (RTC) severity prediction is critical for improving post-crash care and reducing fatalities, particularly in low-resource contexts like Nigeria, where structured crash data is scarce and emergency medical services (EMS) face operational constraints. This study presents a data-centric, multi-modal approach to RTC severity prediction, addressing these challenges. A Nigerian RTC dataset comprising 59 features across three data modes: unstructured textual data (mode 1), structured numerical data (mode 2), and a fusion of both (mode 3) was curated from unstructured online crash narratives using Natural Language Processing techniques: named entity recognition, one-hot encoding, and text mining. Owing to class imbalance, we utilized the weighted average F1-score to evaluate the performance of the ensemble machine learning and deep learning models in RTC severity prediction. Across all modes, the LSTM-CNN model with Word2Vec embeddings achieved the best performance on the mode 3 data with a weighted F1-score of 0.755, and on mode 1 data with 0.674, while Gradient Boosting achieved the highest score (0.520) on mode 2 data. These findings highlight the advantage of multi-modal data fusion and hybrid neural networks in enhancing RTC severity prediction for data-driven EMS resource allocation and road safety, supporting efforts to reduce mortality and serious injuries in resource-constrained emergency response systems.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
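The abstract above evaluates models with the weighted average F1-score to handle class imbalance; as a minimal illustration of how that metric is computed (the function name and toy severity labels below are hypothetical, not from the poster):

```python
from collections import Counter

def weighted_f1(y_true, y_pred):
    """Support-weighted average of per-class F1 scores: each class's F1 is
    weighted by its share of the true labels, so rare severity classes do
    not dominate (or vanish from) the overall score."""
    support = Counter(y_true)
    total = len(y_true)
    score = 0.0
    for cls, n_cls in support.items():
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p == cls)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != cls and p == cls)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p != cls)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
        score += (n_cls / total) * f1
    return score
```

This matches the conventional "weighted" averaging mode of per-class F1 used in common ML toolkits.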
Oketta, Peter
Enhancing Bean Crop Disease Diagnosis with Vision-Language Models: A Multitask Approach using PaliGemma Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{OkettaPeter2025,
title = {Enhancing Bean Crop Disease Diagnosis with Vision-Language Models: A Multitask Approach using PaliGemma},
author = {Peter Oketta},
url = {https://drive.google.com/file/d/1U5QvziNXzJTqB2zoPoIcTI759S5jYxxt/view?usp=sharing},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Baker, Rameeze
[No title] Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{BakerRameeze2025,
title = {[No title]},
author = {Rameeze Baker},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Rameeze_Baker.pdf?generation=1755026593751224&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {For too long, compliance has been a barrier instead of a bridge. Small businesses, traders, and entrepreneurs face complex regulations, limited financial access, and lack of trust in the system. This keeps them locked out of opportunities that could help them grow. The reality? Millions of hardworking business owners are stuck navigating rules that weren’t designed for them, without the tools to succeed. S2P redefines Compliance as a force for growth, not limitation.
Our Goals:
1. Turning Trust into Currency
2. Unleashing The Power of Crowdsourced Accountability
3. Bridging the Gap Between Informal & Formal Economies
4. Future-Proofing Against Compliance & Sustainability Shifts
S2P: an economic enabler, a trust engine, and a bridge between fragmented markets.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
Lupyani, Rebecca
Leveraging AI for Visual Impairment: A Linguistically Inclusive Model for Zambian Learners Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{LUPYANIREBECCA2025,
title = {Leveraging AI for Visual Impairment: A Linguistically Inclusive Model for Zambian Learners},
author = {Rebecca Lupyani},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/REBECCA%20_LUPYANI%20.pdf?generation=1755026574730734&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
abstract = {In today’s digital era, emerging technologies are being harnessed to enhance and support various sectors, including inclusive education. Recognizing the imperative of educational equity, a number of international and national policies have been instituted to promote inclusive learning environments. One such policy is Article 24 of the United Nations Convention on the Rights of Persons with Disabilities (UNCRPD), adopted in 2006. It affirms the right of persons with disabilities to inclusive education at all levels and mandates that states ensure reasonable accommodations, individualized support, and accessibility within mainstream education systems. This aligns closely with the Envision 2030 Agenda, which articulates the 17 Sustainable Development Goals (SDGs) aimed at creating a more inclusive and equitable world for all, particularly persons with disabilities. Anchored in the principle of “leaving no one behind,” the agenda underscores inclusive education as a critical driver of sustainable development.
Despite global advocacy, the realization of SDG 4, which seeks to ensure inclusive and equitable quality education for all, remains elusive for learners who are blind or visually impaired across Africa. These learners continue to face entrenched barriers, including inaccessible learning materials, limited availability and affordability of assistive technologies, and a shortage of trained educators, all of which hinder their full participation in educational systems. However, recent advances in Artificial Intelligence (AI) and Machine Learning (ML) have introduced promising new possibilities for assistive technologies. Innovations such as AI-powered screen readers, object recognition tools, and voice-enabled navigation systems are increasingly being developed to support independent learning and mobility for individuals with visual impairments.
Within African contexts, emerging studies reveal both potential and persisting challenges. In Nigeria, for instance, only about 36% of visually impaired adults are aware of existing AI-powered assistive technologies, and fewer than 18% possess the skills to use them effectively. In Kenya, AI-powered tools such as smart canes, screen readers, and applications like ‘Seeing AI’ and ‘Be My Eyes’ have contributed to enhanced mobility and digital inclusion. Nonetheless, inequities in internet access, affordability, and public awareness continue to impede widespread adoption.
In Zambia, a study by Ndume (2025) that investigated the use of assistive devices for blind and visually impaired learners revealed that 65.6% of visually impaired students relied on no-tech tools (braille slates, styluses, and abacuses), 25% used low-tech devices, while only 9.4% accessed high-tech assistive technologies such as computers, tablets, smartphones, and screen reader software like JAWS. Although AI-powered assistive technologies have gained traction globally, their potential remains largely untapped in Zambia. A critical gap exists not only in the adoption of AI-based tools but also in the capacity of these tools to function in Zambian local languages, which is crucial for inclusive learning.
Therefore, this study seeks to design and develop an AI-powered wearable assistive device for the blind and visually impaired, designed to provide learners with an awareness of their surroundings through facial recognition, object identification, screen description, and auditory feedback in Bemba, one of Zambia’s widely spoken local languages.},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}
N’guessan, Regis
The Reflexive Integrated Information Unit: A Differentiable Primitive for Artificial Consciousness Presentation
Poster presented at the Deep Learning Indaba 2025, Kigali, Rwanda, 01.08.2025, (Non-archival).
@misc{NguessanRegis2025,
title = {The Reflexive Integrated Information Unit: A Differentiable Primitive for Artificial Consciousness},
author = {Regis N’guessan},
url = {https://storage.googleapis.com/download/storage/v1/b/indaba-2025-posters/o/Regis%20_N’guessan%20.pdf?generation=1755026590101730&alt=media},
year = {2025},
date = {2025-08-01},
address = {Kigali, Rwanda},
howpublished = {Poster presented at the Deep Learning Indaba 2025},
note = {Non-archival},
keywords = {},
pubstate = {published},
tppubtype = {presentation}
}