cv
General Information
Full Name | Amanuel Mersha |
Languages | English, Amharic |
mersha.aman@gmail.com |
Education
-
2023
M.Sc. in Artificial Intelligence
Addis Ababa University, Addis Ababa, Ethiopia
- Thesis - RL based Dynamic Transformer Networks for efficient inference.
-
2016
B.Sc. in Software Engineering
Addis Ababa University, Addis Ababa, Ethiopia
- Thesis - Dynamic Transformer Networks for faster inference.
Work Experience
-
July 2023 - Present
Machine Learning Engineer
BitzBrain Research
- Reproduced papers and conducted NLP research on low-resource multilingual datasets.
- Implemented image and text retrieval systems with a multimodal language model, and vector databases.
- Significantly reduced training and deploying cost of language models through innovative techniques.
-
Sept 2021 - Sept 2023
Software Engineer
Lumos Technologies
- Lead a development team
- Design System Architecture
-
Sept 2016 - Present
Researcher & Lecturer
Addis Ababa University
- Teach courses - Intro to AI, Intro to Algorithm and Data Structure.
- Develop software products for the university
- Conduct AI/ML Research
-
Sept 2021 - Sept 2023
Founder & Developer
Sefere App
- Launched a startup intending to boost the visibility of small businesses through mobile and web apps.
- Implemented a self-improving content-based recommendation and feed ranking system using various ML algorithms.
-
Jan 2017 - Sept 2018
Computer Vision and Robotics Engineer
4Kbots AI and Robotics Lab
- Built a ROS-based RoboCup Platform
- Implemented SLAM and Object Detection algorithms
Research Experience
-
Apr 2022 - July 2023
DynamicViT - Making Vision Transformer faster by skipping layers - GitHub
Addis Ababa University
- Developed a transformer network that employs a reinforcement learning (RL) agent to learn to skip irrelevant layers during processing an input, resulting in a 40% increase in throughput compared to the vanilla ViT
-
Jun 2021 - May 2023
DistillEmb - Distilling word-embeddings via contrastive learning - GitHub
Addis Ababa University
- Compressed word embeddings into a CNN to address out-of-vocabulary issues, conserve memory, and facilitate cross-lingual transfer, resulting in a 7% improvement in F1 accuracy and 1400% less memory usage, as well as out-of-the-box low resource cross-lingual transfer capabilities.
-
Sept 2021 - Jan 2022
Boosting image retrieval with accurately labeled small data - GitHub
Addis Ababa University
- Explored the utilization of a detailed labeled dataset to enhance the feature set in an image retrieval task, enabling a small, shallow network to surpass the performance of a more complex, larger model
-
Apr 2018 - Sept 2019
Morphological generation for Wolayita using CNN based Seq2Seq model
Addis Ababa University
- Constructed a CNN-based encoder-decoder morphological analyzer for Wolayita language utilizing fewer parameters than LSTM-based models
-
Apr 2018 - Jun 2019
Morphology-rich alphasyllabary embeddings - GitHub
Addis Ababa University
- Examined the relationship between language morphology and corresponding word embeddings, proposing a technique to enhance the syntactic features of embeddings
Open Source Projects
-
Urban Land Usage Optimization Using NSGA II - Jupyter Notebook
-
ROS and Gazebo-based Delta-3 robot model - GitHub
-
Genetic Algorithm based classroom timetable generator - GitHub
Skills
Softskills | Problem-solving, Teamwork, Communication skills, Attention to detail, Curiosity & Adaptability |
Major ML Areas | Deep Learning, Information Retrieval, Image Retrieval, Computer Vision, Language Models, Sequence Modeling, Reinforcement Learning, Image Processing, Prompt Engineering, Robotics, NLP, Boosting, Bayesian Optimization, ML Debugging, Feature Engineering, Data Cleaning & Pre-processing |
Languages | Python, C/C++, Java, C#, Kotlin, JavaScript, TypeScript, Julia |
ML Frameworks | PyTorch, TensorFlow, Keras, TensorFlow.Js, Lightning, Sci-kit Learn, Pandas, Gensim, VectorDB, MLFlow, AutoML, SciPy, ONNX, OpenCV, NLTK, SpaCy |
ML Tools | TensorBoard, Weight \& Biases, Matplotlib, Seaborn, Bokeh, Google Cloud Platform (GCP), ROS, Gazebo |
Application Frameworks | Angular, ReactJs, AspNet Core, NodeJs, Django, Flask, FastAPI, Android, ReactNative |
Database | MySQL, MS SQL Server, MongoDB, PostgreSQL, Redis, NoSQL |
Software Engineering | Git, Docker, Github Action, CI/CD, MLOps, DevOps, GraphQL, REST, GraphQL, gRPC, Testing, OOP, Architecture Design, Distributed & Parallel Computing |
Leadership and Volunteering Activities
-
Oct 2023
ICCV Student Volunteer
International Conference on Computer Vision (ICCV) 2023 - Paris, France
-
May 2023
ICLR Student Volunteer
International Conference on Learning Representation (ICLR) 2023 - Kigali, Rwanda
-
Jan 2022
Program Chair
Deep Learning IndabaX ASTU Conference - Adama, Ethiopia
-
Oct 2021
Ambassador of Ethiopia
Zindi Africa - Addis Ababa, Ethiopia
-
Oct 2019
General Co-Chair
AI in Ethiopia Conference - Addis Ababa, Ethiopia
-
May 2019
Tutorial Chair
Deep Learning IndabaX WSU Conference - Wolaita Sodo, Ethiopia
Other Interests
- Gaming, Basketball, Hiking, Traveling