cv

General Information

Full Name Amanuel Mersha
Languages English, Amharic
Email mersha.aman@gmail.com

Education

  • 2023
    M.Sc. in Artificial Intelligence
    Addis Ababa University, Addis Ababa, Ethiopia
    • Thesis - RL based Dynamic Transformer Networks for efficient inference.
  • 2016
    B.Sc. in Software Engineering
    Addis Ababa University, Addis Ababa, Ethiopia
    • Thesis - Dynamic Transformer Networks for faster inference.

Work Experience

  • July 2023 - Present
    Machine Learning Engineer
    BitzBrain Research
    • Reproduced papers and conducted NLP research on low-resource multilingual datasets.
    • Implemented image and text retrieval systems with a multimodal language model, and vector databases.
    • Significantly reduced training and deploying cost of language models through innovative techniques.
  • Sept 2021 - Sept 2023
    Software Engineer
    Lumos Technologies
    • Lead a development team
    • Design System Architecture
  • Sept 2016 - Present
    Researcher & Lecturer
    Addis Ababa University
    • Teach courses - Intro to AI, Intro to Algorithm and Data Structure.
    • Develop software products for the university
    • Conduct AI/ML Research
  • Sept 2021 - Sept 2023
    Founder & Developer
    Sefere App
    • Launched a startup intending to boost the visibility of small businesses through mobile and web apps.
    • Implemented a self-improving content-based recommendation and feed ranking system using various ML algorithms.
  • Jan 2017 - Sept 2018
    Computer Vision and Robotics Engineer
    4Kbots AI and Robotics Lab
    • Built a ROS-based RoboCup Platform
    • Implemented SLAM and Object Detection algorithms

Research Experience

  • Apr 2022 - July 2023
    DynamicViT - Making Vision Transformer faster by skipping layers - GitHub
    Addis Ababa University
    • Developed a transformer network that employs a reinforcement learning (RL) agent to learn to skip irrelevant layers during processing an input, resulting in a 40% increase in throughput compared to the vanilla ViT
  • Jun 2021 - May 2023
    DistillEmb - Distilling word-embeddings via contrastive learning - GitHub
    Addis Ababa University
    • Compressed word embeddings into a CNN to address out-of-vocabulary issues, conserve memory, and facilitate cross-lingual transfer, resulting in a 7% improvement in F1 accuracy and 1400% less memory usage, as well as out-of-the-box low resource cross-lingual transfer capabilities.
  • Sept 2021 - Jan 2022
    Boosting image retrieval with accurately labeled small data - GitHub
    Addis Ababa University
    • Explored the utilization of a detailed labeled dataset to enhance the feature set in an image retrieval task, enabling a small, shallow network to surpass the performance of a more complex, larger model
  • Apr 2018 - Sept 2019
    Morphological generation for Wolayita using CNN based Seq2Seq model
    Addis Ababa University
    • Constructed a CNN-based encoder-decoder morphological analyzer for Wolayita language utilizing fewer parameters than LSTM-based models
  • Apr 2018 - Jun 2019
    Morphology-rich alphasyllabary embeddings - GitHub
    Addis Ababa University
    • Examined the relationship between language morphology and corresponding word embeddings, proposing a technique to enhance the syntactic features of embeddings

Open Source Projects

  • Urban Land Usage Optimization Using NSGA II - Jupyter Notebook
  • ROS and Gazebo-based Delta-3 robot model - GitHub
  • Genetic Algorithm based classroom timetable generator - GitHub

Skills

Softskills Problem-solving, Teamwork, Communication skills, Attention to detail, Curiosity & Adaptability
Major ML Areas Deep Learning, Information Retrieval, Image Retrieval, Computer Vision, Language Models, Sequence Modeling, Reinforcement Learning, Image Processing, Prompt Engineering, Robotics, NLP, Boosting, Bayesian Optimization, ML Debugging, Feature Engineering, Data Cleaning & Pre-processing
Languages Python, C/C++, Java, C#, Kotlin, JavaScript, TypeScript, Julia
ML Frameworks PyTorch, TensorFlow, Keras, TensorFlow.Js, Lightning, Sci-kit Learn, Pandas, Gensim, VectorDB, MLFlow, AutoML, SciPy, ONNX, OpenCV, NLTK, SpaCy
ML Tools TensorBoard, Weight \& Biases, Matplotlib, Seaborn, Bokeh, Google Cloud Platform (GCP), ROS, Gazebo
Application Frameworks Angular, ReactJs, AspNet Core, NodeJs, Django, Flask, FastAPI, Android, ReactNative
Database MySQL, MS SQL Server, MongoDB, PostgreSQL, Redis, NoSQL
Software Engineering Git, Docker, Github Action, CI/CD, MLOps, DevOps, GraphQL, REST, GraphQL, gRPC, Testing, OOP, Architecture Design, Distributed & Parallel Computing

Leadership and Volunteering Activities

  • Oct 2023
    ICCV Student Volunteer
    International Conference on Computer Vision (ICCV) 2023 - Paris, France
  • May 2023
    ICLR Student Volunteer
    International Conference on Learning Representation (ICLR) 2023 - Kigali, Rwanda
  • Jan 2022
    Program Chair
    Deep Learning IndabaX ASTU Conference - Adama, Ethiopia
  • Oct 2021
    Ambassador of Ethiopia
    Zindi Africa - Addis Ababa, Ethiopia
  • Oct 2019
    General Co-Chair
    AI in Ethiopia Conference - Addis Ababa, Ethiopia
  • May 2019
    Tutorial Chair
    Deep Learning IndabaX WSU Conference - Wolaita Sodo, Ethiopia

Other Interests

  • Gaming, Basketball, Hiking, Traveling