Amanuel Mersha

Currently leading AI/ML at TestSavant.AI, building a autonomous adaptive security system for AI.

prof_pic.jpg

Addis Ababa Institute of Technology

King George St, 5 Kilo

Addis Ababa, Ethiopia

Hi there,

I’m currently working on building an autonomous and adaptive system to defend attacks against AI systems such as Chatbots, Agents, Multi-modal services and so on. Its a combination of both research and engineering, and we will soon release our product. If you want to learn more, please reach out.

I’m broadly interested in understanding and building intelligent systems. This include theoretical research on deep learning and applying them on novel tasks. Furthermore, making models more efficient while keeping performance very high is my a personal challenge. I also enjoy developing products, and my current work at TestSavant.AI is training LLMs and RL agents to defend attacks and learn autonomously as time goes by.

news

Sep 1, 2024 I have joined TestSavant.AI as Head of AI to lead on developing an adaptive and autonomous AI to defend attacks on AI systems.
May 1, 2024 I will be working as a visiting researcher at the SymbioticLab in the CSE department in University of Michigan till end of July. My research work will be developing algorithms in the area of large scale distributed LLM and MMLM training and inference.
Jul 17, 2023 I will be attending the DLRL Summer School at MILA in Montreal, CA from July 17th to 21st, 2023.
Jul 14, 2023 I finished my M.Sc. thesis titled “Reinforcement Learning Based Layer Skipping Vision Transformer for Efficient Inference” at Addis Ababa Institute of Technology.

selected publications

  1. dyna.jpg
    DynamicViT: Making Vision Transformer faster through layer skipping
    Amanuel Mersha, and Sammy Assefa
    Vision Transformers: Theory and Applications Workshop at NeurIPS 2022, Nov 2022
  2. distill-emb.png
    DistillEmb: Distilling Word Embeddings via Contrastive Learning
    Amanuel Mersha, and Stephen Wu
    Transfer Learning for NLP Workshop at NeurIPS 2022, Nov 2022
  3. wsdc-right.png
    Dynamic Transformer Network
    Amanuel Mersha
    Workshop on Dynamic Neural Networks at ICML 2022, Jul 2022
  4. morph.png
    Morphology-rich Alphasyllabary Embeddings
    Amanuel Mersha, and Stephen Wu
    Proceedings of the 12^th Language Resources and Evaluation Conference, Jan 2020