How I Got Here
2023: I started with the intuition that self-supervised approaches were undersold. The field was drunk on supervised fine-tuning; I wrote "The Future Remains Unsupervised" for Deep Learning Indaba as a counterweight.
2024: Then I got interested in failure modes. Why do models produce confident nonsense? The term "hallucination" felt too forgiving, so I started calling it confabulation, borrowing from neuroscience, and began building metrics to measure it.
2025: Now I'm going deeper: mechanistic interpretability. Can we detect when a model "knows" it's being evaluated? Can we find the exact pathways where reasoning goes wrong? The work is harder but more satisfying. I'm at Bluedot this year, focused on this full-time.
Active Projects
Mechanistic analysis of reasoning faithfulness in GPT-2 Small. A linear probe achieves 88% accuracy (ROC-AUC 0.949) detecting unfaithful chain-of-thought (CoT) from residual-stream activations. Identified 23 causal circuit components; steering vectors achieve ~72% success shifting unfaithful → faithful outputs.
Open: Do these circuits generalize across model scales and reasoning tasks?
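To make the probe setup concrete, here is a minimal sketch of a linear (logistic-regression) probe trained on residual-stream activations. Everything in it is a synthetic stand-in: the activations are random vectors separated along an arbitrary direction, not real GPT-2 Small activations, and the dimensions are just chosen to match GPT-2 Small's d_model of 768.

```python
import numpy as np

# Hypothetical sketch: a linear probe for (un)faithful CoT, trained on
# residual-stream activations. Data here is synthetic; in the real setup
# the activations would be cached from GPT-2 Small (d_model = 768).
rng = np.random.default_rng(0)
d_model, n = 768, 400

# Synthetic "faithful" vs "unfaithful" activations, separated along one
# unit direction so the probe has something to find.
direction = rng.normal(size=d_model)
direction /= np.linalg.norm(direction)
labels = rng.integers(0, 2, size=n)  # 1 = unfaithful
acts = rng.normal(size=(n, d_model)) + np.outer(labels * 2.0 - 1.0, direction) * 2.0

# Logistic-regression probe trained with plain gradient descent.
w = np.zeros(d_model)
b = 0.0
for _ in range(500):
    z = np.clip(acts @ w + b, -30, 30)   # clip to avoid exp overflow
    p = 1.0 / (1.0 + np.exp(-z))         # sigmoid
    w -= 0.5 * (acts.T @ (p - labels) / n)
    b -= 0.5 * np.mean(p - labels)

preds = (acts @ w + b) > 0
accuracy = np.mean(preds == labels)
print(f"probe accuracy: {accuracy:.2f}")
```

The learned weight vector `w` doubles as a candidate direction for the steering experiments: adding or subtracting it from the residual stream is the natural follow-up intervention.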
Do LLMs internally represent being monitored? A linear probe reaches 92.3% accuracy at layer 16, with 70.3% transfer to subtle cues.
Open: Does awareness direction causally affect policy decisions?
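The open causal question comes down to an intervention like the following sketch: project out the probed "awareness" direction from the residual stream, set it to a chosen value, and check whether downstream decisions change. The direction and activations below are synthetic stand-ins; in the real experiment they would come from layer 16 of the model.

```python
import numpy as np

# Hedged sketch of a causal intervention on a probed direction.
# aware_dir stands in for the awareness direction found by the probe.
rng = np.random.default_rng(1)
d_model = 768
aware_dir = rng.normal(size=d_model)
aware_dir /= np.linalg.norm(aware_dir)

def intervene(resid, coeff):
    # Project out the awareness component, then pin it to `coeff`.
    resid = resid - (resid @ aware_dir) * aware_dir
    return resid + coeff * aware_dir

resid = rng.normal(size=d_model)
steered = intervene(resid, -3.0)
print(f"awareness projection: {resid @ aware_dir:+.2f} -> {steered @ aware_dir:+.2f}")
```

If pinning this component changes the model's policy decisions while leaving other behaviour intact, that would be evidence the direction is causally used, not just linearly decodable.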
Testing the "LoRA Without Regret" methodology on numerical → natural-language forecasts.
Open: Can RLHF optimize for both accuracy and style?
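For readers unfamiliar with LoRA, the core trick can be sketched in a few lines: instead of updating a frozen weight matrix W, train a low-rank delta B·A added to it. The shapes and scaling below are illustrative assumptions, not the actual code from the project.

```python
import numpy as np

# Minimal LoRA sketch: a frozen weight plus a trainable low-rank adapter.
rng = np.random.default_rng(0)
d_in, d_out, r = 64, 64, 4             # rank r << d

W = rng.normal(size=(d_out, d_in))     # frozen pretrained weight
A = rng.normal(size=(r, d_in)) * 0.01  # trainable, small random init
B = np.zeros((d_out, r))               # trainable, zero init
alpha = 8.0                            # scaling hyperparameter

def forward(x):
    # Adapted forward pass: W x + (alpha / r) * B A x
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.normal(size=d_in)
# With B initialised to zero, the adapter is a no-op before training.
print("delta parameters:", A.size + B.size, "vs full:", W.size)
```

Because only A and B are trained, the update touches 512 parameters here versus 4096 for the full matrix, which is what makes per-task adapters cheap enough for RLHF-style fine-tuning experiments.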