Research Interests
I am driven by a fascination with aligning AI systems with human values. My experience in data analysis and modeling has laid a strong foundation for exploring the frontiers of safe and reliable AI.
Publications
Research Focus
- Alignment
- Reasoning
- Red-teaming
- Mechanistic Interpretability
- "Hallucination" control