Research
20 publications · 225+ total citations
Curveball Steering: The Right Direction To Steer Isn't Always Linear
S Raval, HJ Song, L Wu, A Harrasse, J Phillips, A Abdullah
DreamReader: An Interpretability Toolkit for Text-to-Image Models
N Prakash, N Oozeer, M Lan, L Samkharadze, P Howard, RKW Lee, S Raval, ...
Narrow Fine-Tuning Erodes Safety Alignment in Vision-Language Agents
I Gulati, S Raval
Spectral Superposition: A Theory of Feature Geometry
G Ivanov, N Oozeer, S Raval, T Pejovic, S Upadhyay, A Abdullah
Measure What Matters: Psychometric Evaluation of AI with Situational Judgment Tests
A Yost, S Jain, S Raval, G Corser, A Roush, N Xu, J Hammack, ...
Mapping LLMs with Sparse Autoencoders
N Hussein, S Raval, E Reif, J Wilson, A Alberich, N Nanda, L Dixon, ...
Improving Mars Colour Camera Images Through a Multi-Step Enhancement Pipeline
S Raval, I Misra, SM Moorthi, D Dhar
Towards Mitigating Information Leakage when Evaluating Safety Monitors
G Boxo, A Neelappa, S Raval
Caught in the Act: A Mechanistic Approach to Detecting Deception
G Boxo, R Socha, D Yoo, S Raval
Where Do Doctors Disagree? Characterizing Decision Points for Safe RL in Choosing Vasopressor Treatment
E Brown, S Raval, A Rojas, J Yao, S Parbhoo, LA Celi, S Swaroop, ...
Augmenting X-ray Astronomical Representations with Scientific Knowledge through Contrastive Learning
JR Martínez-Galarza, NOP Vago, S Raval, C Cuesta-Lazaro, M Weber, DA Melis, A Accomazzi, C Garraffo, J Knutson, R Thill, CB Green, I Ahangama
MoE Lens: An Expert Is All You Need
M Chaudhari, I Gulati, N Hundia, P Karra, S Raval
Hypertrix: An Indicatrix for High-Dimensional Visualizations
S Raval, F Viégas, M Wattenberg
Designing a Dashboard for Transparency and Control of Conversational AI
Y Chen, A Wu, T DePodesta, C Yeh, K Li, NC Marin, O Patel, J Riecke, S Raval, ...
Explain-and-Test: An Interactive Machine Learning Framework for Exploring Text Embeddings
S Raval, C Wang, F Viégas, M Wattenberg
Identifying Structure in the MIMIC ICU Dataset
Z Chin, S Raval, F Doshi-Velez, M Wattenberg, LA Celi
SupCL-Seq: Supervised Contrastive Learning for Downstream Optimized Sequence Representations
H Sedghamiz, S Raval, E Santus, T Alhanai, M Ghassemi
Exploring a Unified Sequence-to-Sequence Transformer for Medical Product Safety Monitoring in Social Media
S Raval, H Sedghamiz, E Santus, T Alhanai, M Ghassemi, E Chersoni
Establishing a Nearly Closed Cycling Transition in a Polyatomic Molecule
L Baum, NB Vilas, C Hallas, BL Augenbraun, S Raval, D Mitra, JM Doyle
Assessment and Mitigation of Aerosol Airborne SARS-CoV-2 Transmission in Laboratory and Office Environments
BL Augenbraun, ZD Lasner, D Mitra, S Prabhu, S Raval, H Sawaoka, ...