Shawn Yin

News

Will be joining Cerebras Systems as a Machine Learning Research Engineer Intern, Summer 2026!

May 2026

Awarded the Suzanne McIntosh Master's Research Fellowship by NYU Courant!

Apr 2026

DREAM-R accepted to ICML 2026!

Apr 2026

DREAM-R accepted to the ES-Reasoning Workshop at ICLR 2026!

Mar 2026

Received an offer for the PhD program in Electrical and Computer Engineering at NYU!

Feb 2026

CodeQuant accepted to ICLR 2026!

Jan 2026

Research Interests

Representation Learning

Studying what makes a visual representation universal, useful, and "good".

Vision

Multimodal

Embodied

AI Efficiency

Hardware-software co-design across speculative decoding, quantization, and pruning.

Quantization

Speculative Decoding

Pruning

Agents

Building agent tooling and constructing benchmarks for evaluating agent capabilities.

Benchmark

Tool

Application

Selected Works

View all

DREAM-R: Multimodal Speculative Reasoning with RL-Based Refined Drafting, Precise Verification, and Fully Parallel Execution

ICML, 2026

Yunhai Hu, Zining Liu, Xiangyang Yin, Tianhua Xia, BO BAO, Eric Sather, Vithursan Thangarasa, Sai Qian Zhang

CodeQuant: Unified Clustering and Quantization for Enhanced Outlier Smoothing in Low-Precision Mixture-of-Experts

ICLR, 2026

Xiangyang Yin*, Xingyu Liu*, Tianhua Xia, BO BAO, Vithursan Thangarasa, Valavan Manohararajah, Eric Sather, Sai Qian Zhang

Selected Projects

View all

ResNet Sparse Distillation

The increasing depth and width of neural networks improve accuracy but also raise hardware requirements and slow down inference. This project proposes a distillation loss function that enables immediate weight and activation pruning on the student model after distillation.

Efficiency

JEPA Architecture Probing Model

This project designed a JEPA architecture model with CNN and MLPs as encoder and predictor. The main task is to predict the trajectory of an object in an environment with wall and door.

Representation Learning World Model

Blogs

View all

From SFT to RL: Reward and Policy Gradient

Where RLHF's reward signal comes from, and how policy gradient turns a sequence-level score into token-level updates.

LLM Post-Training

Jul 18, 2026 18 min read

From SFT to RL: The Two Degrees of Freedom

Why SFT is reference-distribution fitting, and how changing token weights and sampling turns it into RL.

LLM Post-Training

Jul 7, 2026 11 min read

Xiangyang (Shawn) Yin

News

May 2026

Apr 2026

Apr 2026

Mar 2026

Feb 2026

Jan 2026

Research Interests

Representation Learning

AI Efficiency

Agents

Selected Works

DREAM-R: Multimodal Speculative Reasoning with RL-Based Refined Drafting, Precise Verification, and Fully Parallel Execution

CodeQuant: Unified Clustering and Quantization for Enhanced Outlier Smoothing in Low-Precision Mixture-of-Experts

Selected Projects

ResNet Sparse Distillation

JEPA Architecture Probing Model

Blogs

From SFT to RL: Reward and Policy Gradient

From SFT to RL: The Two Degrees of Freedom