

Biography: Yang Cao is an Associate Professor at the Department of Computer Science, Institute of Science Tokyo (Science Tokyo, formerly Tokyo Tech), and directing the Trustworthy Data Science and AI (TDSAI) Lab. He is passionate about studying and teaching on algorithmic trustworthiness in data science and AI. Two of his papers on data privacy were selected as best paper finalists in top-tier conferences IEEE ICDE 2017 and ICME 2020. He was a recipient of the IEEE Computer Society Japan Chapter Young Author Award 2019, Database Society of Japan Kambayashi Young Researcher Award 2021. His research projects were/are supported by JSPS, JST, MSRA, KDDI, LINE, WeBank, etc.
Abstract: Large language models are often fine-tuned on small and sensitive datasets, where individual training examples can strongly influence model behavior and lead to memorization and privacy leakage. This talk focuses on differential privacy in LLM fine-tuning and explains how DP acts as a learning constraint that limits the influence of individual samples, thereby mitigating such risks. We introduce the core intuition behind DP without assuming prior background, review practical DP fine-tuning mechanisms, and discuss how privacy and utility should be evaluated together. We also clarify what DP fine-tuning can and cannot protect, and outline open challenges in privacy-aware LLM adaptation.
Biography: Jiancheng Yang is a Principal Investigator at the ELLIS Institute Finland and Assistant Professor at Aalto University. He received his Bachelor's and PhD degrees from Shanghai Jiao Tong University, was a visiting fellow at Harvard, and completed postdoctoral training at EPFL. His research advances AI for health, with a focus on spatial intelligence, generative AI, and multimodal deep learning. He has authored 60+ publications and is recognized for developing MedMNIST. He is a Forbes 30 Under 30 honoree and a recipient of the WAIC Yunfan Award.
Abstract: The next step for medical AI is scaling. However, scaling medical AI faces intrinsic challenges: medical data are often limited, complex, and highly heterogeneous, while many existing AI techniques are not yet designed to fully exploit such data. This talk presents contributions unified by a common goal: building scalable medical AI through maximal reuse. First, I introduce LeFusion, a constrained generative framework for medical data synthesis (ICLR 2025 Spotlight), which provides guarantees on synthetic data quality and allows downstream models to better exploit underutilized information in limited real datasets. Second, I describe DiffAtlas, a new perspective on multimodal medical data (MICCAI 2025 Spotlight), where traditional input–output medical image segmentation is reformulated as an unconditional multimodal diffusion model guided only at inference time. Finally, I introduce PlaneCycle, a training-free, adapter-free operator that lifts arbitrary 2D foundation models into 3D by cyclically distributing spatial aggregation across orthogonal planes, adding no parameters while preserving pretrained inductive biases. Together, these works suggest a broader principle: medical AI can scale without scaling cost.
| Date | Speaker | Title | Materials |
|---|---|---|---|
| Mar 12, 2026 | Chenxi Yuan | Enhance Prediction of Alzheimer’s Disease with Generative AI | [Poster] |
| Mar 3, 2026 | Ren Wang | Robustness Through Collective Intelligence | [Poster] |
| Feb 28, 2026 | Huiyu Zhou | Constructing Masterpieces From Missing Pieces | [Poster] |
| Feb 10, 2026 | Guibo Luo | Benchmarking Multi-Party Privacy Computing and Exploring New Collaboration Paradigms | [Poster] |
| Feb 5, 2026 | Zeyu Zhang | Bridging Scene Understanding and Motion Generation in Robot Manipulation | [Poster] |
| Jan 28, 2026 | Md Sajid | Interpretable and Robust Randomized Neural Networks for Real-World Learning | [Poster] |
| Jan 21, 2026 | Bing Yang | Shape-Aware Deep Learning for AS-OCT Analysis: Segmentation and Structural Uncertainty | [Poster] |
| Jan 14, 2026 | Mengdi Zhao | Simulating Biological Intelligence: Bridging High-Fidelity Neuronal Modeling with Embodied Agents | [Poster] |
| Jan 7, 2026 | Xiangyu Chang | Research on Efficient and Fair Data Element Pricing Mechanisms | [Poster] |
| Dec 19, 2025 | Shujian Huang | Cross-lingual Knowledge Learning and Reasoning in Large Language Models | [Poster] |
| Dec 17, 2025 | Haotong Qin | Extreme Discretization: Towards Efficient Intelligence and Systems in the Scaling Era | [Poster] |
| Dec 3, 2025 | Ningning Ding | From Fair Unlearning Algorithms to Incentive-Compatible Mechanisms in Federated Unlearhing | [Poster] |
| Oct 23, 2025 | Raian Ali | AI Design vs. Human Attitude, Learning, and Dependency | [Poster] |
| Oct 22, 2025 | Yuwen Li | Higher Order Approximation Error Bounds for ReLU Neural Networks in Korobov Space | [Poster] |
| Oct 13, 2025 | Weijie Su | The ICML 2023 Ranking Experiment: Empirical Performance and Analysis of the Isotonic Mechanism | [Poster] |
| Sep 17, 2025 | Fanghui Liu | Bridging Theory and Practice: One-step Full Gradient Can Suffice for Low-rank Fine-tuning in LLMs | [Poster] |
| Aug 20, 2025 | Fan Yang | RNA Recognition and Targeted Degradation: Mechanisms and Engineering Strategies based on RNA-Binding Domains (RBDs) | [Poster] |
| Aug 4, 2025 | Tao Luo | The Theory of Parameter Condensation in Neural Networks | [Poster] |
| Jul 29, 2025 | Hangxin Liu | Embodied Mobile Manipulation: Trajectory Optimization vs. Diffusion | [Poster] |
| Jul 18, 2025 | Yide Liu | Constructing High-performance Robotic Insects With Origami Transmission Mechanism | [Poster] |
| Jul 15, 2025 | Hai Dong | Mobile Edge Intelligence: When AI Meets Mobile Edge Computing | [Poster] |
| Jun 30, 2025 | Guangyi Chen | Causal Representation Learning for Visual Understanding | [Poster] |