Aoxuan Silvia Zhang 🐰

Pronunciation: Ao-Sh-oo-en S-ih-l-v-ee-uh J-ah-ng

(she/her)

Computer Science and Engineering & Mathematics Student

Korea University

KAIST MLAI Lab

About Me

I am a Master’s student in the Machine Learning and Artificial Intelligence (MLAI) Lab at KAIST, advised by Prof. Sung Ju Hwang. I previously earned my B.S. in Computer Science and Engineering with a double major in Mathematics from Korea University.

My research interests center on large language models, generative modeling, and weight-space learning, with an emphasis on understanding and manipulating model representations for scalable and efficient learning. In particular, I am interested in weight generation, latent-space modeling of neural networks, and agentic systems built on top of foundation models. I work closely with Dr. Bruno Andreis and Dr. Bedionita Soro.

During my undergraduate studies and research training, I worked on hyperparameter optimization, weight-space model merging, and LLM-based agent workflows, and have co-authored papers at major machine learning venues. My current research at KAIST continues to investigate the theoretical and practical foundations of generative approaches to model design and adaptation.

Interests

Large Language Models · Generative Modeling · Weight-Space Learning · Agentic AI
📚 My Research
Welcome! I am Silvia Zhang, a Master's student at KAIST's MLAI Lab. I earned my B.S. in Computer Science and Engineering with a double major in Mathematics from Korea University. My research interests include large language models, generative modeling, weight-space learning, and agentic AI.
Featured Publications

Cost-Sensitive Freeze-Thaw Bayesian Optimization for Efficient Hyperparameter Tuning (NeurIPS 2025)

Dong Bok Lee, Aoxuan Silvia Zhang, Byungjoo Kim, Junhyeon Park, Juho Lee, Sung Ju Hwang, Hae Beom Lee
Projects
Nexus

Nexus is a clarity engine for life in a foreign world — a single interface that turns the chaos of unfamiliar systems, languages, and daily decisions into something intuitive and …

Experience

  1. AI Engineer

    DeepAuto.ai

    Agentic AI Systems

    • Contributed to the development of a three-stage agentic AI workflow (compile, implement, and execute) that supports structured workflow automation.
    • Assisted in building and testing modules that transform high-level workflow plans into atomic functions and execute them with LLM-powered agents.
  2. Research Intern

    KAIST MLAI Lab

    Weight Generation for Large Language Models

    • Conducted a comprehensive literature survey on generative models for weight generation and alternative approaches to weight-space learning in neural networks.
    • Ran and analyzed experimental results from existing codebases, and assisted in debugging and reproducing key experiments to validate methodologies.
    • Co-authored a paper (under review at ICLR 2026): Language Models Merging, proposing a framework for merging heterogeneous large language models in latent space.
    • Investigated weight distribution properties (kurtosis, compressibility) and contributed to the design and implementation of latent-space fusion experiments.
  3. AI Researcher

    DeepAuto.ai

    LLM Agent on Hyperparameter Optimization

    • Conducted a literature review on state-of-the-art hyperparameter optimization techniques for large language models (LLMs).
    • Analyzed existing codebases and replicated experiments to understand optimization workflows.
    • Implemented and tested existing optimization methods to assess their impact on model performance.
  4. Research Intern

    KAIST MLAI Lab

    Hyperparameter Optimization

    • Developed and implemented baseline optimization algorithms, including Bayesian Optimization with Hyperband (BOHB), Differential Evolution with Hyperband (DEHB), and Few-Shot Bayesian Optimization (FSBO), to efficiently optimize complex black-box functions.
    • Evaluated the performance of these algorithms on benchmark problems and real-world applications, demonstrating their effectiveness in sample-efficient hyperparameter tuning and optimization.
    • Co-authored a paper (NeurIPS 2025): Cost-Sensitive Freeze-Thaw Bayesian Optimization for Efficient Hyperparameter Tuning, introducing a novel cost-aware strategy to improve resource allocation in freeze-thaw Bayesian optimization.
  5. AWS AI & ML Scholarship Recipient

    Udacity

    AI Programming with Python

    • Participated in the AWS DeepRacer Student League and received the AWS AI & ML Scholarship.
    • Completed a collaborative virtual course that teaches programming tools and techniques fundamental to machine learning, with support from Udacity teachers in weekly group sessions.
    • Project 1: Use a Pre-trained Image Classifier to Identify Dog Breeds.
    • Project 2: Create an Image Classifier.
  6. Technical Consulting Virtual Intern

    SAP (via Forage)

    Participant in SAP's virtual experience program, hosted through Forage.

    • Completed practical task modules covering data assembly, data analysis, and presentation of results.

Education

  1. M.S. The Kim Jaechul Graduate School of AI

    KAIST MLAI Lab
    M.S. student at KAIST's Machine Learning and Artificial Intelligence (MLAI) Lab, advised by Prof. Sung Ju Hwang. My graduate research focuses on large language models, generative modeling, weight-space learning, and agentic workflows for scalable foundation models.
  2. B.S. Computer Science and Engineering & Mathematics

    Korea University
    B.S. in Computer Science and Engineering with a double major in Mathematics at Korea University.
  3. Exchange Program, Department of Mathematics

    The Hong Kong University of Science and Technology (HKUST)
    Exchange student at the HKUST School of Science (Mathematics), where I broadened my academic perspective in applied mathematics, optimization, and machine learning. Completed advanced coursework in statistics, stochastic processes, regression analysis, time series, and machine learning, providing a rigorous theoretical foundation for research on LLMs and generative modeling.
Blog

🌟 Something About Me

A glimpse into my interests, hobbies, and the things that bring me joy outside of research