The Streaming Batch Model for Efficient and Fault-Tolerant Heterogeneous Execution
Frank Sifei Luan, Ron Yifeng Wang, Yile Gu, Ziming Mao, Charlotte Lin, Amog Kamsetty, Hao Chen, Cheng Su, Balaji Veeramani, Scott Lee, SangBin Cho, Clark Zinzow, Eric Liang, Ion Stoica, Stephanie Wang
arXiv
(Jan 2025)
A hybrid of the batch and streaming execution models that enables efficient and fault-tolerant heterogeneous execution, implemented in Ray Data.
An Extensible Architecture for Distributed Heterogeneous Processing
Frank Sifei Luan
UC Berkeley EECS Technical Reports
(Dec 2024)
Exoshuffle: An Extensible Shuffle Architecture
Sifei Luan, Samyukta Yagati, Stephanie Wang, Sean Kim, Kenneth Lien, Isaac Ong, Tony Hong, SangBin Cho, Eric Liang, Ion Stoica
SIGCOMM 2023
(Sep 2023)
An extensible shuffle architecture that offers competitive performance and scalability as well as greater flexibility than monolithic shuffle systems.
Exoshuffle-CloudSort: The 2022 CloudSort Benchmark Winner
Sifei Luan, Samyukta Yagati, Stephanie Wang, Sean Kim, Kenneth Lien, Isaac Ong, Tony Hong, SangBin Cho, Eric Liang, Ion Stoica
arXiv
(Jan 2023)
Winner of the 2022 CloudSort Benchmark (Indy category) for sorting 100 TB of data at $0.97/TB.
Balsa: Learning a Query Optimizer Without Expert Demonstrations
Zongheng Yang, Wei-Lin Chiang, Sifei Luan, Gautam Mittal, Michael Luo, Ion Stoica
SIGMOD 2022
(Jun 2022)
A query optimizer built by deep reinforcement learning.
Ownership: A Distributed Futures System for Fine-Grained Tasks
Stephanie Wang, Eric Liang, Edward Oakes, Ben Hindman, Sifei Luan, Audrey Cheng, Ion Stoica
NSDI 2021
(Apr 2021)
A decentralized object metadata ownership system for fine-grained distributed tasks.
AI in Software Engineering at Facebook
Johannes Bader, Sonia Seohyun Kim, Sifei Luan, Satish Chandra, Erik Meijer
IEEE Software
(Feb 2021)
Three productivity tools, deployed at Facebook, that learn patterns from software artifacts.
🏆 2021 IEEE Computer Society IEEE Software Magazine Best Paper Award
NeuroCard: One Cardinality Estimator for All Tables
Zongheng Yang, Amog Kamsetty, Sifei Luan, Eric Liang, Yan Duan, Xi Chen, Ion Stoica
VLDB 2020
(Jun 2020)
A join cardinality estimator that builds a single neural density estimator over an entire database.
Aroma: Code Recommendation via Structural Code Search
Sifei Luan, Di Yang, Celeste Barnaby, Koushik Sen, Satish Chandra
OOPSLA 2019
(Oct 2019)
A tool and technique for code recommendation in big code corpora to improve developer productivity.
🏆 2019 ACM SIGPLAN Distinguished Paper Award
Retrieval on Source Code: A Neural Code Search
Saksham Sachdev, Hongyu Li, Sifei Luan, Seohyun Kim, Koushik Sen, Satish Chandra
MAPL 2018
(Jun 2018)
A technique for searching over large codebases using natural language queries.