Publications

Code and Dataset open sourced by NetMan:
http://github.com/NetManAIOps

2024:

  • A survey on intelligent management of alerts and incidents in IT services
    Qingyang Yu, Nengwen Zhao, Mingjie Li, Zeyan Li, Honglin Wang, Wenchi Zhang, Kaixin Sui, Dan Pei*.
    Journal of Network and Computer Applications (JNCA) (THU B, CCF C)
    Paper
  • Revisiting VAE for Unsupervised Time Series Anomaly Detection: A Frequency Perspective
    Zexin Wang, Changhua Pei*, Minghua Ma, Xin Wang, Zhihan Li, Dan Pei, Saravan Rajmohan, Dongmei Zhang, Qingwei Lin, Haiming Zhang, Jianhui li, Gaogang Xie.
    WWW 2024, Singapore, May 13 – May 17, 2024 (CCF A, accepted, to appear)
    Paper
  • Supervised Fine-Tuning for Unsupervised KPI Anomaly Detection for Mobile Web Systems
    Zhaoyang Yu, Shenglin Zhang, Mingze Sun, Yingke Li, Yankai Zhao, Xiaolei Hua, Lin Zhu, Xidao Wen, Dan Pei*.
    WWW 2024, Singapore, May 13 – May 17, 2024 (CCF A, accepted, to appear)
    Paper
  • Causality Enhanced Graph Representation Learning for Alert-Based Root Cause Analysis
    Zhaoyang Yu, Qianyu Ouyang, Changhua Pei*, Xin Wang, Wenxiao Chen, Liangfei Su, Huai Jiang, Xuanrun Wang, Jianhui Li, Dan Pei.
    CCGrid 2024, Philadelphia, May 6-9, 2024 (THU B, accepted, to appear)
    Paper

2023:

  • Beyond Sharing: Conflict-Aware Multivariate Time Series Anomaly Detection
    Haotian Si, Changhua Pei, Zhihan Li, Yadong Zhao, Jingjing Li, Haiming Zhang, Zulong Diao, Jianhui Li, Gaogang Xie, Dan Pei
    ESEC/FSE 2023 industry track. San Francisco, CA, USA, Dec. 3 – Dec. 9, 2023 (CCF A).
    Paper
  • From Point-wise to Group-wise: A Fast and Accurate Microservice Trace Anomaly Detection Approach
    Zhe Xie, Changhua Pei*, Wanxue Li, Huai Jiang, Liangfei Su, Jianhui Li, Gaogang Xie, and Dan Pei
    ESEC/FSE 2023 industry track. San Francisco, CA, USA, Dec. 3 – Dec. 9, 2023 (CCF A).
    PaperSlidesCode
  • AutoKAD: Empowering KPI Anomaly Detection with Label-Free Deployment
    Zhaoyang Yu, Changhua Pei*, Shenglin Zhang, Xidao Wen, Jianhui Li, Gaogang Xie, and Dan Pei.
    The 34th IEEE International Symposium on Software Reliability Engineering (ISSRE 2023). Florence, Italy, October 2023 (CCF B,  Best Paper Award).
    PaperSlides
  • Efficient and Robust Trace Anomaly Detection for Large-Scale Microservice Systems
    Shenglin Zhang, Zhongjie Pan, Heng Liu, Pengxiang Jin, Yongqian Sun*, Qianyu Ouyang, Jiaju Wang, Xueying Jia, Yuzhi Zhang, Hui Yang, Yongqiang Zou, and Dan Pei.
    The 34th IEEE International Symposium on Software Reliability Engineering (ISSRE 2023). Florence, Italy, October 2023 (CCF B, accepted, to appear).
    Paper
  • An Empirical Analysis of Anomaly Detection Issues for Multivariate Time Series
    Dongwen Li, Shenglin Zhang, Yongqian Sun*, Yang Guo, Zeyu Che, Shiqi Chen, Zhenyu Zhong, Minghan Liang, Minyi Shao, Mingjie Li, Shuyang Liu, Yuzhi Zhang, and Dan Pei.
    The 34th IEEE International Symposium on Software Reliability Engineering (ISSRE 2023). Florence, Italy, October 2023 (CCF B, accepted, to appear).
    Paper
  • Lindorm TSDB: A Cloud-native Time-series Database for Large-scale Monitoring Systems
    Chunhui Shen, Qianyu Ouyang, Feibo Li, Zhipeng Liu, Longcheng Zhu, Yujie Zou, Qing Su, Tianhuan Yu, Yi Yi, Jianhong Hu, Cen Zheng, Bo Wen, Hanbang Zheng, Lunfan Xu, Sicheng Pan, Bin Wu, Xiao He, Ye Li, Jian Tan, Sheng Wang, Dan Pei, Wei Zhang, Feifei Li
    VLDB 2023, Industrial, Applications, and Experience Track (CCF A, THU A)
    PaperCode
  • Generic and Robust Root Cause Localization for Multi-Dimensional Data in Online Service Systems
    Zeyan Li, Junjie Chen, Yihao Chen, Chengyang Luo, Yiwei Zhao, Yongqian Sun, Kaixin Sui, Xiping Wang, Dapeng Liu, Xing Jin, Qi Wang, Dan Pei
    Journal of Systems and Software, Volume 203Issue CSep 2023, May 4, 2023 (CCF B类,TH-CPL B类)
    PaperCode
  • 智能运维的实践:现状与标准化
    包航宇, 殷康璘*, 曹立, 李世宁, 孙永敬, 尹汇锋, 汤汝鸣, 侯岳, 王士强, 裴丹, 杨晓勤, 王立新
    软件学报,2023,34(9):2204-2230 (THU B)
    Paper
  • Unsupervised Anomaly Detection on Microservice Traces through Graph VAE
    Zhe Xie, Haowen Xu, Wenxiao Chen, Wanxue Li, Huai Jiang, Liangfei Su, Hanzhang Wang, Dan Pei*
    WWW 2023, Austin, Texas, USA, April 30 – May 4, 2023 (CCF A)
    PaperSlidesCode
  • CMDiagnostor: An Ambiguity-Aware Root Cause Localization Approach Based on Call Metric Data
    Qingyang Yu, Changhua Pei*, Bowen Hao, Mingjie Li, Zeyan Li, Shenglin Zhang, Xianglin Lu, Rui Wang, Jiaqi Li, Zhenyu Wu, Dan Pei
    WWW 2023, Austin, Texas, USA, April 30 – May 4, 2023 (CCF A)
    PaperCode
  • LogKG: Log Failure Diagnosis through Knowledge Graph
    Yicheng Sui, Yuzhe Zhang, Jianjun Sun, Ting Xu, Shenglin Zhang*, Zhengdan Li, Yongqian Sun, Fangrui Guo, Junyu Shen, Yuzhi Zhang, Dan Pei, Xiao Yang, Li Yu.
    IEEE Transactions on Services Computing, 2023 (CCF A, accepted, to appear).
    Paper
  • Robust Failure Diagnosis of Microservice System through Multimodal Data
    Shenglin Zhang, Pengxiang Jin, Zihan Lin, Yongqian Sun*, Bicheng Zhang, Sibo Xia, Zhengdan Li, Zhenyu Zhong, Minghua Ma, Wa Jin, Dai Zhang, Zhenyu Zhu, Dan Pei.
    IEEE Transactions on Services Computing, 2023 (CCF A, accepted, to appear).
    Paper
  • Robust Multimodal Failure Detection for Microservice Systems
    Chenyu Zhao, Minghua Ma, Zhenyu Zhong, Shenglin Zhang*, Zhiyuan Tan, Xiao Xiong, Lulu Yu, Jiayi Feng, Yongqian Sun, Yuzhi Zhang, Dan Pei, Qingwei Lin, Dongmei Zhang.
    THE 29TH ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Long Beach, CA, USA, August 2023 (CCF A, accepted, to appear).
    Paper
  • Efficient and Robust KPI Outlier Detection for Large-Scale Datacenters
    Yongqian Sun, Daguo Cheng, Tiankai Yang, Shenglin Zhang*, Man Zhu, Xiao Xiong, Qiliang Fan, Minghan Liang, Dan Pei, Tianchi Ma, Yu Chen.
    IEEE Transactions on Computers, 2023 (CCF A, accepted, to appear).
    Paper
  • LogSummary: Unstructured Log Summarization for Software Systems
    Weibin Meng, Yuzhe Zhang, Federico Zaiter, Ying Liu, Shenglin Zhang*, Shimin Tao, Yichen Zhu, Tao Han, Yongpeng Zhao, En Wang, Yuzhi Zhang, Dan Pei.
    IEEE Transactions on Network and Service Management (IEEE TNSM), January 2023 (SCI Indexed, IF: 4.758).
    Paper
  • A Study on Response Time Anomaly Diagnosis for Search Service 
    Sibo Xia, Minghua Ma, Pengxiang Jin, Liyue Cui, Shenglin Zhang*, Wa Jin, Yongqian Sun, Dan Pei. Journal of Computer Research and Development, 2023 (CCF T1, in Chinese, accepted)
    Paper

2022:

  • Identifying Erroneous Software Changes through Self-Supervised Contrastive Learning on Time Series Data
    Xuanrun Wang, Kanglin Yin, Qianyu Ouyang, Xidao Wen, Shenglin Zhang, Wenchi Zhang, Li Cao, Jiuxue Han, Xing Jin, Dan Pei
    ISSRE 2022, Charlotte, North Carolina, USA, October 31 – November 3, 2022 (CCF B)
    PaperSlidesVideoCode
  • Effective Attribute Selection for Multi-dimensional Root Cause Analysis
    Yiran Cheng, Bo Cheng, Pengxiang Jin, Yongqian Sun*, Xiaohui Nie, Nengwen Zhao, Shenglin Zhang, Dan Pei
    ISSRE 2022, Charlotte, North Carolina, USA, October 31 – November 3, 2022 (CCF B)
    PaperSlidesVideoCode
  • Constructing Large-Scale Real-World Benchmark Datasets for AIOps
    Zeyan Li, Nengwen Zhao, Shenglin Zhang, Yongqian Sun, Pengfei Chen, Xidao Wen, Minghua Ma, Dan Pei
    arXiv
  • Situation-Aware Multivariate Time Series Anomaly Detection through Active Learning and Contrast VAE-based Models in Large Distributed Systems
    Zhihan Li, Youjian Zhao, Yitong Geng, Zhanxiang Zhao, Hanzhang Wang, Wenxiao Chen, Huai Jiang, Amber Vaidya, Liangfei Su, Dan Pei*.
    JSAC 2022, Series on Machine Learning for Communications and Networks (CCF A)
    PaperSlides
  • Actionable and Interpretable Fault Localization for Recurring Failures in Online Service Systems
    Zeyan Li, Nengwen Zhao, Mingjie Li, Xianglin Lu, Lixin Wang, Dongdong Chang,  Xiaohui Nie, Li Cao, Wenzhi Zhang, Kaixin Sui, Yanhua Wang, Xu Du, Guoqiang Duan, Dan Pei*
    ESEC/FSE 2022, Nov 14-18, 2022, Singapore (CCF-A)
    PaperSlidesVideoCode
  • Mining Fluctuation Propagation Graph among Time Series with Active Learning
    Mingjie Li, Minghua Ma, Xiaohui Nie, Kanglin Yin, Li Cao, Xidao Wen*, Zhiyun Yuan, Duogang Wu, Guoying Li, Wei Liu, Xin Yang, Dan Pei
    DEXA 2022, Vienna, Austria, Aug 22-24, 2022 (CCF C)
    PaperSlides
  • Causal Inference-Based Root Cause Analysis for Online Service Systems with Intervention Recognition
    Mingjie Li, Zeyan Li, Kanglin Yin, Xiaohui Nie, Wenchi Zhang, Kaixin Sui, Dan Pei*
    KDD 2022, Washington, DC, USA, Aug 14-18, 2022 (CCF A)
    PaperSlidesVideoCode
  • Generic and Robust Performance Diagnosis via Causal Inference for OLTP Database systems
    Xianglin Lu, Zhe Xie, Zeyan Li, Mingjie Li, Xiaohui Nie, Nengwen Zhao, Qingyang Yu, Shenglin Zhang, Kaixin Sui, Lin Zhu* and Dan Pei.
    CCGrid 2022, 16-19 May 2022, Italy. (THU B)
    PaperSlides
  • Robust Anomaly Clue Localization of Multi-dimensional Derived Measure for Online Video Services
    Yongqian Sun, Daguo Cheng, Pengxiang Jin, Quan Ding, Shenglin Zhang*, Xu Chen, Yuzhi Zhang, Minghan Liang, Dan Pei, Jianyan Zheng, Sen Luo, Xinyu Tang.
    IEEE Transactions on Services Computing. (CCF B)
    PaperSlides
  • Robust System Instance Clustering for Large-Scale Web Services.
    Shenglin Zhang, Dongwen Li, Zhenyu Zhong, Jun Zhu, Minghan Liang, Jiexi Luo, Yongqian Sun*, Ya Su, Sibo Xia, Zhongyou Hu, Yuzhi Zhang, Dan Pei, Jiyan Sun and Yinlong Liu.
    The Web Conference (WWW 2022), Virtual Conference, April 25-29, 2022 (CCF A)
    PaperSlides
  • 基于调用链控制流分析的大型微服务系统性能建模与异常定位
    于庆洋,白晓颖*,李明杰,李奇原,刘涛,刘泽胤,裴丹
    软件学报,2022,33(5):1849-1864 (THU B)
    Paper

2021:

  • Robust KPI Anomaly Detection for Large-Scale Software Services with Partial Labels
    Shenglin Zhang, Chenyu Zhao, Yicheng Sui, Ya Su, Yongqian Sun, Yuzhi Zhang, Dan Pei, Yizhe Wang
    ISSRE 2021 , Wuhan, China, Oct 25, 2021.
    PaperSlides
  • Identifying Root-Cause Metrics for Incident Diagnosis in Online Service Systems
    Canhua Wu, Nengwen Zhao, Lixin Wang, Xiaoqin Yang, Shining Li, Ming Zhang, Xing Jin, Xidao Wen, Xiaohui Nie, Wenchi Zhang, Kaixin Sui, Dan Pei
    ISSRE 2021 , Wuhan, China, Oct 25, 2021.
    PaperSlides
  • An Empirical Investigation of Practical Log Anomaly Detection for Online Service Systems
    Nengwen Zhao, Honglin Wang, Zeyan Li, Xiao Peng, Gang Wang, Zhu Pan, Yong Wu, Zhen Feng, Xidao Wen ,Wenchi Zhang,Kaixin Sui ,Dan Pei
    ESEC/FSE 2021 (industry track), Athens, Greece, Aug 23, 2021.
    PaperSlides
  • Identifying Bad Software Changes via Multimodal Anomaly Detection for Online Service Systems
    Nengwen Zhao, Junjie Chen, Zhaoyang Yu, Honglin Wang, Jiesong Li, Bin Qiu, Hongyu Xu, Wenchi Zhang, Kaixin Sui, Dan Pei
    ESEC/FSE 2021, Athens, Greece, Aug 23, 2021.
    PaperSlides
  • Jump-Starting Multivariate Time Series Anomaly Detection for Online Service Systems
    Minghua Ma, Shenglin Zhang, Junjie Chen, Jim Xu, Haozhe Li, Yongliang Lin, Xiaohui Nie, Bo Zhou, Yong Wang, Dan Pei
    ATC 2021, virtual event, July 14, 2021.
    PaperSlides
  • Multivariate Time Series Anomaly Detection and Interpretation using Hierarchical Inter-Metric and Temporal Embedding
    Zhihan Li, Youjian Zhao, Jiaqi Han, Ya Su, Rui Jiao, Xidao Wen, Dan Pei
    KDD 2021, August 14-18, 2021, virtual conference
    PaperSlidesVideoCode
  • Practical Root Cause Localization for Microservice Systems via Trace Analysis
    Zeyan Li, Junjie Chen, Rui Jiao, Nengwen Zhao, Zhijun Wang, Shuwei Zhang, Yanjun Wu, Long Jiang, Leiqin Yan, Zikai Wang, Zhekang Chen, Wenchi Zhang, Xiaohui Nie, Kaixin Sui, Dan Pei
    IWQoS 2021, June 25-28, 2021, virtual conference
    PaperDatasetSlidesVideo
  • Detecting Outlier Machine Instances through Gaussian Mixture Variational Autoencoder with One Dimensional CNN
    Ya Su, Youjian Zhao, Ming Sun, Shenglin Zhang, Xidao Wen, Yongsu Zhang, Xian Liu, Xiaozhou Liu, Junliang Tang, Wenfei Wu, Dan Pei
    IEEE TC (IEEE Transactions on Computers),2021-03-10
    Paper (A journal paper, no slides)
  • CTF: Anomaly Detection in High-Dimensional Time Series with Coarse-to-Fine Model Transfer
    Ming Sun, Ya Su, Shenglin Zhang, Yuanpu Cao, Yuqing Liu, Dan Pei, Wenfei Wu, Yongsu Zhang, Xiaozhou Liu, Junliang Tang
    INFOCOM2021, May 10 – 13, 2021, virtual conference
    PaperSlidesVideo
  • LogClass: Anomalous Log Identification and Classification with Partial Labels
    Weibin Meng, Ying Liu, Shenglin Zhang, Federico Zaiter, Yuzhe Zhang, Yuheng Huang, Zhaoyang Yu, Yuzhi Zhang, Lei Song, Ming Zhang and Dan Pei
    IEEE TNSM (IEEE Transactions on Network and Service Management) 2021
    PaperCode

2020:

  • FluxInfer: Automatic Diagnosis of Performance Anomaly for Online Database System
    Ping Liu, Shenglin Zhang, Yongqian Sun, Yuan Meng, Jiahai Yang, Dan Pei
    IPCCC2020, November 6 – 8, 2020, virtual conference
    PaperSlides
  • VAEPP: Variational Autoencoder with a Pull-back Prior
    Wenxiao Chen, Wenda Liu, Zhenting Cai, Haowen Xu, Dan Pei
    ICONIP2020, November 18 – 22, 2020, Online
    PaperSlides
  • Shallow VAEs with RealNVP Prior Can Perform as Well as Deep Hierarchical VAEs
    Haowen Xu, Wenxiao Chen, Jinlni Lai, Zhihan Li, Youjian Zhao, Dan Pei
    ICONIP2020, November 18 – 22, 2020, Online
    PaperSlides
  • LogTransfer: Cross-System Log Anomaly Detection for Software Systems with Transfer Learning
    Rui Chen, Shenglin Zhang, Dongwen Li, Yuzhe Zhang, Fangrui Guo, Weibin Meng, Dan Pei, Yuzhi Zhang, Xu Chen, Yuqing Liu
    ISSRE2020, Oct 12 – 15, 2020, Coimbra, Portugal
    PaperSlides
  • Unsupervised Detection of Microservice Trace Anomalies through Service-Level Deep Bayesian Networks
    Ping Liu, Haowen Xu, Qianyu Ouyang, Rui Jiao, Zhekang Chen,  Shenglin Zhang, Jiahai Yang, Linlin Mo, Jice Zeng, Wenman Xue, Dan Pei
    ISSRE2020, Oct 12 – 15, 2020, Coimbra, Portugal
    PaperSlides
  • Real-time Incident Prediction for Online Service Systems
    Nengwen Zhao, Junjie Chen, Zhou Wang , Xiao Peng , Gang Wang, Yong Wu, Fang Zhou, Zhen Feng, Xiaohui Nie, Wenchi Zhang, Kaixin Sui, Dan Pei
    ESEC/FSE 2020,Sacramento, California, United States, November 8,  2020
    PaperSlidesVideo
  • A Practical Machine Learning-Based Framework to Detect DNS Covert Communication in Enterprises
    Ruming Tang, Cheng Huang, Yanti Zhou, Haoxian Wu, Xianglin Lu, Yongqian Sun, Qi Li, Jinjin Li, Weiyao Huang, Siyuan Sun, Dan Pei
    SecureComm 2020, Cyberspace , October 21-23, 2020
    PaperSlides
  • Efficient and Robust Syslog Parsing for Network Devices in Datacenter Networks
    Shenglin Zhang ; Ying Liu ; Weibin Meng ; Jiahao Bu ; Sen Yang ; Yongqian Sun ; Dan Pei ; Jun Xu ; Yuzhi Zhang ; Lei Song ; Ming Zhang
    IEEE Access
    Paper (A journal paper, no slides)
  • Diagnosing Root Causes of Intermittent Slow Queries in Cloud Databases
    Minghua Ma, Zheng Yin, Shenglin Zhang, Sheng Wang, Christopher Zheng, Xinhao Jiang, Hanwen Hu, Cheng Luo, Yilin Li, Nengjun Qiu, Feifei Li, Changcheng Chen, Dan Pei
    VLDB 2020, Tokyo, Japan, August 31, 2020.
    PaperSlidesVideo
  • A Semantic-aware Representation Framework forOnline Log Analysis
    Weibin Meng, Ying Liu, Yuheng Huang, Shenglin Zhang, Federico Zaiter, Bingjin Chen, Dan Pei
    ICCCN 2020, Honolulu, Hawaii, USA, Auguest 3 – August 6, 2020.
    PaperSlidesCode
  • LogParse: Making Log Parsing Adaptive through Word Classification
    Weibin Meng, Ying Liu, Federico Zaiter, Shenglin Zhang, Yihao Chen, Yuzhe Zhang, Yichen Zhu, En Wang, Ruizhi Zhang, Shimin Tao, Dian Yang, Rong Zhou, Dan Pei
    ICCCN 2020, Honolulu, Hawaii, USA, Auguest 3 – August 6, 2020.
    PaperSlidesCode
  • Practical and White-Box Anomaly Detection through Unsupervised and Active Learning
    Yao Wang, Zhaowei Wang, Zejun Xie, Nengwen Zhao, Junjie Chen, Wenchi Zhang, Kaixin Sui, Dan Pei
    ICCCN 2020, Honolulu, Hawaii, USA, August 3 – August 6, 2020.
    PaperSlides
  • Localizing Failure Root Causes in a Microservice through Causality Inference
    Yuan Meng , Shenglin Zhang, Yongqian Sun, Ruru Zhang , Zhilong Hu, Yiyin Zhang, Chenyang Jia, Zhaogang Wang, Dan Pei
    IWQOS2020, Virtual Conference, 15-17 June 2020
    PaperSlidesVideo
  • Unsupervised Clustering through Gaussian Mixture Variational AutoEncoder with Non-Reparameterized Variational Inference and Std Annealing
    Zhihan Li, Youjian Zhao, Haowen Xu, Wenxiao Chen, Shangqing Xu, Yilin Li, Dan Pei
    IJCNN (WCCI) 2020, Virtual Conference, Jul 19-24, 2020
    PaperSlidesVideo
  • Understanding and Handling Alert Storm for Online Service Systems
    Nengwen Zhao, Junjie Chen, Xiao Peng, Honglin Wang, Xinya Wu, Yuanzong Zhan, Zikai Chen , Xiangzhong Zheng, Xiaohui Nie, Gang Wang, Yong Wu, Fang Zhou, Wenchi Zhang, Kaixin Sui, Dan Pei
    ICSE SEIP 2020, Virtual Conference,  Jul 6-11 2020
    PaperSlides
  • Automatically and Adaptively Identifying Severe Alerts for Online Service Systems
    Nengwen Zhao, Panshi Jin, Lixin Wang, Xiaoqin Yang, Rong Liu, Wenchi Zhang, Kaixin Sui, Dan Pei
    INFOCOM 2020, Virtual Conference, Jul 6-9, 2020
    PaperSlidesVideo
  • ZeroWall: Detecting Zero-Day Web Attacks through Encoder-Decoder Recurrent Neural Networks
    Ruming Tang, Zheng Yang, Zeyan Li, Weibin Meng, Haixin Wang, Qi Li, Yongqian Sun, Dan Pei, Tao Wei, Yanfei Xu, Yan Liu
    INFOCOM 2020, Cyberspace, Jul 6-9, 2020
    PaperSlidesVideo
  • Batch Job Run Time Prediction for Auto-Scaling in the Cloud
    Minghua Ma, Christopher Zheng, Junjie Chen, Yilin Li, Dan Pei
    CIEMCS 2020 , New York, NY, USA, February 7, 2020.
    Paper (Poster, no slides)

2019:

  • The Search for Sparse, Robust Neural Networks
    Justin Cosentino*, Federico Zaiter*, Dan Pei, Jun Zhu
    SRDM 2019 , Vancouver, British Columbia, Canada, December 13th, 2019.
    PaperPoster
  • FluxRank: A Widely-Deployable Framework to Automatically Localizing Root Cause Machines for Software Service Failure Mitigation
    Ping Liu,Yu Chen,Xiaohui Nie,Jing Zhu,Shenglin Zhang,Kaixin Sui,Ming Zhang,Dan Pei
    ISSRE 2019, Berlin, Germany, Oct 28-31, 2019
    PaperSlides
  • Generic and Robust Localization of Multi-Dimensional Root Causes 
    Zeyan Li, Chengyang Luo, Yiwei Zhao, Yongqian Sun, Kaixin Sui, Xiping Wang, Dapeng Liu, Xing Jin, Qi Wang and Dan Pei
    ISSRE 2019, Berlin, Germany, Oct 28-31, 2019
    PaperSlidesDatasetCode
  • Walking without Friends: Publishing Anonymized Trajectory Dataset without Leaking Social Relationships
    Kai Zhao, Zhen Tu, Fengli Xu, Yong Li, Pengyu Zhang, Dan Pei, Li Su, Depeng Jin
    IEEE TNSM (IEEE Transactions on Network and Service Management)
    Paper (A journal paper, no slides)
  • Collaborative Learning of Human Behavior: An Empirical Study on Location Prediction
    Yan Lu, Yuanchao Shu, Xu Tan, Yunxin Liu, Mengyu Zhou, Qi Chen, Dan Pei
    SEC 2019 (The Fourth ACM/IEEE Symposium on Edge Computing), Washington DC, November 7-9, 2019
    Paper
  • Personalized Context-aware Re-ranking for E-commerce Recommender Systems
    Changhua Pei, Yi Zhang, Yongfeng Zhang, Fei Sun, Xiao Lin, Hanxiao Sun, Jian Wu, Peng Jiang, Wenwu Ou, Dan Pei
    RecSys 2019, Copenhagen, Denmark, September 16-20, 2019.
    PaperSlides
  • LogAnomaly: Unsupervised Detection of Sequential and Quantitative Anomalies in Unstructured Logs
    Weibin Meng, Ying Liu, Yichen Zhu, Shenglin Zhang, Dan Pei, Yuqing Liu, Yihao Chen, Ruizhi Zhang, Shimin Tao, Pei Sun, Rong Zhou
    IJCAI 2019, Macao, China, August 10-16 2019.
    PaperSlides
  • Robust Anomaly Detection for Multivariate Time Series through Stochastic Recurrent Neural Network
    Ya Su, Youjian Zhao, Chenhao Niu, Rong Liu, Wei Sun, Dan Pei*
    KDD 2019, Anchorage, Alaska, USA, August 4 – 8 2019 (CCF A)
    PaperSlidesCodeDataset
  • Automatic and Generic Periodicity Adaptation for KPI Anomaly Detection 
    Nengwen Zhao, Jing Zhu , Yao Wang, Minghua Ma, Wenchi Zhang, Dapeng Liu, Ming Zhang, Dan Pei
    IEEE TNSM (IEEE Transactions on Network and Service Management), 2019,  Volume: 16 , Issue: 3 , Sept. 2019, Page(s): 1170 – 1183
    Paper(A journal paper, no slides)
  • CoFlux: Robustly Correlating KPIs by Fluctuations for Service Troubleshooting
    Ya Su, Youjian Zhao, Wentao Xia, Rong Liu, Jiabao Bu, Jing Zhu, Yuanpu Cao, Haibin Li, Chenhao Niu, Yiyin Zhang, Zhaogang Wang, Dan Pei
    IWQOS 2019, Phoenix, Arizona, USA, June 24-25 2019
    PaperSlides
  • Causal Analysis of the Unsatisfying Experience in Realtime Mobile Multiplayer Games in the Wild
    Yuan Meng,Shenglin Zhang, Zijie Ye, Benliang Wang, Zhi Wang,Yongqian Sun, Xitong Liu, Shuai Yang, Dan Pei
    ICME 2019, Shanghai, China, July 8-12, 2019
    PaperPoster
  • Dynamic TCP Initial Windows and Congestion Control Schemes through Reinforcement Learning
    Xiaohui Nie, Youjian Zhao, Zhihan li, Guo Chen, Kaixin Sui, Jiyang Zhang, Zijie Ye, Dan Pei
    JSAC 2019, Artificial Intelligence and Machine Learning for Networking and Communications
    Paper (A journal paper, no slides)
  • Unsupervised Anomaly Detection for Intricate KPIs via Adversarial Training of VAE
    Wenxiao Chen, Haowen Xu, Zeyan Li, Dan Pei, Jie Chen, Hong Lin, Yang Feng, Zhaogang Wang
    INFOCOM 2019, Paris, France, April 29-May 2, 2019.
    PaperSlides
  • Label-Less: A Semi-automatic Labelling Tool for KPI Anomalies
    Nengwen Zhao, Jing Zhu, Rong Liu, Dapeng Liu, Ming Zhang, Dan Pei
    INFOCOM 2019, Paris, France, April 29 – May 2, 2019.
    Final Version  Please note: the   original version of this paper greatly understated how closely this work is based on the work of Dr. Keogh and collaborators, and we sincerely apologize for this.
  • 基于聚类的多维数据热点发现算法
    邹磊,朱晶,聂晓辉,苏亚 ,裴丹,孙宇
    小型微型计算机系统,465-471 页,第40卷,第3期, 2019年3月
    Paper  (A journal paper, no slides)

2018:

  • 一种无监督的数据库用户行为异常检测方法
    李海斌,李  琦,汤汝鸣,吴  珺,吕志远,裴  丹,史俊杰,董  旭,房双德,杨一飞,吴  烨
    小型微型计算机系统,第39卷,第11期,2018年11月
    Paper  (A journal paper, no slides)
  • Rapid Deployment of Anomaly Detection Models for Large Number of Emerging KPI Streams
    Jiahao Bu, Ying Liu , Shenglin Zhang , Weibin Meng , Qitong Liu, Xiaotian Zhu, Dan Pei
    IPCCC 2018, Orlando, Florida, USA, November 17-19, 2018.
    PaperSlides
  • The Frame Latency of Personalized Livestreaming Can Be Significantly Slowed Down by WiFi
    Guoshun Nan, Xiuquan Qiao, Jiting Wang, Zeyan Li, Jiaohao Nu, Mengyu Zhou, Changhua Pei, Dan Pei
    IPCCC 2018, Orlando, Florida, USA, November 17-19, 2018.
    PaperSlides
  • Robust and Unsupervised KPI Anomaly Detection Based on Conditional Variational Autoencoder
    Zeyan Li, Wenxiao Chen, Dan Pei
    IPCCC 2018, Orlando, Florida, USA, November 17-19, 2018.
    PaperVideoSlidesCode
  • Robust and Rapid Adaption for Concept Drift in Software System Anomaly Detection
    Minghua Ma, Shenglin Zhang, Dan Pei, Xin Huang, Hongwei Dai
    ISSRE 2018 Best Paper, Memphis, TN, USA, October 15-18, 2018.
    PaperVideoSlides
  • The DevOps Lab Platform for Managing Diversified Projects in Educating Agile Software Engineering.
    Xiaoying Bai; Dan Pei; Mingjie Li; Shanshan Li
    2018 IEEE Frontiers in Education Conference (FIE). San Jose, CA, Oct 3 – Oct 6, 2018.
    Paper
  • Reducing Web Latency through Dynamically Setting TCP Initial Window with Reinforcement Learning
    Xiaohui Nie, Youjian Zhao, Dan Pei, Guo Chen, Kaixin Sui, Jiyang Zhang
    IWQoS 2018, Banff, Alberta, Canada, June 4-6, 2018.
    PaperSlides
  • Robust and Rapid Clustering of KPIs for Large-Scale Anomaly Detection
    Zhihan Li, Youjian Zhao, Rong Liu, Dan Pei
    IWQoS 2018, Banff, Alberta, Canada, June 4-6, 2018.
    Paper Slides
  • Device-Agnostic Log Anomaly Classification with Partial Labels
    Weibin Meng, Ying Liu, Shenglin Zhang, Dan Pei, Hui Dong, Lei Song, Xulong Luo
    IWQoS 2018, Banff, Alberta, Canada, June 4-6, 2018. (Short paper)
    PaperSlidesCode
  • HotSpot: Anomaly Localization for Additive KPIs with Multi-Dimensional Attributes
    Yongqian Sun, Youjian Zhao, Ya Su, Dapeng Liu, Xiaohui Nie, Yuan Meng, Shiwen Cheng, Dan Pei, Shenglin Zhang, Xianping Qu, Xuanyou Guo
    IEEE Access 2018
    Paper  (A journal paper, no slides)
  • PreFix: Switch Failure Prediction in Datacenter Networks
    Shenglin Zhang, Ying Liu, Weibin Meng, Zhiling Luo, Jiahao Bu, Sen Yang, Peixian Liang, Dan Pei, Jun (Jim) Xu, Yuzhi Zhang, Yu Chen, Hui Dong, Xianping Qu, Lei Song.
    SIGMETRICS 2018, Irvine, California, USA, June 18-22, 2018.
    PaperSlidesDataset
  • Continuous Delivery of Personalized Assessment and Feedback in Agile Software Engineering Projects
    Xiaoying Bai, Mingjie Li, Dan Pei, Shanshan Li, Deming Ye
    ICSE 2018 “Software Engineering Education and Training” Track. May 27 – 3 June 2018, Gothenburg, Sweden
    Paper
  • Preventing Wi-Fi Privacy Leakage: A User Behavioral Similarity Approach
    Xiuping Han, Zhi Wang and Dan Pei
    ICC 2018, Kansas City, MO, USA, 20-24 May 2018
    Paper
  • Unsupervised Anomaly Detection via Variational Auto-Encoder for Seasonal KPIs in Web Applications
    Haowen Xu, Wenxiao Chen, Nengwen Zhao, Zeyan Li, Jiahao Bu, Zhihan Li, Ying Liu, Youjian Zhao, Dan Pei*, Yang Feng, Jie Chen, Zhaogang Wang and Honglin Qiao
    WWW 2018, Lyon, France, 23-27 April 2018 (CCF A)
    PaperSlidesCode

 

2017:

  • 基于机器学习的智能运维
    裴丹,张圣林,裴昶华
    中国计算机学会通讯,第13卷,第12期,2017年12月,p68-73
    Paper  (A journal paper, no slides)
  • TCP WISE: One Initial Congestion Window Is Not Enough
    Xiaohui Nie,Youjian Zhao, Guo Chen, Kaixin Sui, Yazheng Chen, Dan Pei, Miao Zhang, Jiyang Zhang
    IPCCC 2017, San Diego, California, USA, Dec 2017.
    PaperSlides
  • Mining Crowd Mobility and WiFi Hotspots on a Densely-populated Campus
    Mengyu Zhou, Kaixin Sui, Dan Pei, Thomas Moscibroda
    PURBA 2017, Maui, Hawaii, USA, September 2017.
    Paper
  • How Vulnerable Is the Public WiFi AP You Are Using?
    Ruming Tang, Haibin Li, Kaixin Sui, Zihao Jin, Xiao Yang, Dan Pei and Beichuan Zhang
    ICCCN 2017,  Vancouver, Canada, 31 July-4 August 2017.
    PaperSlides
  • How Much Are Your Neighbors Interfering with Your WiFi Delay?
    Changhua Pei, Youjian Zhao, Guo Chen, Yuan Meng, Yang Liu, Ya Su, Yaodong Zhang, Ruming Tang and Dan Pei
    ICCCN 2017,  Vancouver, Canada, 31 July-4 August 2017.
    Paper
  • Syslog Processing for Switch Failure Diagnosis and Prediction in Datacenter Networks
    Shenglin Zhang, Weibin Meng, Jiahao Bu, Sen Yang, Ying Liu, Dan Pei, Jun(Jim) Xu, Yu Chen, Hui Dong, Xianping Qu, Lei Song
    IWQoS 2017,Vilanova i la Geltrú, Barcelona, Spain, 14-16 June 2017.
    Paper Slides
  • Latency-Based WiFi Congestion Control in the Air for Dense WiFi Networks
    Changhua Pei, Youjian Zhao, Yunxin Liu, Kun Tan, Jiansong Zhang, Yuan Meng, Dan Pei
    IWQoS 2017, Vilanova i la Geltrú, Barcelona, Spain, 14-16 June 2017.
    PaperSlides
  • You Can Hide, But Your Periodic Schedule Can’t
    Minghua Ma, Kai Zhao, Kaixin Sui, Lei Xu, Yong Li, Dan Pei
    IWQoS 2017, Vilanova i la Geltrú, Barcelona, Spain, 14-16 June 2017. (Short paper)
    PaperSlides
  • F2Tree: Rapid Failure Recovery for Routing in Production Data Center Networks
    Guo Chen,  Youjian Zhao, Hailiang Xu, Dan Pei,  Dan Li.
    IEEE/ACM Transactions on Networking. March 2017.
    Paper
  • Firewall Fingerprinting and Denial of Fireballing Attacks
    Amir Khakpour, Alex Liu, Joshua Hulst, Zihui Ge, Dan Pei, Jia Wang
    IEEE Transactions on Information Forensics and Security. Feb. 2017
    Paper
  • Why it Takes so Long to Connect to a WiFi Access Point?
    Changhua Pei,  Zhi Wang, Youjian Zhao, Zihan Wang, Yuan Meng, Dan Pei, Yuanquan Peng, Wenliang Tang, Xiaodong Qu
    INFOCOM 2017, Atlanta, Georgia, USA, May 1-4.
    PaperSlidesarXiv

2016:

  • Mining Causality Graph For Automatic Web-based Service Diagnosis
    Xiaohui Nie, Youjian Zhao, Kaixin Sui, Dan Pei, Yu Chen, Xianping Qu
    IPCCC 2016, Las Vegas, Nevada, USA, Dec 2016.
    PaperSlides
  • EDUM: Classroom Education Measurements via Large-scale WiFi Networks
    Mengyu Zhou, Minghua Ma, Yangkun Zhang, Kaixin Sui, Dan Pei, Thomas Moscibroda
    UbiComp 2016, Heidelberg, Germany, September 2016.
    PaperSlides
  • Interest-Suppression-based NDN Live Video Broadcasting over Wireless LAN
    Menghan Li, Dan Pei, Xiaoping Zhang, Beichuan Zhang, Ke Xu
    Frontiers of Computer Science, accepted.
    Paper
  • Fetching Popular Data from the Nearest Replica in NDN
    Jianxun Cao, Dan Pei, Xiaoping Zhang, Beichuan Zhang, Youjian Zhao
    ICCCN 2016, Hawaii, USA, August 2016.
    Paper Slides
  • Understanding the Impact of AP Density on WiFi Performance Through Real-World Deployment
    Kaixin Sui, Siqi Sun, Yousef Azzabi, Xiaoping Zhang, Youjian Zhao, Jilong Wang, Zimu Li, Dan Pei
    LANMAN 2016, Rome, Italy, June 2016.
    PaperSlides
  • Your Trajectory Privacy Can Be Breached Even If You Walk in Groups
    Kaixin Sui, Youjian Zhao, Dapeng Liu, Minghua Ma, Lei Xu, Li Zimu, Dan Pei
    IWQoS 2016, Beijing, China, June 2016. (Short paper)
    PaperSlides
  • M3: Practical and Reliable Multi-Layer Video Multicast over Multi-Rate Wi-Fi Network
    Menghan Li, Dan Pei, Xiaoping Zhang, Beichuan Zhang, Zhi Wang, Hailiang Xu, Zihan Wang
    IWQoS 2016, Beijing, China, June 2016.
    PaperSlides
  • Fast and Cautious: Leveraging Multi-path Diversity for Transport Loss Recovery in Data Centers
    Guo Chen, Yuanwei Lu, Yuan Meng, Bojie Li, Kun Tan, Dan Pei, Peng Cheng, Layong (Larry) Luo, Yongqiang Xiong, Xiaoliang Wang, and Youjian Zhao
    USENIX ATC 2016, Denver, CO, USA, June 2016
    PaperSlidesAudio
  • FUNNEL: Assessing Software Changes in Web-based Services
    Shenglin Zhang, Ying Liu, Dan Pei, Yu Chen, Xianping Qu, Shimin Tao, Zhi Zang, Xiaowei Jing, Mei Feng
    Transactions on Service Computing, 2016
    Paper (A journal paper, no slides)
  • MobiCamp: a Campus-wide Testbed for Studying Mobile Physical Activities
    Mengyu Zhou, Kaixin Sui, Minghua Ma, Youjian Zhao, Dan Pei, Thomas Moscibroda
    WPA 2016, Singapore, June 2016
    PaperSlides
  • Characterizing and Improving WiFi Latency in Large-Scale Operational Networks
    Kaixin Sui, Mengyu Zhou, Dapeng Liu, Minghua Ma, Dan Pei, Youjian Zhao, Zimu Li, Thomas Moscibroda
    MobiSys 2016, Singapore, June 2016
    PaperVideoSlides
  • Improving the Freshness of NDN Forwarding States
    Jianxun Cao, Dan Pei, Zhelun Wu, Xiaoping Zhang, Beichuan Zhang, Lan Wang, Youjian Zhao
    IFIP Networking 2016, Vienna, Austria, May 2016
    PaperSlides
  • FOCUS: Shedding Light on the High Search Response Time in the Wild
    Dapeng Liu, Youjian Zhao, Kaixin Sui, Lei Zou, Dan Pei, Qingqian Tao, Xiyang Chen, Dai Tan
    INFOCOM 2016, San Francisco, CA, USA, Apr 2016
    PaperSlides
  • WiFi can Be the Weakest Link of Round Trip Network Latency
    Changhua Pei, Youjian Zhao, Guo Chen, Ruming Tang, Yuan Meng, Minghua Ma, Ken Ling, Dan Pei
    INFOCOM 2016, San Francisco, CA, USA, Apr 2016
    PaperSlides

 

2015:

  • Learning Thresholds for PV Change Detection from Operators’ Labels
    Dapeng Liu, Youjian Zhao, Kaixin Sui, Shiwen Cheng, Dan Pei, Chengbin Quan, Jiao Luo, Xiaowei Jing, Mei Feng
    IPCCC 2015, Nanjing, China, Dec 2015
    PaperSlides
  • Narrowing Down the Debugging Space of Slow Search Response Time
    Dapeng Liu, Youjian Zhao, Dan Pei, Chengbin Quan, Qingqian Tao, Pei Wang, Xiyang Chen, Dai Tan, Xiaowei Jing, Mei Feng
    IPCCC 2015, Nanjing, China, Dec 2015
    PaperSlides
  • Rapid and Robust Impact Assessment of Software Changes in Large Internet-based Services
    Shenglin Zhang, Ying Liu, Dan Pei, Yu Chen, Xianping Qu, Shimin Tao, Zhi Zang
    ACM CoNEXT 2015, Heidelberg, Germany, Dec 2015
    PaperSlides
  • Opprentice: Towards Practical and Automatic Anomaly Detection Through Machine Learning
    Dapeng Liu, Youjian Zhao, Haowen Xu, Yongqian Sun, Dan Pei, Jiao Luo, Xiaowei Jing, Mei Feng
    ACM IMC 2015, Tokyo, Japan, Oct 2015
    PaperSlides
  • Alleviating Flow Interference in Data Center Networks through Fine-grained Switch Queue Management
    Guo Chen, Youjian Zhao, Dan Pei.
    Elsevier Computer Networks. 2015.
    Paper (A journal paper, no slides)
  • NDN Live Video Broadcasting over Wireless LAN
    Menghan Li, Dan Pei, Xiaoping Zhang, Beichuan Zhang and Ke Xu.
    ICCCN, Las Vegas, Nevada, USA. August 3 – August 6, 2015.
    PaperSlides
  • Rewiring 2 Links is Enough: Accelerating Failure Recovery in Production Data Center Networks
    Guo Chen, Youjian Zhao, Dan Pei, and Dan Li.
    ICDCS, Columbus, Ohio, USA. June 29th – July 2nd , 2015.
    PaperSlides
  • How Bad Are The Rogues’ Impact on Enterprise 802.11 Network Performance?
    Kaixin Sui, Youjian Zhao, Dan Pei, and Zimu Li.
    INFOCOM 2015, Hongkong, China, April 27-30, 2015.
    PaperSlides
  • MIFO: Multi-Path Interdomain Forwarding.
    Ming Zhu, Dan Li, Ying Liu, Dan Pei, K.K. Ramakrishnan and Jianping Wu.
    2015 International Conference on Parallel Processing, Beijing, China, Sept. 2015.
    Paper

 

2014:

  • Application-Aware Latency Monitoring for Cloud Tenants via CloudWatch+
    Dapeng Liu, Dan Pei, Youjian Zhao
    CNSM 2014, Rio de Janeiro, Brazil, Nov 2014
    PaperSlides
  • A Measurement Study on BGP AS Path Looping (BAPL) Behavior
    Shenglin Zhang, Ying Liu, Dan Pei
    IEEE ICCCN, Shanghai, China, August 2014
    PaperSlides
  • Multi-AS Cooperative Incoming Traffic Engineering in a Transit-Edge Separate Internet
    Yaodong Zhang, Yue Wang, Dan Pei, and Jian Yuan.
    Elsevier Computer Networks. 2014.
    Paper (A journal paper, no slides)
  • Designing Buffer Capacity of Crosspoint-Queued Switch
    Guo Chen, Dan Pei, Youjian Zhao and Yongqian Sun
    IFIP NPC, Taiwan, Sept. 2014
    PaperSlides
  • CQRD: A Switch-based Approach to Flow Interference in Data Center Networks
    Guo Chen, Dan Pei, and Youjian Zhao
    IEEE LCN, Edmonton, Canada. Sept. 2014
    PaperSlides

For Dan’s publications before 2014, please check Dan’s Google Scholar page.

 
 
 
Scroll Up