Home About Research Invention Members area Contact
Resources
Projects
Publications
Presentations
Supports

Publication List Based on Areas

Dissertations and Theses

Technical Reports

Current Publications

[2017]

[2016]

  • S. Liu, E.-S. Jun, R. Kettimuthu, X.-H. Sun and M. Papka, "Towards Optimizing Large-Scale Data Transfers with End-to-End Integrity Verification," in the 4th International Workshop on Distributed Storage Systems and Coding for Big Data, in conjunction with IEEE BigData 2016. Washington, D.C., USA, Dec. 2016.
  • A. Kougkas, A. Fleck and X.-H. Sun, "Towards Energy Efficient Data Management in HPC: The Open Ethernet Drive Approach," in PDSW-DISCS'16, in conjunction with SC'16, 2016.
  • X. Yang, J. Jenkins, M. Mubarak, R. Ross, and Z.Lan, "Watch Out for the Bully! Job Interference Study on Dragonfly Network," in Proc. of the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis 2016 (SC'16), Salt Lake City, Utah, USA, Nov. 2016. (acceptance rate: 18%)
  • S. Wallace, X. Yang, V. Vishwanath, W. Allcock, S. Coghlan, M. Papka, and Z. Lan, "A Data Driven Scheduling Approach for Power Management on HPC Systems," in Proc. of the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis 2016 (SC'16), Salt Lake City, Utah, USA, Nov. 2016. (acceptance rate: 18%)
  • A. Kougkas, M. Dorier, R. Latham, R. Ross, and X.-H. Sun, "Leveraging Burst Buffer Coordination to Prevent I/O Interference," in Proc. of eScience'16, Baltimore, Maryland, USA, Oct. 2016.
  • X. Yang, S. Liu, K. Feng, S. Zhou, and X.-H. Sun, "Visualization and Adaptive Subsetting of Earth Science Data in HDFS - A Novel Data Analysis Strategy with Hadoop and Spark," in Proc. the 6th IEEE International Conference on Big Data and Cloud Computing (BDCloud 2016), Atlanta, GA, Oct. 2016.
  • D. Li, S. Wang, S. Yao, Y.-H. Liu, Y. Cheng, and X.-H. Sun, "Efficient Design Space Exploration by Knowledge Transfer," in Proc. of Eleventh IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS'16), Pittsburgh, PA, USA, Oct. 2016.
  • X.-H. Sun and Y.-H. Liu, "Utilizing Concurrency Data Access: A New Theory," in Proc. of the 29th International Workshop on Languages and Compilers for Parallel Computing (LCPC2016) (a position paper), Sept, 2016, New York, USA.
  • E. Berrocal, L. Bautista-Gomez, S. Di, Z. Lan, and F. Cappello, "Exploring Partial Replication to Improve Lightweight Silent Data Corruption Detection for HPC Applications," in Proc. of 22nd International European Conference on Parallel and Distributed Computing (Euro-Par 2016), Grenoble, France, Aug. 2016.
  • Y. Chen, C. Chen, Y. Yin, X.-H. Sun, R. Thakur and W. Gropp, "Rethinking High Performance Computing System Architecture for Scientific Big Data Applications," in Proc. of 14th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA 2016), Tianjin, China, Aug. 2016. (Best Paper Award)
  • W. Yang, C. Xu, S. He and X.-H. Sun, "On MinMax-Memory Claims for Scientific Workflows in the In-Memory Cloud Computing (Poster Presentation)," in Proc. of the 36th International Conference on Distributed Computing Systems (ICDCS), Nara Hotel, Nara, Japan, Jun. 2016.
  • D. Li, S. Yao, Y.-H. Liu, S. Wang, and X.-H. Sun, "Efficient Design Space Exploration via Statistical Sampling and AdaBoost Learning," in Proc. of 53rd Design Automation Conference (DAC'16), Texas, Austin, USA, Jun. 2016.

[2015]

  • R. Ranjan, L. Wang, A. Y. Zomaya, D. Georgakopoulos, X.-H. Sun and G. Wang, "Recent Advances in Autonomic Provisioning of Big Data Applications on Clouds," IEEE Transaction on Cloud Computing, vol. 3, no. 2, pp. 101-104, June. 2015.
  • J. Wu, X. Xiong, and Z.Lan, "Hierarchical Task Mapping for Parallel Applications on Supercomputers," The Journal of Supercomputing, vol. 71, no. 5, pp. 1776-1802, May. 2015.
  • Z. Zheng, L. Yu, and Z.Lan, "Reliability-Aware Speedup Models for Parallel Applications with Coordinated Checkpointing/Restart," IEEE Transactions on Computers, vol. 64, no. 5, pp. 1402-1415, May. 2015.
  • Yuhang Liu and Xian-He Sun, "Reevaluating Data Stall Time with the Consideration of Data Access Concurrency," Journal Of Computer Science And Technology, vol. 30, no. 2, pp. 227-245, Mar. 2015.
  • A. Haider, X. Yang, N. Liu, S. He and X.-H. Sun, "IC-Data: Improving Compressed Data Processing in Hadoop," in Proc. of 22nd annual IEEE International Conference on High Performance Computing (HiPC 2015), Bengaluru, India, Dec. 2015 (acceptance rate: ~24%)
  • X. Yang, C. Feng, Z. Xu and X.-H. Sun, "Dominoes: Speculative Repair in Erasure Coded Hadoop System," in Proc. of 22nd annual IEEE International Conference on High Performance Computing (HiPC 2015), Bengaluru, India, Dec. 2015 (acceptance rate: ~24%)
  • Yu-Hang Liu and Xian-He Sun, "C^2-bound: A Capacity and Concurrency driven Analytical Model for Manycore Design," in Proc. of the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis 2015 (SC'15). Texas, Austin, USA, Nov. 2015.
  • H. Eslami, A. Kougkas, M. Kotsifakou, T. Kasampalis, K. Feng, Y. Lu, W. Gropp, X.-H. Sun, Y. Chen and R. Thakur, "Efficient Disk-to-Disk Sorting: A Case Study in Decoupled Execution Paradigm," in Proc. of the Data Intensive Scalable Computing Systems Workshop (DISCS), in conjunction with ACM/IEEE SuperComputing 2015, Austin, TX, USA, Nov. 2015.
  • A. Haider, S. Mickelson, J. Dennis, X.-H. Sun. "Lessons from Post-processing Climate Data on Modern Flash-based HPC Systems (Poster Presentation)," in Proc. of the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis 2015 (SC'15). Texas, Austin, USA, Nov. 2015.
  • X. Yang, N. Liu, B. Feng, X.-H. Sun and S. Zhou, "PortHadoop: Support Direct HPC Data Processing in Hadoop," in Proc. of IEEE International Conference on Big Data (IEEE BigData 2015). Santa Clara, CA, USA, Oct. 2015. (acceptance rate: 17%)
  • S. Zhou, X. Yang, X. Li, T. Matsui, S. Liu, X.-H. Sun and W. Tao, "A Hadoop-Based Visualization and Diagnosis Framework for Earth Science Data," in Proc. of Big Data in the Geosciences Workshop, in conjunction with IEEE International Conference on Big Data (IEEE BigData 2015) (short paper). Santa Clara, CA, USA, Oct. 2015.
  • K. Wang, N. Liu, I. Sadooghi, X. Yang, X. Zhou, M. Lang, X.-H. Sun and I. Raicu, "Overcoming Hadoop Scaling Limitations through Distributed Task Execution," in Proc. of the IEEE International Conference on Cluster Computing 2015 (Cluster’15), Chicago, IL, USA, Sept. 2015.
  • B. Feng, X. Yang, K. Feng, Y. Yin and X.-H. Sun, "IOSIG+: on the Role of I/O Tracing and Analysis for Hadoop Systems," in Proc. of the IEEE International Conference on Cluster Computing 2015 (Cluster'15) (short paper), Chicago, IL, USA, Sept. 2015.
  • K. Feng, M. G. Venkata, D. Li and X.-H. Sun, "Fast Fault Injection and Sensitivity Analysis for Collective Communications," in Proc. of the IEEE International Conference on Cluster Computing 2015 (Cluster'15), Chicago, IL, USA, Sept. 2015.
  • Z. Zhou, X. Yang, D. Zhao, P. Rich, W. Tang, J. Wang and Z. Lan, "I/O-Aware Batch Scheduling for Petascale Computing Systems," in Proc. of the IEEE International Conference on Cluster Computing 2015 (Cluster’15), Chicago, IL, USA, Sept. 2015.
  • Yu-Hang Liu and Xian-He Sun, "LPM: Concurrency-driven Layered Performance Matching," in Proc. of the 44th International Conference on Parallel Processing (ICPP'15), Beijing, China, Sept. 2015.
  • S. He, X.-H. Sun, Y. Wang, A. Kougkas, and A. Haider, "A Heterogeneity-Aware Region-Level Data Layout Scheme for Hybrid Parallel File Systems," in Proc. of the 44th International Conference on Parallel Processing (ICPP'15), Beijing, China, Sept. 2015.
  • C. Feng, X. Yang, F. Liang, X.-H. Sun, Z. Xu, "LCIndex, A Local and Clustering Index on Distributed Ordered Tables for Multi-Dimensional Range Queries," in Proc. of the 44th International Conference on Parallel Processing (ICPP'15), Beijing, China, Sept. 2015.
  • N. Liu, A. Haider, X.-H. Sun and D. Jin. "FatTreeSim: Modeling a Large-scale Fat-Tree Network for HPC Systems and Data Centers Using Parallel and Discrete Event Simulation," in Proc. of 29th ACM SIGSIM Conference on Principles of Advanced Discrete Simulation (ACM SIGSIM PADS), London, UK, June. 2015. (Best Paper Award, 1/60)
  • N. Liu, X.-H. Sun and D. Jin. "On Massively Parallel Simulation of Large-Scale Fat-Tree Networks for HPC Systems and Data Centers (Poster Presentation)," in Proc. of 29th ACM SIGSIM Conference on Principles of Advanced Discrete Simulation (ACM SIGSIM PADS), London, UK, June. 2015. (Best Poster Award, 1/9)
  • B. Wang, W. Yu, X.-H. Sun and X. Wang, "DaCache: Memory Divergence-Aware GPU Cache Management," in Proc. of 29th International Conference on Supercomputing (ICS'15), Newport Beach, CA. USA, June. 2015
  • S. He, X.-H. Sun, and A. Haider, "HAS: Heterogeneity-Aware Selective Data Layout Scheme for Parallel File Systems on Hybrid Servers," in Proc. of 29th IEEE International Parallel and Distributed Processing Symposium (IPDPS'15), Hyderabad, India, May 2015.
  • Z. Zhou, X. Yang, Z. Lan, P. Rich, W. Tang, V. Morozov, and N. Desai, "Improving Batch Scheduling on Blue Gene/Q by Relaxing 5D Torus Network Allocation Constraints," in Proc. of 29th IEEE International Parallel and Distributed Processing Symposium (IPDPS'15), Hyderabad, India, May 2015.
  • N. Liu, X. Yang, X.-H. Sun, J. Jenkins and R. Ross, "YARNsim: Hadoop YARN Simulation System," in Proc. of 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2015), Shenzhen, Guangdong, China, May 2015. (acceptance rate: 69/268=25.7%) Source code can be found here.

[2014]

  • Y. Yin, A. Kougkas, K. Feng, H. Eslami, Y. Lu, X.-H. Sun, R. Thakur, W. Gropp, "Rethinking Key-Value Store for Parallel I/O Optimization," in Proc. of the Data Intensive Scalable Computing Systems Workshop (DISCS), in conjunction with ACM/IEEE SuperComputing 2014, New Orleans, LA, USA, Nov. 2014.
  • S. He, Y. Liu, and X.-H. Sun, "PSA: A Performance and Space-Aware Data Layout Scheme for Hybrid Parallel File Systems," in Proc. of the Data Intensive Scalable Computing Systems Workshop (DISCS), in conjunction with ACM/IEEE SuperComputing 2014, New Orleans, LA, USA, Nov. 2014.
  • B. Feng, N. Liu, S. He, and X.-H. Sun, "HPIS3: Towards a High-Performance Simulator for Hybrid Parallel I/O and Storage Systems," in Proc. of 9th Parallel Data Storage Workshop (PDSW'14), in conjunction with ACM/IEEE SuperComputing 2014, New Orleans, LA, USA, Nov. 2014.
  • X. Yang, Y. Yin, H. Jin, and X.-H. Sun, "SCALER: Scalable Parallel File Write in HDFS," in Proc. of IEEE International Conference on Cluster Computing 2014 (Cluster'14), Madrid, Spain, Sept. 2014.
  • E. Berrocal, L. Yu, S. Wallace, M. Papka, and Z. Lan, "Exploring Void Search for Fault Detection on Extreme Scale Systems," (Best Paper Award), in Proc. of IEEE International Conference on Cluster Computing 2014 (Cluster'14), Madrid, Spain, Sept. 2014.
  • X. Yang, X. Zheng, Z. Zhou, W. Tang, J. Wang, and Z. Lan, "Balancing Job Performance with System Performance via Locality-Aware Scheduling on Torus-Connected Systems," in Proc. of IEEE International Conference on Cluster Computing 2014 (Cluster'14), Madrid, Spain, Sept. 2014.
  • C. Chen, Y. Chen, K. Feng, Y. Yin, H. Eslami, R. Thakur, X.-H. Sun, and W. D. Gropp, "Decoupled I/O for Data-Intensive High Performance Computing," in Proc. of Seventh International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2), in conjunction with the International Conference on Parallel Processing (ICPP-2014), Minneapolis, MN, USA, Sept. 2014.
  • S. He, X.-H. Sun, B. Feng and K. Feng, "Performance-Aware Data Placement in Hybrid Parallel File Systems," in Proc. of the 14th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP), Dalian, China, Aug. 2014.
  • S. He, X.-H. Sun and B. Feng, "S4D-Cache: Smart Selective SSD Cache for Parallel I/O Systems," in Proc. of International Conference on Distributed Computing Systems (ICDCS), Madrid, Spain, June 2014. (acceptance rate: 66/500=13%).

[2013]

  • H. Jin and X.-H. Sun, "Performance Comparison under Failures of MPI and MapReduce: An Analytical Approach," Future Generation Computer Systems (FGCS), vol. 29, no. 7, pp. 1808-1815, Sept. 2013.
  • J. He, J. Kowalkowski, M. Paterno, D. Holmgren, J. Simone, and X.-H. Sun, "Layout-Aware Scientific Computing-A Case Study using the MILC Code," Journal of Computational Science, vol. 4, no. 6, pp. 496-506, Nov. 2013.
  • X. Yang, Z. Zhou, S. Wallace, Z. Lan, W. Tang, S. Coghlan, M. E. Papka, "Integrating dynamic pricing of electricity into energy aware scheduling for HPC systems," in Proc. of 2013 International Conference for High Performance Computing, Networking, Storage and Analysis, Denver, CO, USA, Nov. 2013.
  • Shuibing He, Xian-He Sun, Bo Feng, Xin Huang, and Kun Feng, "A Cost-Aware Region-Level Data Placement Scheme for Hybrid Parallel I/O Systems," in Proc. of the IEEE International Conference on Cluster Computing 2013 (Cluster'13), Indianapolis, IN, USA, Sept. 2013.
  • Kun Feng, Yanlong Yin, Chao Chen, Hassan Eslami, Xian-He Sun, Yong Chen, Rajeev Thakur and William Gropp, "Runtime System Design of Decoupled Execution Paradigm for Data-Intensive High-End Computing (Poster Presentation)," in Proc. of the IEEE International Conference on Cluster Computing 2013 (Cluster'13), Indianapolis, IN, USA, Sept. 2013.
  • J. He, J. Bent, A. Torres, G. Grider, G. Gibson, C. Maltzahn, and X.-H. Sun, "I/O Acceleration with Pattern Detection," in Proc. of the 22th International ACM Symposium on High Performance Distributed Computing (HPDC'13), New York City, NY, USA, June 2013. (acceptance rate: 20/131=15.3%)
  • Shuibing He, Xian-He Sun, and Yanlong Yin, "BPS: A Performance Metric of I/O System," in Proc. of the 2013 International Workshop on High Performance Data Intensive Computing (HPDIC 2013), in Conjunction With IEEE IPDPS 2013, Boston, Massachusetts, USA, May 2013.
  • Y. Yin, J. Li, J. He, X.-H. Sun, and R. Thakur, "Pattern-Direct and Layout-Aware Replication Scheme for Parallel I/O Systems," in Proc. of IEEE International Parallel and Distributed Processing Symposium (IPDPS' 13), Phoenix, AZ, USA, May 2013. (acceptance rate: 106/494=21%).

[2012]

  • J. He, J. Bent, A. Torres, G. Grider, G. Gibson, C. Maltzahn and X.-H. Sun, "Discovering Structure in Unstructured I/O," in Proc. of 7th Parallel Data Storage Workshop (PDSW'12), in conjunction with ACM/IEEE SuperComputing 2012, Salt Lake City, UT, USA, Nov. 2012.
  • J. Wu, Z. Lan, X. Xiong, N. Gnedin and A. Kravtsov, "Hierarchical Task Mapping of Cell-based AMR Cosmology Simulations," in Proc. of the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis 2012 (SC'12). Salt Lake City, UT, USA, Nov. 2012.
  • Y. Chen, C. Chen, X.-H. Sun, W. D. Gropp, and R. Thakur, "A Decoupled Execution Paradigm for Data-Intensive High-End Computing," in Proc. of the IEEE International Conference on Cluster Computing 2012 (Cluster'12), Beijing, China, Sept. 2012.
  • J. He, X.-H. Sun and R. Thakur, "KNOWAC: I/O Prefetch via Accumulated Knowledge," in Proc. of IEEE International Conference on Cluster Computing (Cluster'12), Beijing, China, Sept. 2012.
  • Z. Zheng, L. Yu, Z. Lan, and T. Jones, "3-Dimensional Root Cause Diagnosis via Co-Analysis," in Proc. of International Conference on Autonomic Computing 2012 (ICAC'12), San Jose, CA, USA, Sept. 2012.
  • H. Jin, J. Ji, X.-H. Sun, Y. Chen and R. Thakur, "CHAIO: Enabling HPC Applications on Data-Intensive File Systems," in Proc. of 41th International Conference on Parallel Processing (ICPP), Pittsburgh, PA, Sept. 2012.
  • W. Tang, D. Ren, Z. Lan and N. Desai, "Adaptive Metric-Aware Job Scheduling for Production Supercomputers," in Proc. of 5th International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2), 2012, in conjunction with ICPP 2012, Pittsburgh, PA, USA, Sept. 2012.
  • L. Yu, Z. Zheng, Z. Lan, T. Jones, J. Brandt, and A. gentile, "Filtering Log Data: Finding the needles in the Haystack," in Proc. of International Conference on Dependable Systems and Networks 2012 (DSN'12), Boston, MA, USA, June 2012.
  • H. Jin, X. Yang, X. -H. Sun, and I. Raicu, "ADAPT: Availability-aware MapReduce Data Placement in Non-Dedicated Distributed Computing Environment," in Proc. of the 32nd International Conference on Distributed Computing Systems (ICDCS), Macau, China, June 2012. (acceptance rate: 71/515=13%).
  • H. Song, H. Jin, J. He, X.-H. Sun and R. Thakur, "A Server-Level Adaptive Data Layout Strategy for Parallel File Systems," in Proc. of 2012 International Workshop on High Performance Data Intensive Computing(HPDIC 2012), in Conjunction With IEEE IPDPS 2012, Shanghai, China, May 2012.
  • Y. Yu, D. Rudd, Z. Lan, N. Gnedin, A. Kravtsov, and J. Wu, "Improving Parallel IO Performance of Cell-based AMR Cosmology Applications," in Proc. of IEEE International Parallel & Distributed Processing Symposium 2012 (IPDPS'12), Shanghai, China, May 2012.
  • H. Zou, X.-H. Sun, S. Ma, and X. Duan, "A Source-Aware Interrupt Scheduling for Modern Parallel I/O Systems," in Proc. of IEEE International Parallel and Distributed Processing Symposium (IPDPS' 12), Shanghai, China, May 2012.
  • H. Jin, T. Ke, Y. Chen and X.-H. Sun, "Checkpointing Orchestration: Toward a Scalable HPC Fault-Tolerant Environment," in Proc. of IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Ottawa, Canada, May 2012. (acceptance rate: 83/302=27.5%).
  • Y. Yin, S. Byna, H. Song, X.-H. Sun, and R. Thakur, "Boosting Application-Specific Parallel I/O Optimization Using IOSIG," in Proc. of IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Ottawa, Canada, May 2012. (acceptance rate: 83/302=27.5%).
  • R. Ge, X. Feng and X.-H. Sun, "SERA-IO: Integrating Energy Consciousness into Parallel I/O Middleware," in Proc. of IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Ottawa, Canada, May 2012. (acceptance rate: 83/302=27.5%).
  • H. Jin and X.-H. Sun, "Performance Comparison under Failures of MPI and MapReduce: An Analytical Approach," in 2nd International Workshop on Cloud Computing and Scientific Applications (CCSA), in conjunction with CCGrid 2012, Ottawa, Canada, May 2012. Invited to a special issue of Future Generation Computing System (FGCS).

[2011]

  • H. Song, Y. Yin, X.-H. Sun, R. Thakur, and S. Lang, "Server-Side I/O Coordination for Parallel File Systems," in Proc. of the ACM/IEEE SuperComputing Conference (SC'11), Seattle, WA, USA, Nov. 2011. (acceptance rate: 74/352=21.0%).
  • X.-H. Sun and D. Wang, "Memory Access Cycle and the Measurement of Memory Systems," in Proc. of 2nd International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS'11), in conjunction with IEEE/ACM SuperComputing 2011, Seattle, WA, USA, Nov. 2011.
  • J. He, H. Song, X.-H. Sun, Y. Yin and R. Thakur, "Pattern-aware File Reorganization in MPI-IO," in Proc. of 6th Parallel Data Storage Workshop (PDSW'11), in conjunction with ACM/IEEE SuperComputing 2011, Seattle, WA, USA, Nov. 2011.
  • J. He, J. Kowalkowski, M. Paterno, D. Holmgren, J. Simone and X.-H. Sun, "Layout-aware Scientific Computing - A Case Study Using MILC," in the Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (ScalA'11), in conjunction with ACM/IEEE SuperComputing 2011, Seattle, WA, USA, Nov. 2011.
  • W. Tang, N. Desai, V. Vishwanath, D. Buettner, and Z. Lan, "Job Coscheduling on Coupled High-End Computing Systems," in Proc. of International Conference on Parallel Processing Workshops (ICPPW'11), Taipei, Taiwan, Sept. 2011.
  • J. Wu, R. Gonzalez, Z. Lan, N. Gnedin, A. Kravtsov, D. Rudd, and Y. Yu, "Performance Emulation of Cell-based AMR Cosmology Simulations," in Proc. of IEEE International Conference on Cluster Computing (CLUSTER), Austin, Texas, Sept. 2011.
  • D. Wang, X.-H. Sun, N. Hu, and N. Sun, "EthSpeeder: A High-performance Scalable Fault-Tolerant Ethernet Network Architecture for Data Center," in Proc. of the 6th IEEE International Conference on Networking, Architecture, and Storage (NAS2011), Dalian, China, July 2011.
  • L. Yu, Z. Zheng, Z. Lan, and S. Coghlan, "Practical Online Failure Prediction for Blue Gene/P: Period-based vs Event-driven," in Proc. of Proactive Failure Avoidance, Recovery, and Maintenance workshop(in conjunction with DSN'11), Hong Kong, China, June 2011.
  • H. Song, Y. Yin, Y. Chen, X.-H. Sun, "A Cost-intelligent Application-specific Data layout Scheme for Parallel File Systems," in Proc. of the 20th International ACM Symposium on High Performance Distributed Computing (HPDC'11), San Jose, CA, June 2011. (acceptance rate: 22/170=12.9%).
  • H. Song, X.-H. Sun, Y. Chen, "A Hybrid Shared-nothing/Shared-data Storage Scheme for Large-scale Data Processing," Best Paper Award, in Proc. of the 9th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA'11), Busan, Korea, May 2011.
  • H. Jin, K. Qiao, X.-H. Sun and Y. Li, "Performance under Failures of MapReduce Applications (Poster Presentation)," in Proc. of the 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'11), Newport Beach, CA, USA, May 2011.
  • K. Zhang, Z. Wang, Y. Chen, H. Zhu and X.-H. Sun, "PAC-PLRU: A Cache Replacement Policy to Salvage Discarded Predictions from Hardware Prefetchers," Student Scholar Award, in Proc. of the 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'11), Newport Beach, CA, USA, May 2011. (acceptance rate: 55/189=29.1%).
  • H. Song, Y. Yin, X.-H. Sun, R. Thakur and S. Lang, "A Segment-Level Adaptive Data Layout Scheme for Improved Load Balance in Parallel File Systems," "in Proc. of the 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'11), Newport Beach, CA, USA, May 2011. (acceptance rate: 55/189=29.1%).
  • H. Song, Y. Chen, X.-H. Sun, "A Hybrid Shared-nothing/Shared-data Storage Architecture for Large Scale Databases(Poster Presentation)," in Proc. of the 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'11), Newport Beach, CA, USA, May 2011.
  • W. Tang, Z. Lan, N. Desai, D. Buettner, Y. Yu, "Reducing Fragmentation on Torus-Connected Supercomputers," in Proc. of IEEE International Parallel and Distributed Processing Symposium (IPDPS' 11), Anchorage, AK, USA, May 2011.
  • Z. Zheng, L. Yu, W. Tang, Z. Lan, R. Gupta, N. Desai, S. Coghlan, and D. Buettner, "Co-Analysis of RAS Log and Job Log on Blue Gene/P," in Proc. of IEEE International Parallel and Distributed Processing Symposium (IPDPS' 11), Anchorage, AK, USA, May 2011.
  • Y. Chen, X.-H. Sun, R. Thakur, P. C. Roth and W. Gropp, "LACIO: A New Collective I/O Strategy for Parallel I/O Systems," in Proc. of IEEE International Parallel and Distributed Processing Symposium (IPDPS' 11), Anchorage, AK, USA, May 2011. (acceptance rate: 112/571=19.6%).

[2010]

  • Z. Lan, J. Gu, Z. Zheng, R. Thakur, and S. Coghlan, "A Study of Dynamic Meta-Learning for Failure Prediction in Large-Scale Systems," in press of Journal of Parallel and Distributed Computing, vol. 70, pp. 630-643, June 2010.
  • X.-H. Sun and Y. Chen, "Reevaluating Amdahl's Law in the Multicore Era," in press of Journal of Parallel and Distributed Computing, vol. 70, no. 2, pp. 183-188, Feb. 2010.
  • Z. Lan, Z. Zheng, and Y. Li, "Toward Automated Anomaly Identification in Large-Scale Systems," in IEEE Transactions on Parallel and Distributed Systems, vol. 21, no. 2, pp. 174 - 187, Feb. 2010.
  • H. Jin, X.-H. Sun, Y. Chen, and T. Ke, "REMEM: REmote MEMory as Checkpointing Storage," in Proc. of the 2nd International Conference on Cloud Computing, Indianapolis, IN, USA, Nov. 2010. (acceptance rate: < 25%).
  • R. Ge, X. Feng, J. Hu, X.-H. Sun, "Assessing Energy Efficiency of Parallel I/O Systems (Poster Presentation)," in Proc. of the ACM/IEEE SuperComputing Conference (SC'10), New Orleans, LA, USA, Nov. 2010.
  • H. Song ,X.-H. Sun, H. Jin, Y. Chen, "Trace-based Adaptive Data Layout Optimization for Parallel File systems (Poster Presentation)," the 5th Petascale Data Storage Workshop, in conjunction with SuperComputing 2010, New Orleans, LA, USA, Nov. 2010.
  • Y. Chen, X.-H. Sun, R. Thakur, H. Song and H. Jin, "Improving Parallel I/O Performance with Data Layout Awareness," in Proc. of the IEEE International Conference on Cluster Computing 2010 (Cluster10), Heraklion, Greece, Sept. 2010. (acceptance rate: 33/107=30.8%).
  • H. Jin, Y. Chen, H. Zhu and X.-H. Sun, "Optimizing HPC Fault-Tolerant Environment: An Analytical Approach," in Proc. of 39th International Conference on Parallel Processing (ICPP'2010), San Diego, CA, USA, Sept. 2010. (acceptance rate: 72/225=32%).
  • Y. Chen, H. Zhu, H. Jin and X.-H. Sun, "Improving the Effectiveness of Context-based Prefetching with Multi-order Analysis," in Proc. of the 3rd International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2), San Diego, CA, USA, Sept. 2010.
  • Z. Zheng, Z. Lan, R. Gupta, S. Coghlan and Peter Beckman, "A Practical Failure Prediction with Location and Lead Time for Blue Gene/P," in Proc. of Fault-Tolerance at Extreme Scale workshop (in conjunction with DSN'10), Chicago, IL, USA, June 2010.
  • Y. Chen, H. Song, R. Thakur and X.-H. Sun, "A Layout-aware Optimization Strategy for Collective I/O," in Proc. of High Performance Distributed Computing (HPDC-2010) (short paper), Chicago, IL, USA, June 2010.
  • H. Zhu, Y. Chen and X.-H. Sun, "Timing Local Streams: Improving Timeliness in Data Prefetching," in Proc. of the 24th International Conference on Supercomputing (ICS'10), Tsukuba, Japan, June 2010. (acceptance rate: 32/180=17.8%).
  • Y. Chen, H. Zhu and X.-H. Sun, "An Adaptive Data Prefetcher for High-Performance Processors," in Proc. of the 10th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'10), Melbourne, Australia, May 2010. (acceptance rate: 51/219=23.3%).
  • R. Ge, X. Feng, S. Subramanya and X.-H. Sun, "Characterizing the Energy Efficiency of I/O Intensive Parallel Applications on Power-Aware Clusters," in the 6th workshop on high performance power-aware computing in conjunction with the 24th IEEE International Parallel and Distributed Processing Symposium, Atlanta, GA, USA, Apr. 2010.
  • W. Tang, N. Desai, D. Buettner, and Z. Lan, "Analyzing and Adjusting User Runtime Estimates to Improve Job Scheduling on Blue Gene/P," Best Paper Award, in Proc. of IPDPS'10, Atlanta, GA, USA, Apr. 2010.

[2009]

  • M. Wu, Xian-He Sun, "QoS of Grid Computing," in Grid Technologies and Utility Computing: Concepts for Managing Large-Scale Applications (Encyclopedia of Grid Computing Technologies and Applications), Igi Global, 2009, pp 59-74, ISBN-10: 1605661848, ISBN-13: 978-1605661841.
  • C. Du, P. Shukla, and X.-H. Sun, "Virtual Machines in Grid Environments: Dynamic Virtual Machines," in Grid Computing: Infrastructure, Service, and Application (Hardcover), CRC, 2009, pp 405-431, ISBN-10: 1420067664, ISBN-13: 978-1420067668.
  • X.-H. Sun, S. Byna, D. Holmgren, "Modeling Data Access Contention in Multicore Architectures," in Proc. of Fifteenth International Conference on Parallel and Distributed Systems (ICPADS'09), Shenzhen, China, Dec. 2009.
  • B. Xie, Y. Chen, X.-H. Sun and H. Jin, "Performance under Failure of Multi-tier Web Services," in Workshop on Internet-based Virtual Computing Environment (in conjunction with ICPADS'09), Shenzhen, China, Dec. 2009.
  • H. Jin, X.-H. Sun, B. Xie and Y. Chen, "An Implementation and Evaluation of Memory-based Checkpointing (Poster Presentation)," in Proc. of the ACM/IEEE SuperComputing Conference(SC'09), Portland, OR, USA, Nov. 2009.
  • X.-H. Sun, Y. Chen and Y. Yin, "Data Layout Optimization for Petascale File Systems," in Proc. of The 4th Petascale Data Storage Workshop (in conjunction with ACM/IEEE SC'09), Portland, OR, USA, Nov. 2009.
  • X.-H. Sun, C. Du, H. Zou, Y. Chen, and P. Shukla, "V-MCS: A Configuration System for Virtual Machines," in Proc. of Workshop on Web 2.0 on e-Research Infrastructure, Services and Applications (in conjunction with Cluster'09), New Orleans, LA, USA, Aug. 2009.
  • Z. Zheng and Z. Lan, "Reliability-Aware Scalability Models for High Performance Computing," in Proc. of IEEE Cluster'09, New Orleans, LA, USA, Aug. 2009.
  • W. Tang, Z. Lan, N. Desai, and D. Buettner, "Fault-Aware, Utility-Based Job Scheduling on Blue Gene/P Systems," in Proc. of IEEE Cluster'09, New Orleans, LA, USA, Aug. 2009.
  • Z. Zheng, Z. Lan, B.-H. Park, and A. Geist, "System Log Pre-processing to Improve Failure Prediction," in Proc. of IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'09), Estoril, Lisbon, Portugal, June 2009.
  • H. Jin, X.-H. Sun, Z. Zheng, Z. Lan, and B. Xie, "Performance under Failures of DAG-based Parallel Computing," in Proc. of IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid'09), Shanghai, China, May 2009. (acceptance rate: 57/271=21%).
  • Z. Fang, X.-H. Sun, Y. Chen, and S. Byna, "Core-Aware Memory Access Scheduling Schemes," in Proc. of IEEE International Parallel & Distributed Processing Symposium (IPDPS'09), Rome, Italy, May 2009. (acceptance rate: 100/440=22.7%).

[2008]

  • L. Piccoli, J. B. Kowalkowski, J. N. Simone, X.-H. Sun, H. Jin, D. J. Holmgren, N. Seenu, and A. G. Singh, " Lattice QCD Workflows: A Case Study," in 3rd International Workshop on Scientific Workflows and Business Workflow Standards in e-Science (SWBES), Dec. 2008.
  • Y. Chen, S. Byna, X.-H. Sun, R. Thakur, and W. Gropp, "Hiding I/O Latency with Pre-execution Prefetching for Parallel Applications," Best paper award finalist, in Proc. of the ACM/IEEE SuperComputing Conference (SC'08), Nov. 2008. (acceptance rate: 59/277=21.3%).
  • S. Byna, Y. Chen, X.-H. Sun, R. Thakur, and W. Gropp, "Parallel I/O Prefetching Using MPI File Caching and I/O Signatures," in Proc. of the ACM/IEEE SuperComputing Conference (SC'08), Nov. 2008. (acceptance rate: 59/277=21.3%).
  • X.-H. Sun, Y. Chen, and S. Byna, "Scalable Computing in Multicore Era," in the International Symposium on Parallel Algorithms, Architectures and Programming (PAAP'08), Sept. 2008.
  • Y. Chen, S. Byna, X.-H. Sun, R. Thakur, and W. Gropp, "Exploring Parallel I/O Concurrency with Speculative Prefetching," in Proc. 37th International Conference on Parallel Processing (ICPP'08), Sept. 2008. (acceptance rate: 81/263=30.8%).
  • J. Gu, Z. Zheng, Z. Lan, J. White, E. Hocks, and B-H. Park, "Dynamic Meta-Learning for Failure Prediction in Large-scale Systems: A Case Study," in Proc. 37th International Conference on Parallel Processing (ICPP'08), Sept., 2008.
  • L. Piccoli, J. Simone, J. Kowalkowski, et.al, " Tracking LQCD Workflows(Poster Presentation)," in Lattice 2008, July 2008.
  • Y. Li and Z. Lan, "A Fast Recovery Mechanism for Checkpointing in Networked Environments," in Proc. of DSN'08, June, 2008.
  • S. Byna, Y. Chen, and X.-H. Sun, "A Taxonomy of Data Prefetching Mechanisms," in Proc. of the International Symposium on Parallel Architectures, Algorithms, and Networks (I-SPAN), May, 2008.
  • Z. Lan, Y.Li, Z. Zheng, and P. Gujrati, " Enhancing Application Robustness through Adaptive Fault Tolerance," in Proc. of the NSFNGS Workshop (in conjunction with IPDPS'08), April, 2008.
  • X.-H. Sun, Z. Lan, Y. Li, H. Jin, and Z. Zheng, "Towards a Fault-aware Computing Environment," in Proc. of the High Availability and Performance Computing Workshop (HAPCW), Mar. 2008

[2007]

  • L. Piccoli, X.-H. Sun, J. Simone, et. al.,"The LQCD Workflow Experience: What We Have Learned (Poster Presentation)," in Proc. of the ACM/IEEE SuperComputing Conf. 2007 (SC'07), Nov. 2007.
  • M. Wu, X.-H. Sun, and H. Jin, "Performance under Failure of High-End Computing," in Proc. of the ACM/IEEE SuperComputing Conf. 2007 (SC'07), Nov. 2007. (acceptance rate: 54/268=20.1%).
  • Y. Chen, S. Byna, and X.-H. Sun, "Data Access History Cache and Associated Data Prefetching Mechanisms," in Proc. of the ACM/IEEE SuperComputing Conf. 2007 (SC'07), Nov. 2007. (acceptance rate: 54/268=20.1%).
  • Z. Zheng, Y. Li, and Z. Lan, "Anomaly Localization in Large-scale Clusters," In Proc. of IEEE Cluster'07, Sep. 2007
  • P. Gujrati, Y. Li, Z. Lan, R. Thakur, and J. White, "Exploring Meta-learning to Improve Failure Prediction in Supercomputing Clusters," in Proc. of 2007 International Conference on Parallel Processing (ICPP'07), Sept. 2007.
  • Y. Li, P. Gujrati, Z. Lan, and X.-H. Sun, "Fault-Driven Re-Scheduling For Improving System-level Fault Resilience," in Proc. of 2007 International Conference on Parallel Processing (ICPP'07), Sept. 2007.
  • X.-H. Sun and M. Wu, "Quality of Service of Grid Computing: Resource Sharing," in Proc. of the 6th International Conference on Grid and Cooperative Computing(GCC'07), Aug. 2007.
  • Z. Lan, Y. Li, P. Gujrati, Z. Zheng, R. Thakur, and J. White, "A Fault Diagnosis and Prognosis Service for TeraGrid Clusters," in Proc. of TeraGrid'07, Jun. 2007.
  • Y. Li and Z. Lan, "Using Adaptive Fault Tolerance to Improve Application Robustness on the TeraGrid," in Proc. of TeraGrid'07, Jun. 2007.
  • K. Xiao, N. Chen, S. Ren, L. Shen, X.-H. Sun, K. Kwiat, and M. Macalik, "A Workflow-based Non-intrusive Approach for Enhancing the Survivability of Critical Infrastructures in Cyber Environment," in Proc. of the 3rd International Workshop on Software Engineering for Secure Systems (SESS'07), May 2007.
  • C. Du, X.-H. Sun, and M. Wu, "Dynamic Scheduling with Process Migration," in Proc. of IEEE International Symposium on Cluster Computing and the Grid 2007, Rio de Janeiro, Brazil, May 2007.
  • X.-H. Sun, S. Byna, and Y. Chen, "Improving Data Access Performance with Server Push Architecture," in Proc. of the NSF Next Generation Software Program Workshop (in conjunction with IPDPS '07), March 2007.

[2006]

[2005]

[2004]

Previous Publications



Illinois Institute of Technology
Home | About | Contact | Sitemap