Publications since joining Penn State

Recent (2000-2024) Publications, Awards, Talks, Etc.
(Always in need of an update it seems. For latest bibliography list, most can be found in DBLP and Google Scholar.)

For a list of earlier publications, please go here.

2024

  •  John Stogin, Ankur Arjun Mali, C. Lee Giles, "A provably stable neural network Turing Machine with finite precision and time," Inf. Sci. 658: 120034 (2024)      
  • Ting-Yao Hsu, Chieh-Yang Huang, Shih-Hong Huang, Ryan A. Rossi, Sungchul Kim, Tong Yu, C. Lee Giles, Ting-Hao Kenneth Huang, "SciCapenter: Supporting Caption Composition for Scientific Figures with Machine-Generated Captions and Ratings, CHI Extended Abstracts 2024: 284:1-284:9  
  • Neisarg Dave, Daniel Kifer, C. Lee Giles, Ankur Mali, "Stability Analysis of Various Symbolic Rule Extraction Methods from Recurrent Neural Network," CoRR abs/2402.02627 (2024)      
  • Mukund Srinath, Pranav Venkit, Maria Badillo, Florian Schaub, C. Lee Giles, Shomir Wilson, "Automated Detection and Analysis of Data Practices Using A Real-World Corpus," CoRR abs/2402.11006 (2024)     
  • Ting-Yao Hsu, Chieh-Yang Huang, Shih-Hong Huang, Ryan A. Rossi, Sungchul Kim, Tong Yu, C. Lee Giles, Ting-Hao K. Huang, "SciCapenter: Supporting Caption Composition for Scientific Figures with Machine-Generated Captions and Ratings," CoRR abs/2403.17784 (2024)

2023

  • Zeba Karishma, Shaurya Rohatgi, Kavya Shrinivas Puranik, Jian Wu, C. Lee Giles, "ACL-Fig: A Dataset for Scientific Figure Classification," Scientific Document Understanding Workshop at 35th AAAI Conference on Artificial Intelligence (SDU@AAAI). 2023.
  • Mukund Srinath, Lee Matheson, Pranav Narayanan Venkit, Gabriela Zanfir-Fortuna, Florian Schaub, C. Lee Giles, Shomir Wilson, "Privacy Now or Never: Large-Scale Extraction and Analysis of Dates in Privacy Policy Text. DocEng 2023: 24:1-24:4
  • Mukund Srinath, Soundarya Nurani Sundareswara, Pranav Venkit, C. Lee Giles, Shomir Wilson, "Privacy Lost and Found: An Investigation at Scale of Web Privacy Policy Availability," DocEng 2023: 26:1-26:10      
  • Ting-Yao Hsu, Chieh-Yang Huang, Ryan A. Rossi, Sungchul Kim, C. Lee Giles, Ting-Hao Kenneth Huang, "GPT-4 as an Effective Zero-Shot Evaluator for Scientific Figure Captions," EMNLP (Findings) 2023: 5464-5474
  • Alexander G Ororbia., Ankur Mali, Daniel Kifer, and C. Lee Giles, "Backpropagation-Free Deep Learning with Recursive Local Representation Alignment," Proceedings of the AAAI Conference on Artificial Intelligence, vol. 37, no. 8, pp. 9327-9335. 2023.    
  • Tatiana Chakravorti, Robert Fraleigh, Timothy Fritton, Michael McLaughlin, Vaibhav Singh, Christopher Griffin, Anthony Kwasnica, David M. Pennock, C. Lee Giles, Sarah Rajtmajer, "A Prototype Hybrid Prediction Market for Estimating Replicability of Published Work," Augmenting Human Intellect - Proceedings of the Second International Conference on Hybrid Human-Artificial Intelligence HHAI 2023," 300-309, 2023.

2022

2021

  • Nakshatri, Nishanth, Arjun Menon, C. Lee Giles, Sarah Rajtmajer, and Christopher Griffin, "Design and analysis of a synthetic prediction market using dynamic convex sets," Results in Control and Optimization, 5, 100052, 2021.
  • Rao, Shivansh, Vikas Kumar, Daniel Kifer, C. Lee Giles, and Ankur Mali, "Omnilayout: Room layout reconstruction from indoor spherical panoramas," Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 3706-3715, 2021.
  • Shokouhi, P., Girkar, V., Rivière, J., Shreedharan, S., Marone, C., Giles, C. L., & Kifer, D., "Deep learning can predict laboratory quakes from active source seismic data, Geophysical Research Letters, 48(12), e2021GL093187, 2021.
  • Kaixuan Zhang, Qinglong Wang, C. Lee Giles, "An Entropy Metric for Regular Grammar Classification and Learning with Recurrent Neural Networks," Entropy, 23(1): 127, 2021.
  • Byron Reeves, Nilam Ram, Thomas N. Robinson, James J. Cummings, C. Lee Giles, Jennifer Pan, Agnese Chiatti, Mj Cho, Katie Roehrick, Xiao Yang, Anupriya Gagneja, Miriam Brinberg, Daniel Muise, Yingdan Lu, Mufan Luo, Andrew Fitzgerald, Leo Yeykelis, "Screenomics: A Framework to Capture and Analyze Personal Life Experiences and the Ways that Technology Shapes Them," Human-Computer Interactions, 36(2): 150-201, 2021.
  • Liu, Lu, Nima Dehmamy, Jillian Chown, C. Lee Giles, and Dashun Wang, "Understanding the onset of hot streaks across artistic, cultural, and scientific careers," Nature communications, 12(1), 1-10, 2021.
  • Teja Lanka, Sree Sai, Sarah Rajtmajer, Jian Wu, and C. Lee Giles, "Extraction and evaluation of statistical information from social and behavioral science papers," In Companion Proceedings of the Web Conference 2021, 426-430, 2021.
  • Ling, Meng, Jian Chen, Torsten Möller, Petra Isenberg, Tobias Isenberg, Michael Sedlmair, Robert S. Laramee, Han-Wei Shen, Jian Wu, and C. Lee Giles, "Document domain randomization for deep learning document layout extraction," International Conference on Document Analysis and Recognition (ICDAR), 497-513, 2021.
  • Dave, Neisarg, Riley Bakes, Barton Pursel, and C. Lee Giles, "Math Multiple Choice Question Solving and Distractor Generation with Attentional GRU Networks," International Educational Data Mining Society (EDM),  2021.
  • Feng Xia, C. Lee Giles, Huan Liu, Kuansan Wang: Guest Editorial: Scholarly Big Data, "IEEE Trans. Emerg. Top. Comput." 9(1): 200-203, 2021.
  • Sai Ajay Modukuri, Sarah Michele Rajtmajer, Anna Cinzia Squicciarini, Jian Wu, C. Lee Giles, "Understanding and Predicting Retractions of Published Work.," Proceedings of the Workshop on Scientific Document Understanding co-located with 35th AAAI Conference on Artificial Intelligence, SDU@AAAI, 2021
  • Mukund Srinath, Soundarya Nurani Sundareswara, C. Lee Giles, Shomir Wilson, "PrivaSeer: A Privacy Policy Search Engine," Web Engineering - 21st International Conference, ICWE 2021, 286-301, 2021.
  • Kandimalla, Bharath, Shaurya Rohatgi, Jian Wu, and C. Lee Giles, "Large scale subject category classification of scholarly papers with deep attentive neural networks," Frontiers in research metrics and analytics, (5), 600382, 2021.
  • Nishanth Nakshatri, Arjun Manoj Menon, C. Lee Giles, Sarah Michele Rajtmajer, Christopher Griffin, "Design and Analysis of a Synthetic Prediction Market using Dynamic Convex Sets," CoRR abs/2101.01787 (2021)
  • Mali, Ankur, Alexander G. Ororbia, Daniel Kifer, and C. Lee Giles, "Recognizing and verifying mathematical equations using multiplicative differential neural units," In Proceedings of the AAAI Conference on Artificial Intelligence, 35 (6), 5006-5015. 2021.
  • Ankur Mali, Alexander Ororbia, Daniel Kifer, C. Lee Giles, "Recognizing and Verifying Mathematical Equations using Multiplicative Differential Neural Units," CoRR abs/2104.02899 (2021)
  • Jian Wu, Rajal Nivargi, Sree Sai Teja Lanka, Arjun Manoj Menon, Sai Ajay Modukuri, Nishanth Nakshatri, Xin Wei, Zhuoer Wang, James Caverlee, Sarah Michele Rajtmajer, C. Lee Giles, "Predicting the Reproducibility of Social and Behavioral Science Papers Using Supervised Learning Models.,"CoRR abs/2104.04580 (2021)
  • Shivansh Rao, Vikas Kumar, Daniel Kifer, C. Lee Giles, Ankur Mali, "OmniLayout: Room Layout Reconstruction from Indoor Spherical Panoramas," CoRR abs/2104.09403 (2021)

2020

  • Bharath Kandimalla, Shaurya Rohatgi, Jian Wu, C. Lee Giles, "Large Scale Subject Category Classification of Scholarly Papers With Deep Attentive Neural Networks," Frontiers in Research Metrics and Analytics, Volume 5: 600382 (2020)
  • Yasith Jayawardana, Alexander C. Nwala, Gavindya Jayawardena, Jian Wu, Sampath Jayarathna, Michael L. Nelson, C. Lee Giles, "Modeling Updates of Scholarly Webpages Using Archived Data," IEEE BigData 2020: 1868-1877
  • Shaurya Rohatgi, Jian Wu, C. Lee Giles: PSU at CLEF-2020 ARQMath Track: Unsupervised Re-ranking using Pretraining. CLEF (Working Notes) 2020
  • Shaurya Rohatgi, Zeba Karishma, Jason Chhay, Sai Raghav Reddy Keesara, Jian Wu, Cornelia Caragea, C. Lee Giles: COVIDSeer: Extending the CORD-19 Dataset. DocEng 2020: 21:1-21:4
  • Wei Zhong, Shaurya Rohatgi, Jian Wu, C. Lee Giles, Richard Zanibbi: Accelerating Substructure Similarity Search for Formula Retrieval. ECIR (1) 2020: 714-727
  • Jian Wu, Pei Wang, Xin Wei, Sarah Michele Rajtmajer, C. Lee Giles, Christopher Griffin: Acknowledgement Entity Recognition in CORD-19 Papers. SDP@EMNLP 2020: 10-19
  • Kunho Kim, Athar Sefid, C. Lee Giles: Learning CNF Blocking for Large-scale Author Name Disambiguation. SDP@EMNLP 2020: 72-80
  • Ke Yuan, Dafang He, Xiao Yang, Zhi Tang, Daniel Kifer, C. Lee Giles: Follow The Curve: Arbitrarily Oriented Scene Text Detection Using Key Points Spotting And Curve Prediction. ICME 2020: 1-6
  • Krutarth Patel, Cornelia Caragea, Jian Wu, C. Lee Giles: Keyphrase Extraction in Scholarly Digital Library Search Engines. ICWS 2020: 179-196
  • Kaixuan Zhang, Qinglong Wang, C. Lee Giles: Deep Learning, Grammar Transfer, and Transportation Theory. ECML/PKDD (2) 2020: 609-623
  • Alexander Ororbia, Ankur Mali, Daniel Kifer, C. Lee Giles: Reducing the Computational Burden of Deep Learning with Recursive Local Representation Alignment. CoRR abs/2002.03911 (2020)
  • Ankur Mali, Alexander Ororbia, Daniel Kifer, Clyde Lee Giles: Recognizing Long Grammatical Sequences Using Recurrent Networks Augmented With An External Differentiable Stack. CoRR abs/2004.07623 (2020)
  • Mukund Srinath, Shomir Wilson, C. Lee Giles: Privacy at Scale: Introducing the PrivaSeer Corpus of Web Privacy Policies. CoRR abs/2004.11131 (2020)
  • Ting-Hao Kenneth Huang, Chieh-Yang Huang, Chien-Kuang Cornelia Ding, Yen-Chia Hsu, C. Lee Giles: CODA-19: Reliably Annotating Research Aspects on 10,000+ CORD-19 Abstracts Using a Non-Expert Crowd. CoRR abs/2005.02367 (2020)
  • John Stogin, Ankur Mali, C. Lee Giles: Provably Stable Interpretable Encodings of Context Free Grammars in RNNs with a Differentiable Stack. CoRR abs/2006.03651 (2020)
  • Bharath Kandimalla, Shaurya Rohatgi, Jian Wu, C. Lee Giles: Large Scale Subject Category Classification of Scholarly Papers with Deep Attentive Neural Networks. CoRR abs/2007.13826 (2020)
  • Athar Sefid, Clyde Lee Giles, Prasenjit Mitra: Extractive Summarizer for Scholarly Articles. CoRR abs/2008.11290 (2020)
  • Yasith Jayawardana, Alexander C. Nwala, Gavindya Jayawardena, Jian Wu, Sampath Jayarathna, Michael L. Nelson, C. Lee Giles: Modeling Updates of Scholarly Webpages Using Archived Data. CoRR abs/2012.03397 (2020)
  • Xiaoxiao Li, Rabah A. Al-Zaidy, Amy X. Zhang, Stefan Baral, Le Bao, C. Lee Giles: Automating Document Classification with Distant Supervision to Increase the Efficiency of Systematic Reviews. CoRR abs/2012.07565 (2020)
  • Kaixuan Zhang, Qinglong Wang, C. Lee Giles. "Adversarial Models for Deterministic Finite Automata," Advances in Artificial Intelligence - 33rd Canadian Conference on Artificial Intelligence, Canadian AI 2020,  540-552, 2020.
  • Ke Yuan, Dafang He, Zhuoren Jiang, Liangcai Gao, Zhi Tang, C. Lee Giles, "Automatic Generation of Headlines for Online Math Questions," Thirty-Fourth AAAI Conference on Artificial Intelligence, (AAAI 2020), 9490-9497, 2020.
  • Kaixuan Zhang, QinglongWang, Xue Liu, C. Lee Giles, "Shapley Homology: Topological Analysis of Sample Influence for Neural Networks," Neural Computation, 32(7): 1355-1378, 2020.
  • Alexander Ororbia, Ankur Mali, C. Lee Giles, and Daniel Kifer, "Continual learning of recurrent neural networks by locally aligning distributed representations," IEEE Transactions on Neural Networks and Learning Systems, 2020.
  • Ankur Mali, Alexander Ororbia, C. Lee Giles, "Sibling Neural Estimators: Improving Iterative Image Decoding with Gradient Communication," Data Compression Conference (DCC 2020), 23-32, 2020.

2019

2018

  • Qinglong Wang, Kaixuan Zhang, Alexander G. Ororbia II, Xinyu Xing, Xue Liu, C. Lee Giles, "An Empirical Evaluation of Rule Extraction from Recurrent Neural Networks," Neural Computation, 30(9), 2018.
  • Liu, Lu, Yang Wang, Roberta Sinatra, C. Lee Giles, Chaoming Song, Dashun Wang, "Hot streaks in artistic, cultural, and scientific careers," Nature, 559(7714), 396, 2018.
  • Chen Liang, Jianbo Ye, Shuting Wang, Bart Pursel, C. Lee Giles, "Investigating Active Learning for Concept Prerequisite Learning," Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI), In the 8th Symposium on Educational Advances in Artificial Intelligence (EAAI), 2018.
  • Chen Liang, Xiao Yang, Neisarg Dave, Drew Wham, Bart Pursel, C. Lee Giles, "Distractor Generation for Multiple Choice Questions Using Learning to Rank," Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications@NAACL-HLT," 2018.
  • Kunho Kim, Athar Sefid, Bruce A. Weinberg, C. Lee Giles, "A Web Service for Author Name Disambiguation in Scholarly Databases," 2018 IEEE International Conference on Web Services (ICWS 2018), 265-273, 2018.
  • Agnese Chiatti, Mu Jung Cho, Anupriya Gagneja, Xiao Yang, Miriam Brinberg, Katie Roehrick, Sagnik Ray Choudhury, Nilam Ram, Byron Reeves, C. Lee Giles, "Text extraction and retrieval from smartphone screenshots: building a repository for life in media," Proceedings of the 33rd Annual ACM Symposium on Applied Computing (SAC 2018), 948-955, 2018.
  • Rabah A. Al-Zaidy, C. Lee Giles, "Extracting Semantic Relations for Scholarly Knowledge Base Construction," 12th IEEE International Conference on Semantic Computing (ICSC 2018), 56-63, 2018.
  • Agnese Chiatti, Mu Jung Cho, Anupriya Gagneja, Xiao Yang, Miriam Brinberg, Katie Roehrick, Sagnik Ray Choudhury, Nilam Ram, Byron Reeves, C. Lee Giles: Text Extraction and Retrieval from Smartphone Screenshots: Building a Repository for Life in Media. CoRR abs/1801.01316 (2018)
  • Qinglong Wang, Kaixuan Zhang, Alexander G. Ororbia II, Xinyu Xing, Xue Liu, C. Lee Giles: A Comparison of Rule Extraction for Different Recurrent Neural Network Models and Grammatical Complexity. CoRR abs/1801.05420 (2018)
  • Chen Liang, Jianbo Ye, Han Zhao, Bart Pursel, C. Lee Giles: Active Learning of Strict Partial Orders: A Case Study on Concept Prerequisite Relations. CoRR abs/1801.06481 (2018)
  • Alexander G. Ororbia II, Ankur Mali, Daniel Kifer, C. Lee Giles: Conducting Credit Assignment by Aligning Local Representations. CoRR abs/1803.01834 (2018)
  • Alexander G. Ororbia II, Ankur Mali, Jian Wu, Scott O'Connell, David J. Miller, C. Lee Giles: Learned Iterative Decoding for Lossy Image Compression Systems. CoRR abs/1803.05863 (2018)
  • Dafang He, Yeqing Li, Alexander N. Gorban, Derrall Heath, Julian Ibarz, Qian Yu, Daniel Kifer, C. Lee Giles: Guided Attention for Large Scale Scene Text Verification. CoRR abs/1804.08588 (2018)

2017

2016

2015

  • Alexander G. Ororbia II, C. Lee Giles, and David Reitter, Online Semi-Supervised Learning with Deep Hybrid Boltzmann Machines and Denoising Autoencoders. arXiv:1511.06964 [cs], 2015.
  • Alexander G. Ororbia II, C. Lee Giles, and David Reitter. "Learning a deep hybrid model for semi-supervised text classification," Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP), Lisbon, Portugal, 2015.
  • Alexander G. Ororbia II, David Reitter, Jian Wu, and C. Lee Giles. "Online learning of deep hybrid architectures for semi-supervised categorization," Proc. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD). Porto, Portugal, 2015.
  • Wenyi Huang, Zhaohui Wu, Liang Chen, Prasenjit Mitra, C. Lee Giles, "A Neural Probabilistic Model for Context Based Citation Recommendation," Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence AAAI 2015,  2404-2410, 2015.
  • Hung-Hsuan Chen, C. Lee Giles, "ASCOS++: An Asymmetric Similarity Measure for Weighted Networks to Address the Problem of SimRank,"  ACM Transactions on Knowledge Discovery from Data (TKDD), 10(2): 15.1-26, (2015).
  • Sujatha Das Gollapalli, Cornelia Caragea, Prasenjit Mitra, C. Lee Giles, "Improving Researcher Homepage Classification with Unlabeled Data," ACM Transactions on the Web (TWEB), 9(4): 17.1-32, (2015).
  • Jian Wu, Kyle Mark Williams, Hung-Hsuan Chen, Madian Khabsa, Cornelia Caragea, Suppawong Tuarob, Alexander Ororbia, Douglas Jordan, Prasenjit Mitra, C. Lee Giles, "CiteSeerX: AI in a Digital Library Search Engine," AI Magazine, 36(3): 35-48 (2015).
  • Madian Khabsa, C. Lee Giles, "Chemical entity extraction using CRF and an ensemble of extractors," J. Cheminformatics, 7(S-1): S12 (2015).
  • Martin Krallinger, Obdulia Rabal, Florian Leitner, Miguel Vazquez, David Salgado, Zhiyong Lu, Robert Leaman, Yanan Lu, Donghong Ji, Daniel M. Lowe, Roger A. Sayle, Riza Theresa Batista-Navarro, Rafal Rak, Torsten Huber, Tim Rocktäschel, Sérgio Matos, David Campos, Buzhou Tang, Hua Xu, Tsendsuren Munkhdalai, Keun Ho Ryu, S. V. Ramanan, P. Senthil Nathan, Slavko Zitnik, Marko Bajec, Lutz Weber, Matthias Irmer, Saber A. Akhondi, Jan A. Kors, Shuo Xu, Xin An, Utpal Kumar Sikdar, Asif Ekbal, Masaharu Yoshioka, Thaer M. Dieb, Miji Choi, Karin Verspoor, Madian Khabsa, C. Lee Giles, Hongfang Liu, Ravikumar Komandur Elayavilli, Andre Lamurias, Francisco M. Couto, Hong-Jie Dai, Richard Tzong-Han Tsai, Caglar Ata, Tolga Can, Anabel Usie, Rui Alves, Isabel Segura-Bedmar, Paloma Martínez, Julen Oyarzabal, Alfonso Valencia, "The CHEMDNER corpus of chemicals and drugs and its annotation principles," J. Cheminformatics, 7(S-1): S2 (2015).
  • Suppawong Tuarob, Line C. Pouchard, Prasenjit Mitra, C. Lee Giles, "A generalized topic modeling approach for automatic document annotation," Int. J. on Digital Libraries, 16(2): 111-128 (2015).
  • Zhaohui Wu, C. Lee Giles, "Sense-aware Semantic Analysis: A Multi-Prototype Word Representation Model Using Wikipedia," Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2015), 2188-2194, 2015.
  • Wenyi Huang, Zhaohui Wu, Liang Chen, Prasenjit Mitra, C. Lee Giles, "A Neural Probabilistic Model for Context Based Citation Recommendation," Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2015), 2404-2410, 2015.
  • Jian Wu, Jason Killian, Huaiyu Yang, Kyle Williams, Sagnik Ray Choudhury, Suppawong Tuarob, Cornelia Caragea, C. Lee Giles, "PDFMEF: A Multi-Entity Knowledge Extraction Framework for Scholarly Documents and Semantic Search," Proceedings of the 8th International Conference on Knowledge Capture (K-CAP 2015), 13:1-13:8, 2015.
  • Rabah A. Al-Zaidy, C. Lee Giles, "Automatic Extraction of Data from Bar Charts," Proceedings of the 8th International Conference on Knowledge Capture (K-CAP 2015), 30:1-30:4, 2015.
  • Zhaohui Wu, Liang Chen, C. Lee Giles: "Storybase: Towards Building a Knowledge Base for News Events," Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL 2015), (System Demonstrations) 133-138, 2015.
  • Sagnik Ray Choudhury, Prasenjit Mitra, Clyde Lee Giles, "Automatic Extraction of Figures from Scholarly Documents," 2015 ACM Symposium on Document Engineering (DocEng 2015), 47-50, 2015.
  • Chen Liang, Shuting Wang, Zhaohui Wu, Kyle Williams, Bart Pursel, Benjamin Bräutigam, Sherwyn Saul, Hannah Williams, Kyle Bowen, C. Lee Giles, "BBookX: An Automatic Book Creation Framework," 2015 ACM Symposium on Document Engineering (DocEng 2015), 121-124, 2015.
  • Shuting Wang, Chen Liang, Zhaohui Wu, Kyle Williams, Bart Pursel, Benjamin Bräutigam, Sherwyn Saul, Hannah Williams, Kyle Bowen, C. Lee Giles: "Concept Hierarchy Extraction from Textbooks," 2015 ACM Symposium on Document Engineering (DocEng 2015), 147-156, 2015.
  • Alexander G. Ororbia II, C. Lee Giles, David Reitter, "Learning a Deep Hybrid Model for Semi-Supervised Text Classification," 2015 Conference on Empirical Methods in Natural Language Processing(EMNLP 2015), 471-481, 2015.
  • Chen Liang, Zhaohui Wu, Wenyi Huang, C. Lee Giles, "Measuring Prerequisite Relations Among Concepts," 2015 Conference on Empirical Methods in Natural Language Processing(EMNLP 2015), 1668-1674, 2015.
  • Madian Khabsa, Pucktada Treeratpituk, C. Lee Giles, "Online Person Name Disambiguation with Constraints," 15th ACM/IEEE-CE on Joint Conference on Digital Libraries (JCDL 2015), 37-46, 2105.
  • Pucktada Treeratpituk, Madian Khabsa, C. Lee Giles, "Automatically Generating a Concept Hierarchy with Graphs,"15th ACM/IEEE-CE on Joint Conference on Digital Libraries (JCDL 2015), 265-266, 2015.
  • Alexander G. Ororbia II, David Reitter, Jian Wu, C. Lee Giles, "Online Learning of Deep Hybrid Architectures for Semi-supervised Categorization," Machine Learning and Knowledge Discovery in Databases - European Conference (ECML PKDD 2015), 516-532, 2015.
  • Dayu Yuan, Prasenjit Mitra, Huiwen Yu, C. Lee Giles, "Updating Graph Indices with a One-Pass Algorithm," 2015 ACM SIGMOD International Conference on Management of Data, 1903-1916, 2015.
  • Kyle Williams, C. Lee Giles, "On the Use of Similarity Search to Detect Fake Scientific Papers," Similarity Search and Applications - 8th International Conference (SISAP 2015), 332-338, 2015.
  • Alexander G. Ororbia II, Jian Wu, Madian Khabsa, Kyle Williams, Clyde Lee Giles, "Big Scholarly Data in CiteSeerX: Information Extraction from the Web," 24th International Conference on World Wide Web Companion (WWW 2015) (Companion Volume), 597-602, 2015.
  • Sagnik Ray Choudhury, Clyde Lee Giles, "An Architecture for Information Extraction from Figures in Digital Libraries," 24th International Conference on World Wide Web Companion (WWW 2015) (Companion Volume), 667-672, 2015.

2014

  • Jian Wu, Kyle Williams, Hung-Hsuan Chen, Madian Khabsa, Cornelia Caragea, Alexander Ororbia, Douglas Jordan, C. Lee Giles, "CiteSeerX: AI in a Digital Library Search Engine," Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, Innovative Applications of Artificial Intelligence, 2930-2937, 2014.
  • Kyle Williams, Hung-Hsuan Chen, C. Lee Giles, "Supervised Ranking for Plagiarism Source Retrieval," Working Notes for CLEF 2014 Conference, 1021-1026, 2014.
  • Kyle Williams, Hung-Hsuan Chen, C. Lee Giles, "Classifying and ranking search engine results as potential sources of plagiarism," ACM Symposium on Document Engineering (DocEng), 97-106, 2014.
  • Kyle Williams, Jian Wu, C. Lee Giles, "SimSeerX: a similar document search engine," ACM Symposium on Document Engineering (DocEng), 143-146, 2014.
  • Madian Khabsa, Pucktada Treeratpituk, C. Lee Giles, "Large scale author name disambiguation in digital libraries," IEEE International Conference on Big Data, 41-42, 2014.
  • Cornelia Caragea, Jian Wu, Alina Maria Ciobanu, Kyle Williams, Juan Pablo Fernández Ramírez, Hung-Hsuan Chen, Zhaohui Wu, C. Lee Giles," CiteSeer x : A Scholarly Big Dataset," Advances in Information Retrieval - 36th European Conference on IR Research (ECIR), 311-322, 2014.
  • Jian Wu, Alexander Ororbia, Kyle Williams, Madian Khabsa, Zhaohui Wu, C. Lee Giles, "Utility-Based Control Feedback in a Digital Library Search Engine: Cases in CiteSeerX," 9th USENIX International Workshop on Feedback Computing, 2014.
  • Jian Wu, Pradeep B. Teregowda, Kyle Williams, Madian Khabsa, Douglas Jordan, Eric Treece, Zhaohui Wu, C. Lee Giles, "Migrating a Digital Library to a Private Cloud," 2014 IEEE International Conference on Cloud Engineering (IC2E) 97-106, 2014.
  • Kyle Williams, Jian Wu, Sagnik Ray Choudhury, Madian Khabsa, C. Lee Giles, "Scholarly big data information extraction and integration in the CiteSeerχ digital library," Proceedings of the 30th International Conference on Data Engineering (ICDE) Workshops: IIWeb 2014 — 10th International Workshop on Information Integration on the Web, 68-73, 2014.
  • Kyle Williams, Lichi Li, Madian Khabsa, Jian Wu, Patrick C. Shih, C. Lee Giles, "A Web Service for Scholarly Big Data Information Extraction," 2014 IEEE International Conference on Web Services  (ICWS 2014), 105-112, 2014.
  • Zhaohui Wu, Jian Wu, Madian Khabsa, Kyle Williams, Hung-Hsuan Chen, Wenyi Huang, Suppawong Tuarob, Sagnik Ray Choudhury, Alexander Ororbia, Prasenjit Mitra, C. Lee Giles, "Towards building a scholarly big data platform: Challenges, lessons and opportunities," IEEE/ACM Joint Conference on Digital Libraries (JCDL 2014), 117-126, 2014.
  • Zhaohui Wu, Wenyi Huang, Liang Chen, C. Lee Giles, "Crowd-sourcing Web knowledge for metadata extraction," IEEE/ACM Joint Conference on Digital Libraries (JCDL 2014), 141-144, 2014
  • Hung-Hsuan Chen, Madian Khabsa, C. Lee Giles, "The feasibility of investing in manual correction of metadata for a large-scale digital library," IEEE/ACM Joint Conference on Digital Libraries (JCDL 2014), 225-228, 2014.
  • Wenyi Huang, Zhaohui Wu, Prasenjit Mitra, C. Lee Giles, "RefSeer: A citation recommendation system," IEEE/ACM Joint Conference on Digital Libraries (JCDL 2014), 371-374, 2014.
  • Kyle Williams, Jian Wu, Sagnik Ray Choudhury, Madian Khabsa, C. Lee Giles: Scholarly big data information extraction and integration in the CiteSeerχ digital library. ICDE Workshops 2014: 68-73, 2014.
  • Sujatha Das Gollapalli, Yanjun Qi, Prasenjit Mitra, C. Lee Giles, "Extracting Researcher Metadata with Labeled Features," Proceedings of the 2014 SIAM International Conference on Data Mining (SDM 2014), 740-748, 2014.
  • Zhaohui Wu, Dayu Yuan, Pucktada Treeratpituk, C. Lee Giles: Science and Ethnicity: How Ethnicities Shape the Evolution of Computer Science Research Community. CoRR abs/1411.1129, 2014.

2013

2012

2011

2010

2009

2008

  • Isaac G. Councill, C. Lee Giles, Min-Yen Kan, "ParsCit: an Open-source CRF Reference String Parsing Package," Proceedings of the International Conference on Language Resources and Evaluation (LREC 2008), 2008.
  • William Browuer, Saurabh Kataria, Sujatha Das, Prasenjit Mitra, C. Lee Giles, "Segregating and extracting overlapping data points in two-dimensional plots, "ACM/IEEE Joint Conference on Digital Libraries (JCDL 2008), 276-279, 2008.
  • Umer Farooq, Craig H. Ganoe, John M. Carroll, Isaac G. Councill, C. Lee Giles, "Design and evaluation of awareness mechanisms in CiteSeer," Information Processing and Management, 44(2): 596-612, 2008.
  • Jian Huang, Omid Madani, C. Lee Giles, "Error-driven generalist+experts (edge): a multi-stage ensemble framework for text categorization,"  17th ACM Conference on Information and Knowledge Management (CIKM 2008), 83-92, 2008.
  • Jian Huang, Ziming Zhuang, Jia Li, C. Lee Giles, "Collaboration over time: characterizing and modeling network evolution,"  International Conference on Web Search and Web Data Mining (WSDM2008), 107-116, 2008.
  • Shu Huang, Qiankun Zhao, Prasenjit Mitra, C. Lee Giles, "Hierarchical Location and Topic Based Query Expansion,"  Twenty-Third AAAI Conference on Artificial Intelligence, (AAAI 2008), 1150-1155, 2008.
  • Saurabh Kataria, William Browuer, Prasenjit Mitra, C. Lee Giles, "Automatic Extraction of Data Points and Text Blocks from 2-Dimensional Plots in Digital Documents,"  Twenty-Third AAAI Conference on Artificial Intelligence (AAAI 2008), 1169-1174, 2008.
  • Huajing Li, Wang-Chien Lee, C. Lee Giles, "Workload analysis for scientific literature digital libraries," International Journal on Digital Libraries, 9(2): 139-149, 2008.
  • Huajing Li, Zaiqing Nie, Wang-Chien Lee, C. Lee Giles, Ji-Rong Wen, "Scalable community discovery on textual data with relations," 17th ACM Conference on Information and Knowledge Management (CIKM 2008), 1203-1212, 2008.
  • Ying Liu, Lucian V. Lita, Radu Stefan Niculescu, Prasenjit Mitra, C. Lee Giles, "Finding a Haystack in Haystacks - Simultaneous Identification of Concepts in Large Bio-Medical Corpora,"  SIAM International Conference on Data Mining (SDM 2008), 668-679, 2008.
  • Ying Liu, Lucian V. Lita, Radu Stefan Niculescu, Kun Bai, Prasenjit Mitra, C. Lee Giles, "Real-time data pre-processing technique for efficient feature extraction in large scale datasets,"  17th ACM Conference on Information and Knowledge Management (CIKM 2008), 981-990, 2008.
  • Ying Liu, Prasenjit Mitra, C. Lee Giles, "Identifying table boundaries in digital documents via sparse line detection,"  17th ACM Conference on Information and Knowledge Management (CIKM 2008), 1311-1320, 2008.
  • Xiaonan Lu, Brewster Kahle, James Ze Wang, C. Lee Giles, "A metadata generation system for scanned scientific volumes," ACM/IEEE Joint Conference on Digital Libraries (JCDL 2008), 167-176, 2008.
  • Yang Song, C. Lee Giles, "Efficient user preference predictions using collaborative filtering," 19th International Conference on Pattern Recognition (ICPR 2008)," 1-4, 2008.
  • Yang Song, Lu Zhang, C. Lee Giles, "A sparse gaussian processes classification framework for fast tag suggestions,"  17th ACM Conference on Information and Knowledge Management (CIKM 2008), 93-102, 2008.
  • Yang Song, Lu Zhang, C. Lee Giles, "A Non-parametric Approach to Pair-Wise Dynamic Topic Correlation Detection," 8th IEEE International Conference on Data Mining (ICDM 2008), 1031-1036, 2008.
  • Yang Song, Ziming Zhuang, Huajing Li, Qiankun Zhao, Jia Li, Wang-Chien Lee, C. Lee Giles, "Real-time automatic tag recommendation,"  31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), 515-522, 2008.
  • Bingjun Sun, Prasenjit Mitra, C. Lee Giles, "Mining, indexing, and searching for textual chemical molecule information on the web,"  17th International Conference on World Wide Web (WWW 2008), 735-744, 2008.
  • Yang Sun, Huajing Li, Isaac G. Councill, Wang-Chien Lee, C. Lee Giles, "Measuring user preference changes in digital libraries,"  17th ACM Conference on Information and Knowledge Management (CIKM 2008), 1497-1498, 2008.
  • Yang Sun, Huajing Li, Isaac G. Councill, Jian Huang, Wang-Chien Lee, C. Lee Giles, "Personalized ranking for digital libraries based on log analysis, "10th ACM International Workshop on Web Information and Data Management (WIDM2008), 133-140, 2008.
  • Yang Sun, Isaac G. Councill, C. Lee Giles, "BotSeer: An Automated Information System for Analyzing Web Robots,"  Eighth International Conference on Web Engineering (ICWE 2008), 108-114, 2008.
  • Qingzhao Tan, Prasenjit Mitra, C. Lee Giles, "Metadata extraction and indexing for map search in web documents,"  17th ACM Conference on Information and Knowledge Management (CIKM 2008), 1367-1368, 2008.
  • Xiaolong Zhang, Yan Qu, C. Lee Giles, Piyou Song, "CiteSense: supporting sensemaking of research literature,"  twenty-sixth annual SIGCHI conference on Human factors in computing systems (CHI 2008), 677-680, 2008.
  • Ding Zhou, Shenghuo Zhu, Kai Yu, Xiaodan Song, Belle L. Tseng, Hongyuan Zha, C. Lee Giles, "Learning multiple graphs for document recommendations,"  17th International Conference on World Wide Web (WWW 2008), 141-150, 2008.
  • Ding Zhou, Jiang Bian, Shuyi Zheng, Hongyuan Zha, C. Lee Giles, "Exploring social annotations for information retrieval,"  17th International Conference on World Wide Web (WWW 2008), 715-724, 2008.
  • Ding Zhou, Shenghuo Zhu, Kai Yu, Xiaodan Song, Belle L. Tseng, Hongyuan Zha, C. Lee Giles, "Learning multiple graphs for document recommendations,"  17th International Conference on World Wide Web (WWW 2008), 141-150, 2008.
  • Ziming Zhuang, Cliff Brunk, Prasenjit Mitra, C. Lee Giles, "Towards Click-Based Models of Geographic Interests in Web Search,"  IEEE/WIC/ACM International Conference on Web Intelligence (WI 2008), 293-299, 2008.
  • Ziming Zhuang, Cliff Brunk, C. Lee Giles, "Modeling and visualizing geo-sensitive queries based on user clicks,"  First International Workshop on Location and the Web (LocWeb 2008), 73-76, 2008.

2007

2006

2005

2004:

2003

2002

2001

2000


 2000 Reprinted Paper: