Xiaohua “Tony” Hu, Ph.D.

Contact Information

Phone: (215) 895-0551


Ph.D. - Computer Science | University of Regina - Canada (1995)
M.Sc. - Computer Science | Simon Fraser University - Canada
M.Eng. - Computer Engineering | Institute of Computing Technology, Chinese Academy of Sciences
B.Sc. - Software | Wuhan University

Title: CVDI Site Director & Founder; Professor in the College of Computing and Informatics, Drexel University


Xiaohua Tony Hu (Ph.D., 1995) is a full professor and the founding director of the data mining and bioinformatics lab at the College of Computing and Informatics (formerly the College of Information Science and Technology, a leading information science school in the USA that U.S. News & World Report ranked #1 in 1999 and #6 in 2010 in information systems). He also serves as the founding Co-Director of the NSF Center (I/U CRC) on Visual and Decision Informatics (NSF CVDI), Chair of the IEEE Computer Society Bioinformatics and Biomedicine Steering Committee, and Chair of the IEEE Computer Society Big Data Steering Committee. Tony is a scientist, teacher, and entrepreneur. He joined Drexel University in 2002. He founded the International Journal of Data Mining and Bioinformatics (SCI indexed) in 2006 and the International Journal of Granular Computing, Rough Sets and Intelligent Systems in 2008. Earlier, he worked as a research scientist at leading R&D centers such as the Nortel Research Center and Verizon Labs (formerly GTE Labs). In 2001, he founded DMW Software in Silicon Valley, California. He has extensive experience converting original ideas into research prototypes and eventually into commercial products; many of his research ideas have been integrated into commercial products and applications in data mining, fraud detection, and database marketing.

Tony’s current research interests are in data/text/web mining, big data, bioinformatics, information retrieval and information extraction, social network analysis, healthcare informatics, and rough set theory and applications. He has published more than 240 peer-reviewed research papers in journals, conferences, and books, including IEEE/ACM Transactions (IEEE/ACM TCBB, IEEE TFS, IEEE TKDE, IEEE TITB, IEEE SMC, IEEE Computer, IEEE NanoBioScience, IEEE Intelligent Systems), JIS, KAIS, CI, DKE, IJBRA, SIGKDD, IEEE ICDM, IEEE ICDE, SIGIR, ACM CIKM, IEEE BIBE, and IEEE CICB, and has co-edited 20 books and proceedings. He has received several prestigious awards, including the 2005 National Science Foundation (NSF) CAREER Award, the best paper award at the 2007 International Conference on Artificial Intelligence, the best paper award at the 2004 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, the 2010 IEEE Granular Computing Outstanding Contribution Award, the 2007 IEEE Bioinformatics and Bioengineering Outstanding Contribution Award, the 2006 IEEE Granular Computing Outstanding Service Award, and the 2001 IEEE Data Mining Outstanding Service Award. He has also served as a program co-chair or conference co-chair of 14 international conferences and workshops and as a program committee member of more than 80 international conferences in the above areas. He is the founding editor-in-chief of the International Journal of Data Mining and Bioinformatics (SCI indexed) and the International Journal of Granular Computing, Rough Sets and Intelligent Systems, and an associate editor or editorial board member of four international journals (KAIS, IJDWM, IJSOI, and JCIB). His research projects are funded by the National Science Foundation (NSF), the US Dept. of Education, the PA Dept. of Health, and the Natural Science Foundation of China (NSFC). He has obtained more than US$8.0 million in research grants in the past 8 years as PI or Co-PI (PI of 7 NSF grants and PI of 1 IMLS grant in the last 8 years). He graduated 13 Ph.D. students from 2006 to 2014 and is currently supervising 12 Ph.D. students.

Investigated Projects

  • Large-Scale Probabilistic Anomaly Detection from Text and Its Application to Medical Records 7a.004.DU
  • Big Data Analysis in Social Media Applications (Joint USA-Finland Proposal) 15.3
  • Analyzing, Modeling and Summarizing Social Media and Linked Datasets
  • Large-Scale Social Media Analytical Tools with Application to Detecting Emerging Events
  • NSF Research Experience for Teachers (RET)
  • NSF Research Experience for Undergraduates (REU)
  • NSF Research Experience for Veterans
  • NSF CORBI: Modeling, Visualization, and Understanding of Large Data Sets
  • GOALI: A Joint Model for Image Topic and Emotion Identification and Annotation
  • Collaborative Research: Fundamental Research in Visualization-based Gap Analysis and Link Prediction
  • A Predictive Analytics Framework for Spatiotemporal Hotspots

Publications


  • Zhou G., Zhiyuan Zhu, Tingting He, Hu X.: Cross-lingual sentiment classification with stacked autoencoders. Knowl. Inf. Syst. 47(1): 27-44 (2016)
  • Zhou G., Zhiwen Xie, Tingting He, Jun Zhao, Hu X. Learning the Multilingual Translation Representations for Question Retrieval in Community Question Answering via Non-Negative Matrix Factorization. IEEE/ACM Trans. Audio, Speech & Language Processing 24(7): 1305-1314 (2016)
  • Hu X., Lin T., Raghavan R., Wah B., Baeza-Yates R., Fox G., Shahal C., Smith M., Yang Q., Ghani R., Fan H., Lempel R., Nambir R. (Eds.), Proceedings of the 2013 IEEE International Conference on Big Data (IEEE Big Data 2013), ISBN: 978-1-4799-1292-6, Oct 6-9, 2013, Santa Clara, CA, USA
  • Wang Y., Hu X., AOBA: Recognizing Object Behavior in Pervasive Urban Management, IEEE Transactions on Knowledge and Data Engineering, 26(11): 2625-2638 (2014)
  • Wang Y., Hu X., Fuzzy Reasoning of Accident Provenance in Pervasive Healthcare Monitoring Systems, IEEE Journal of Biomedical and Health Informatics, 17(6): 1015-1022 (2013)
  • Zhang X., Hu X., He T., Park E.K., Zhou X., Utilizing Different Link Types to Enhance Document Clustering based on Markov Random Field Model with Relaxation Labeling, accepted to be published in IEEE Transactions on Systems, Man, and Cybernetics, Part A, 2012
  • Xin Chen, Hu X., He T., An Y., Wu X., Inferring functional groups from microbial gene catalogue with probabilistic topic models, accepted to be published in IEEE Transactions on NanoBioscience, 2012
  • Chen Y., Yin X., Li Z., Hu X., Promoting Ranking Diversity for Biomedical Information Retrieval based on LDA, accepted to be published in BMC Genomics, 2012
  • Lu C., Hu X., Park J., Exploiting the Social Tagging Network for Web Clustering, IEEE Transactions on Systems, Man, and Cybernetics, Part A, Vol. 41 (5), Sept. 2011, pp. 840-852
  • Hu X., Park, E.K., Zhang X., Microarray Gene Cluster Identification and Annotation through Cluster Ensemble and EM based Informative Textual Summarization, IEEE Transactions on Information Technology in Biomedicine, Sept., 2009, Vol. 13, No. 5, pp832-840
  • Hu X., Shen X., Mining Biomedical Literature for Identification of Potential Virus/Bacteria, in IEEE Intelligent Systems, Nov/Dec 2009, Vol. 24, No. 6, pp. 73-77
  • Hu X., Zhang X., Yoo I., Wang X., Feng J., Mining Hidden Connections among Biomedical Concepts from Disjoint Biomedical Literature Sets through Semantic-based Association Rule, International Journal of Intelligent Systems, 25(2): 207-223 (2010)
  • Zhou X., Hu X., Zhang X., Topic Signature Language Models for Ad-hoc Retrieval, in IEEE Transactions on Knowledge and Data Engineering (IEEE TKDE), Sept 2007, pp 276-287
  • Li Y., Hu X., Lin H., Yang Z., A Framework for Semi-supervised Feature Generation and its Applications in Biomedical Literature Mining, IEEE/ACM Transactions on Computational Biology and Bioinformatics, March-April 2011, pp. 294-307
  • Lu C., Park J., Hu X., User Tags versus Expert-created Metadata: A Comparison between LibraryThing Tags and Library of Congress Subject Headings, accepted to be published in Journal of Information Science
  • Yan R., Li C., Heish H., Hu P., Hu Xiaohua, He T., Socialized Language Model Smoothing via Bi-Directional Influence Propagation on Social Network, WWW 2016, Montreal, Canada, April 10-14, 2016 (full paper, acceptance rate: 16%)
  • Liu M., Fang Y., Park D., Hu Xiaohua, Yu Z., Retrieving Non-Redundant Questions to Summarize a Product Review, SIGIR 2016, Pisa, Italy, July 17-21, 2016 (full paper, acceptance rate: 18%)
  • Yan R., Cheng-Te Li, Hu Xiaohua, Ming Zhang: Chinese Couplet Generation with Neural Network Structures. ACL (1) 2016
  • Wanying Ding, Yue Shang, Lifan Guo, Xiaohua Hu, Rui Yan, Tingting He. Video Popularity Prediction by Sentiment Propagation via Implicit Network, regular paper, CIKM 2015
  • Rui Yan, Xiang Li, Mengwen Liu, Xiaohua Hu, Tackling Sparsity, the Achilles Heel of Social Networks: Language Model Smoothing via Social Regularization. ACL (2) 2015: 623-629
  • Cercone N., Hou L., Keselj V., An A., Naruedomkul N., Hu X., From Computational Intelligence to Web Intelligence, in IEEE Computer, November 2002, pp. 72-76
  • Chen X., Hu X., Zhou Z., An Y., He T., Park E.K., Modeling Semantic Relations between Visual Attributes and Object Categories via Dirichlet Forest Prior, in ACM CIKM 2012 (full paper, acceptance rate: 13.4%)
  • An Y., Hu X., Song Y., Learning to Discover Complex Mappings from Web Forms to Ontologies, in ACM CIKM 2012 (full paper, acceptance rate: 13.4%)
about/personnel/tony_hu.txt · Last modified: 2021/02/10 15:59 by sally.johnson