Information Retrieval
Spring 2014
9:00 AM to 12:00 PM, Thursdays
Prof. Berlin Chen (陳柏琳)

Tentative List of Topics:


Course Overview & Introduction

Book Chapter: Modern Information Retrieval, Ch. 1
Paper: The History of Information Retrieval Research
02/27   Classical Models   cf. Modern Information Retrieval, Ch. 3
03/06   Classical Models  
03/13   Evaluation Metrics  
03/20   Benchmark Collections   HW#1: Evaluations for IR (Due: 03/27)
03/27   Extensions of Classic (Set, Algebra & Probabilistic) Models   HW#2: Classic Models for IR and Query Expansion (Due: 04/10)
04/03   Exercise  
04/10   Relevance Feedback and Query Expansion  
04/17   Latent Semantic Analysis   HW#3: LSA for IR (Due: 05/16)
04/24   Language Modeling for Information Retrieval  
05/01   Language Modeling for Information Retrieval  
05/08   Clustering: Metrics and Techniques  
05/15   Clustering: Metrics and Techniques  
05/22   Indexing and Searching   HW#4: LM for IR (Due: 06/05)
05/29   Paper Presentation
曾厚強: Learning to Predict Readability using Diverse Linguistic Features (COLING 2010)
吳佳厚: Concept-Based Information Retrieval Using Explicit Semantic Analysis (ACM TOIS 2011)
陳煥元: Entity-Centric Document Filtering: Boosting Feature Mapping through Meta-Features (CIKM 2013)
張庭豪: CRF Framework for Supervised Preference Aggregation (CIKM 2013)
朱聖池: Multi-Label Classification by Mining Label and Instance Correlations from Heterogeneous Information Networks
劉書宇: Disambiguating Implicit Temporal Queries by Clustering Top Relevant Dates in Web Snippets (WI&IAT 2012)
06/05   Paper Presentation
王思涵: Search Result Diversification in Resource Selection for Federated Search (SIGIR 2013)
劉憶年: How Do Users Respond to Voice Input Errors? Lexical and Phonetic Query Reformulation in Voice Search (SIGIR 2013)
彭紹峻: Using Micro-Reviews to Select an Efficient Set of Reviews (CIKM 2013)
吳培豪: Toward Whole-Session Relevance: Exploring Intrinsic Diversity in Web Search (SIGIR 2013)
施凱文: Graph-of-word and TW-IDF: New Approach to Ad Hoc IR (CIKM 2013)
陳思澄: An Unsupervised Topic Segmentation Model Incorporating Word Order (SIGIR 2013)
06/12   User Interfaces for Search  
06/19   Web Search Basics  


R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval: The Concepts and Technology behind Search (2nd Edition), ACM Press, 2011

Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press, 2008
W. Bruce Croft, Donald Metzler, and Trevor Strohman, Search Engines: Information Retrieval in Practice, Addison Wesley, 2009


C. C. Aggarwal and C. X. Zhai (eds.), Mining Text Data, Springer, 2012.
W. B. Frakes and R. Baeza-Yates, Information Retrieval: Data Structures & Algorithms,  Prentice-Hall, 1992.
C.X. Zhai, Statistical Language Models for Information Retrieval (Synthesis Lectures Series on Human Language Technologies), Morgan & Claypool Publishers, 2008.

T. K. Landauer, D. S. McNamara, S. Dennis, W. Kintsch (eds.), Handbook of Latent Semantic Analysis, Lawrence Erlbaum, 2007.
D. A. Grossman, O. Frieder, Information Retrieval: Algorithms and Heuristics, Springer, 2004.
I. H. Witten, A. Moffat, and T. C. Bell, Managing Gigabytes: Compressing and Indexing Documents and Images, Morgan Kaufmann Publishing, 1999.
C. Manning and H. Schütze, Foundations of Statistical Natural Language Processing, MIT Press, 1999.
D. Jurafsky and J. H. Martin, Speech and Language Processing, Prentice-Hall, 2000.
W.B. Croft and J. Lafferty (eds.), Language Models for Information Retrieval, Kluwer International Series on Information Retrieval, Volume 13, Kluwer Academic Publishers, 2002.
Stephen Robertson and Hugo Zaragoza, The Probabilistic Relevance Framework: BM25 and Beyond. Foundations and Trends in Information Retrieval 3 no. 4, 333-389 (2009).
D. Carmel and E. Yom-Tov, "Estimating the Query Difficulty for Information Retrieval," Synthesis Lectures on Information Concepts, Retrieval, and Services, Morgan & Claypool Publishers, 2010.



M. Sanderson and W. B. Croft, "The history of information retrieval research," Proceedings of the IEEE, Vol. 100, pp. 1444-1451, May 2012.
O. Kolomiyets, M.-F. Moens, "A survey on question answering technology from an information retrieval perspective," Information Sciences 181 (2011) 5412–5434
Johan Schalkwyk et al., "Google Search by Voice: A case study," 2010.
D. Blei, A. Ng, and M. Jordan, "Latent Dirichlet allocation,"  Journal of Machine Learning Research, 3:993-1022, January 2003.
V. Lavrenko and W.B. Croft, "Relevance-Based Language Models," ACM SIGIR 2001.
C. H. Papadimitriou, P. Raghavan, H. Tamaki, S. Vempala, "Latent semantic indexing: A probabilistic analysis." Analyzes an information retrieval technique related to principal component analysis.
X. Liu and W.B. Croft, "Statistical Language Modeling for Information Retrieval," Annual Review of Information Science and Technology, vol. 39, 2005.
Lan Huang. A Survey On Web Information Retrieval Technologies. 2000.
Karen Spärck Jones, "Some Points in a Time," Computational Linguistics, Vol. 31, No. 1, 2005.
D. Hiemstra, "Information Retrieval Models," In: A. Goker, J. Davies, and M. Graham (eds.), Information Retrieval: Searching in the 21st Century, Wiley, 2009.
M. Steyvers, T. Griffiths,  "Probabilistic Topic Models," In T. K. Landauer, D. S. McNamara, S. Dennis, W. Kintsch (eds.). Handbook of Latent Semantic Analysis, Mahwah NJ: Lawrence Erlbaum, 2007.
X. Yi, J. Allan,  "A Comparative Study of Utilizing Topic Models for Information Retrieval," in the Proceedings of ECIR'09.
Nallapati, Discriminative Models for Information Retrieval, in the Proceedings of SIGIR 2004
T. Joachims and F. Radlinski, "Search Engines that Learn from Implicit Feedback," IEEE Computer, 40(8), pp. 34-40, 2007.
B. Chen, H.M. Wang, L.S. Lee, “A discriminative HMM/N-gram-based retrieval approach for Mandarin spoken documents,” ACM Transactions on Asian Language Information Processing, Vol. 3, No. 2, pp. 128-145, June 2004.


