Á¤º¸°Ë»ö ¹× ½Ç½À
Á¤º¸°Ë»ö °ÀÇ
- 0. °Àǰèȹ
- ±³Àç: An Introduction to Information Retrieval, Christopher D. Manning, et al., online edition 2009 Cambridge UP
- ¼ö¾÷¿î¿µ¹æ½Ä: °ÀÇ ¹× ½Ç½À
- ¼ºÀûÆò°¡¹æ½Ä: Áß°£°í»ç 35% + ±â¸»°í»ç 35% + ½Ç½À/°úÁ¦/Ãâ¼® 30%
- 0-1. ÆÄÀ̽㠼Ұ³ [PDF]
- 0-2. ÆÄÀ̽ã ÀÚ·áÇü/º¯¼ö [PDF]
- 0-3. ÆÄÀ̽ã Á¦¾î¹® [PDF]
- 0-4. ÆÄÀ̽ã ÇÔ¼ö [PDF]
- 1. Chapter 1. Boolean Retrieval [PDF]
- 2. Chapter 2. The term vocabulary and postings lists [PDF]
- 3. Chapter 3. Dictionaries and tolerant retrieval [PDF]
- 4. Chapter 6-1. Scoring, Term Weighting [PDF]
- 5. Chapter 6-2. The vector space model [PDF]
- 6. Chapter 8. Evaluation in Information Retrieval [PDF]
- 7. Chapter 9. Relevance Feedback [PDF]
- 8. Chapter 11. Probabilistic Information Retrieval [PDF]
- 9. Chapter 12. Language Models for IR [PDF]
- 10. Chapter 21. Link Analysis [PDF]
- 11. Chapter 13. Text Classification and Naive Bayes [PDF]
Á¤º¸°Ë»ö ½Ç½À
Á¤º¸°Ë»ö ÇÁ·ÎÁ§Æ® - PythonÀ¸·Î Á¤º¸°Ë»ö±â ±¸Çö
- ´ë»ó¾ð¾î ¹× ¹®¼¼Â: ¿µ¾î TREC AP88 ¹®¼¼Â (242MB)
- Á¤º¸°Ë»ö ¸ðµ¨: tf-idf weighting
- Porter stemmer Ãß°¡ (Porter stemmer ¸ðµâÀº ÁÖ¾îÁü) ¹× Sed/Awk ´ë½Å PythonÀ» ÀÌ¿ëÇÑ Posting list »ý¼º ¸ðµâ ±¸Çö
- PythonÀÇ dictionary ´ë½Å DB·Î ±¸Çö
- Evaluation Æ÷¸Ë Ãâ·Â ¹× À̸¦ ÀÌ¿ëÇÑ ¼º´É ÃøÁ¤ (¼º´É ÃøÁ¤ ÇÁ·Î±×·¥Àº ÁÖ¾îÁü)
- Ãß°¡ ¿É¼Ç (±¸ÇöÀ» Çϸé Ãß°¡ Á¡¼ö ÁÖ¾îÁü)
- Á¤º¸°Ë»ö ¸ðµ¨ Ãß°¡: Vector space model (´Ù¾çÇÑ ¿É¼Ç ±¸Çö: lnc.ltc µî), BM25, Language model for IR
- PythonÀ¸·Î CGI ±¸Çö (ÀÎÅÍ³Ý ºê¶ó¿ìÀú¿¡¼ Á¤º¸°Ë»ö±â ½ÇÇà °¡´É Çϵµ·Ï)
- Çѱ¹¾î Á¤º¸°Ë»ö±â ±¸Çö (À½Àý bigram model ÀÌ¿ë)