Seminar


Advisor: Prof. 張嘉惠
Students
Group  Members
1      方淑芬, 呂理維
2      楊兵河, 黃茁淳
3      吳彥欽, 呂紹誠
4      李火山, 張祜嘉
5      許涵雲, 陳修平
6      倪家祥, 潘燕弘
7      蕭宏文, 蔡任及
Resources

ACM Special Interest Group in Information Retrieval Homepage

The ACM SIGMOD Anthology / DBLP:Search

Ahoy! The Homepage Finder v3.0 (finds paper authors' homepages)

Database Lab DBLP Bibliography


Papers Presented

Subject / Author

Web Document Clustering: A Feasibility Demonstration

Oren Zamir
Oren Etzioni

Fast and Intuitive Clustering of Web Documents

Oren Zamir
Oren Etzioni

Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text

Soumen Chakrabarti
Byron Dom
Prabhakar Raghavan
Sridhar Rajagopalan
David Gibson
Jon Kleinberg

Syntactic Clustering of the Web

Andrei Z. Broder

New Indices for Text: Pat Trees & Pat Arrays

Gaston H. Gonnet
Ricardo A. Baeza-Yates
Tim Snider

Generating Finite-State Transducers For Semi-Structured Data Extraction From The Web

許鈞南

A Statistical Method for Estimating the Usefulness of Text Databases

Clement T. Yu

Semantic Caching via Query Matching for Web Sources

Dongwon Lee
Wesley W. Chu

Efficient Data Mining for Path Traversal Patterns

Ming-Syan Chen
Jong Soo Park
Philip S. Yu

Learning to Remove Internet Advertisements

Nicholas Kushmerick

MailCat: An Intelligent Assistant for Organizing E-Mail

Richard B. Segal
Jeffrey O. Kephart

A Personal News Agent that Talks, Learns and Explains

Daniel Billsus
Michael J. Pazzani

WebQuery: Searching and Visualizing the Web through Connectivity

Rick Kazman
Jeromy Carriere

Regression Testing for Wrapper Maintenance

Nicholas Kushmerick

 


Group 1

Web Document Clustering: A Feasibility Demonstration &
Fast and Intuitive Clustering of Web Documents

Speaker: 呂理維 / 方淑芬 (10/05)

  • Authors: Oren Zamir (graduate student) & Oren Etzioni (Associate Professor),
    Dept. of Computer Science & Engineering, University of Washington
  • Oren Etzioni's current research:
    The Intelligent WebWare project investigates novel methods,
    inspired by the fields of Artificial Intelligence and Information Retrieval,
    for making the Web easier to navigate.

Slides

 

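Group 2

Automatic Resource Compilation
by Analyzing Hyperlink Structure and Associated Text

Speaker: 楊兵河 (10/12)

Related sites

Slides

Syntactic Clustering of the Web

Speaker: 黃茁淳 (10/12)

  • Author: Andrei Z. Broder, a member of the Compaq Systems Research Center (SRC)
  • SRC's main research areas are:
    • Scalable Systems
    • Human-centered Interaction
    • Internet
  • The author's research is highly varied and covers a wide range of topics, with 43 papers in total published in various journals.
    In the last two years he has published five or six Web-related papers.

Slides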

 

Group 3

New Indices for Text: Pat Trees & Pat Arrays

Speaker: 吳彥欽 (10/19)

  • Authors: Gaston H. Gonnet, Ricardo A. Baeza-Yates, Tim Snider
  • Gaston H. Gonnet: Professor, ETH Zurich, Switzerland, Informatik, Institute for Scientific Computation

Slides

Generating Finite-State Transducers For Semi-Structured
Data Extraction From The Web

Speaker: 呂紹誠 (10/26)

  • Author: 許鈞南
    • Current positions: Researcher, Institute of Information Science, Academia Sinica; Professor, National Chiao Tung University
    • Research:
      • Internet Information Agents
      • Information Integration & Mediation
      • Semantic Query Optimization (SQO)
      • Learning Semantic Rules for SQO
      • Robustness of Knowledge
      • Information Extraction from Web Pages
      • Feature Selection for Neural Networks
  • Published in 1998 by Elsevier Science Ltd.

Slides

 

Group 4

A Statistical Method for
Estimating the Usefulness of Text Databases

Speaker: 李火山 (11/01)

  • Author: Clement T. Yu
    Professor (EECS)
    Education:
    Ph.D., Computer Science, Cornell University, 1973
    M.S., Computer Science
    B.S., Applied Mathematics, Columbia University, New York, 1970

Slides

7

Speaker: 張祜嘉 (11/09)

  • Authors:

Slides

 

Group 5

Semantic Caching via Query Matching for Web Sources

Speaker: 許涵雲 / 陳修平 (11/16)

  • Authors: Dongwon Lee & Wesley W. Chu
  • Dongwon Lee:
    Department of Computer Science, University of California, Los Angeles,
    Los Angeles, CA 90095, USA
    Email: {dongwon,wwc}@cs.ucla.edu
  • Related sites:

Slides 1 / Slides 2

 

Group 6

Efficient Data Mining for Path Traversal Patterns

Speaker: 倪家祥 (11/23)

  • Authors:
  • Ming-Syan Chen: Electrical Engineering Department, National Taiwan University,
    Taipei, Taiwan, ROC
  • Jong Soo Park: Department of Computer Science,
    Sungshin Women's University, Seoul, Korea
  • Philip S. Yu: IBM T.J. Watson Research Center, P.O. Box 704, Yorktown,
    NY 10598, U.S.A.

Slides

10

Speaker: 潘燕弘 (11/30)

  • Authors:

Slides

 

Group 7

Learning to Remove Internet Advertisements

Speaker: 蕭宏文 (11/30)

  • Author: Nicholas Kushmerick
    Department of Computer Science
    University College Dublin

Slides

12

Speaker: 蔡仁及 (11/30)

  • Authors:

Slides

 

Group 1

MailCat:
An Intelligent Assistant for Organizing E-Mail

Speaker: 方淑芬 (12/07)

Slides

A Personal News Agent that
Talks, Learns and Explains

Speaker: 呂理維 (12/07)

Slides

 

Group 2

WebQuery:
Searching and Visualizing the Web through Connectivity

Speaker: 楊兵河 (12/14)

Slides

16

Speaker: 黃茁淳 (12/23)

 

Group 3

17

Speaker: 呂紹誠 (12/23)

Regression Testing for Wrapper Maintenance

Speaker: 吳彥欽 (12/28)

  • Author: Nicholas Kushmerick
    University College Dublin, Ireland

Slides

 

Group 4

 

Speaker: 李火山 (12/28)