/ プログラム/ 発表一覧/ 著者一覧/ 企業展示一覧/ jsai2010ホーム /

1D1-3 CGMからの自己教師あり学習と条件付確率場を用いた人間行動マイニング

06月09日(Wed) 09:00〜10:20 D会場(3F-会議室4)
1D1 Webマイニング「知識獲得」

演題番号1D1-3
題目CGMからの自己教師あり学習と条件付確率場を用いた人間行動マイニング
著者Nguyen MinhThe(電気通信大学大学院情報システム学研究科)
川村 隆浩(電気通信大学大学院情報システム学研究科)
中川 博之(電気通信大学大学院情報システム学研究科)
田原 康之(電気通信大学大学院情報システム学研究科)
大須賀 昭彦(電気通信大学大学院情報システム学研究科)
時間06月09日(Wed) 09:40〜10:00
概要In our definition, human activity can be expressed by five basic attributes: actor, action, object, time and location. The goal of this paper is to describe a method to automatically extract all of the basic attributes and the relationships (transition and cause) between activities
derived from sentences in Japanese CGM (consumer generated media). Previous work had some limitations, such as high setup cost, inability of extracting all attributes, limitation on the types of sentences that can be handled, and insufficient consideration of interdependency
among attributes. To resolve these problems, this paper proposes a novel approach that uses
conditional random fields and self-supervised learning. This approach treats the activity extraction as a sequence labeling problem, and has advantages such as domain-independence, scalability, and unnecessary hand-tagged data. Since it is unnecessary to fix the
positions and the number of the attributes in activity sentences, this approach can extract all attributes and relationships between activities by making only a single pass over its corpus. Additionally, by converting to simpler sentences, the approach can deal with complex
sentences retrieved from Japanese CGM. In an experiment, this approach achieves high precision (activity: 88.84%, attributes: over 90%, relationships: over 84%).
論文PDFファイル