Cross-Media Meta Search: Query Relaxation and Information

by user

on 28 марта 2017

Category: Documents

>> Downloads: 21

152

views

Report

Comments

Description

Download Cross-Media Meta Search: Query Relaxation and Information

Transcript

Cross-Media Meta Search: Query Relaxation and Information

Master Thesis
Cross-Media Meta Search:
Query Relaxation and Information
Integration for Heterogeneous-Media
Search Engines
Supervisor
Professor Katsumi TANAKA
Department of Social Informatics
Graduate School of Informatics
Kyoto University
Akihiro KUWABARA
February 9, 2004
i
Cross-Media Meta Search:
Query Relaxation and Information Integration for
Heterogeneous-Media Search Engines
Akihiro KUWABARA
Abstract
In recent years, the quantity of multimedia contents on WWW has been in-
creasing with improvement in the speed of transmission speed, and the spread of
digital cameras. Since the contents are distributed on a lot of websites and expressed by various media, it is important to construct a function which searches
the contents eﬀectively.
Conventional meta search engines retrieve the results from several search en-
gines and automatically classify them. The meta search engines dispatch queries
to same type search engines, and retrieve mono-type contents. Moreover, only
the link to a Web page is shown in ranking form as a uniﬁed reference result.
However, in such conventional meta search engines, the information on various
media is simultaneously uncollectible. If a user follows the link of a reference
result and peruses two or more Web pages, he can acquire eﬀective information
at last.
In this paper, in order to solve this problem, we propose a system which dis-
patch queries to adequate search engines based on the media type. Moreover,
we propose the system which extracts only the information in relation to the
reference keyword group from the Web page obtained as a reference result, and
uniﬁes these automatically.
The information over various media relevant to the reference keywords are
searched and uniﬁed. It can be called output like an encyclopedia using Web
page. Users can only input reference keywords and can peruse easily various
information currently distributed on Web about the keyword. Users can eﬀort
of perusing the Web page of a link place sequentially from the result of higher
rank and discovering eﬀective information from Web pages. And also users can
peruse the information over reference keywords now at a glance eﬀectively so
that it may say that this language is such meanings.
In this system, the case where are the reference question by two or more
ii
keywords is inputted by the user is considered. The search engine used here
corresponds to various media. That is, it is the search engine with which the
types of a text search, a image search, a music search, etc. We focus on a text
search and a image search. we propose the method of transforming a reference
question and collecting Web pages eﬃciently.
A text search has many hits of a solution, but it is diﬃcult to acquire eﬀective
information. A image search has high precision and can obtain the picture of
a reference keyword simply easily, it has very few hits of a solution. In order
to solve this, we use query relaxation. The query relaxation method assigns
whether it is used for text search, or it is used for image search for every word
of a reference question. And each search engine performs separately and the
common Web page of a reference result becomes the answer of reference. By
using this query relaxation, it becomes possible to utilize eﬀectively taking advantage of the strong point of the search engine of two diﬀerent media. That
is, it becomes possible to raise recall, maintaining precision.
Next, only the portion in relation to the reference keyword is extracted from
the inside of a Web page set of the solution collected using query relaxation
method. A related portion is extracted in consideration of frequency of the
word which appears in the Web page group of a search result, and position
relation between the picture of image search and a paragraph. Thus, it enables
a user to peruse eﬀectively by unifying only the important portion of a Web
page instead of the Web page itself.
By cross-media meta search, we think that users become possible to peruse
the suitable information about the inputted reference keywords simply and to
search multimedia contents easily.
iii
クロスメディア・メタサーチ
異メディア・サーチエンジンに対する質問緩和と情報統合
桑原昭裕
内容梗概
近年，通信の高速化やデジタルカメラの普及に伴い，WWW によって取得可
能なコンテンツはその量や種類が増大している．ある事象に関する情報を検索
する場合，情報が大量の Web サイトに分散し，かつ，種々のメディアによって
表現されているため，ユーザに有益な情報だけを収集してくることは非常に困
難になってきている．よって，ユーザが有効な情報を検索する際は，より効果
的な情報を大量のコンテンツの中から選択し，様々な情報を統合する機能が重
要である．
情報を効果的に検索し統合する手段として，メタサーチエンジンが挙げられ
る．従来の WWW のメタサーチエンジンは，各々のサーチエンジンが有する
インデックス情報をもとにキーワード検索した Web ページについて，重複の除
去・自動分類などを行い，検索結果を表示するものであった．従来のメタサー
チエンジンでは，利用する各々のサーチエンジンは同一タイプのものだけであ
り，および統合した検索結果として Web ページへのリンクだけがランキング形
式で示される．しかし，このような従来のメタサーチでは，様々なタイプのメ
ディアの情報を同時に収集することはできない．また検索結果のリンクをたど
り，複数の Web ページを巡回しなくては，有効な情報を得ることはできない.
この問題を解決するために，本研究では，ユーザが入力した質問キーワード
群を変換して，テキスト検索エンジンや画像検索エンジンなどの多様なメディ
ア向けの検索エンジンに対して検索処理を行う．また，検索結果として得られ
た Web ページから検索キーワード群に関連している情報だけを抽出し，自動的
に統合する方式を提案する．
提案する方式では，百科事典を引くように検索キーワードを入力するだけで
そのキーワードについての Web 上に分散している様々な情報を容易に閲覧する
ことを目標としている．本方式を用いることで，従来のようにリンク先の Web
ページを上位の結果から順に閲覧し，様々な内容を含んでいる Web ページから
有効な情報を探し出すという手間を省くことが可能になり，検索キーワードの
多義的な意味を取得することが可能となり，効果的な閲覧が可能になる．
iv
本研究では，複数のキーワードによる And 型検索質問がユーザによって入力
された場合を考える．また，ここで利用するサーチエンジンは複数のメディア
に対応するものを想定している．すなわち，テキスト検索や画像検索や音楽検
索などのタイプの異なるサーチエンジンである．その中でもテキスト検索と画
像検索に焦点を当て，検索質問を変換させ Web ページを効率よく収集してくる
方法を提案する.
テキスト検索はヒット件数が多いが，有効な情報が少なくなってしまうとい
う特徴を持っており，画像検索は精度が高く，検索キーワードの画像を簡単に
得ることがでるが，ヒット件数が非常に少ないという特徴を持っている. これ
を解決するために，質問緩和法を利用する. 質問緩和法とは，検索質問の単語
ごとに，テキスト検索に使用するか，画像検索に使用するかを割り当てるもの
である. そして，それぞれの検索を別々に行い，共通の解の Web ページを検索
の解とするものである. この質問緩和法を利用することによって，二つの異な
るメディアの検索エンジンの長所を活かし，有効に活用することが可能になる.
つまり，精度を保ちつつ，再現率を高めることが可能になる.
次に，質問緩和法を利用し，収集してきた解の Web ページ集合内から，検索
キーワードに関連している部分だけを抽出する．これは，検索結果の Web ペー
ジ群に出現する単語の頻度と，画像検索の検索結果の画像と文章との位置関係
を考慮し，関連部分を抽出するものである. このように Web ページそのもので
はなく，Web ページの重要な部分だけを統合することで，ユーザは効果的に閲
覧することが可能になるのである.
本方式によって，ユーザは入力した検索キーワードについて適切な情報を簡
便に閲覧することが可能となり，容易にマルチメディアコンテンツの検索が可
能となると考えられる．
Cross-Media Meta Search:
Query Relaxation and Information Integration for
Heterogeneous-Media Search Engines
Contents
Chapter 1
Introduction
1
Chapter 2
Cross-Media Meta Search
5
Chapter 3
Query Relaxation
8
3.1
Basic concept . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.2
Query Relaxation method . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.3
Answers by Query Relaxation . . . . . . . . . . . . . . . . . . . . . . . . 12
3.4
Example of Query Relaxation . . . . . . . . . . . . . . . . . . . . . . . . 14
3.5
The degree of Query Relaxation . . . . . . . . . . . . . . . . . . . . . . . 15
3.6
Merit of Query Relaxation . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Chapter 4
Experiments and Evaluation
8
17
4.1
The outline of experiment . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
4.2
Experiment results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
4.3
Consideration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Chapter 5
5.1
Improvement of Query relaxation
23
Method of improvement . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
5.1.1
Execution order of subset . . . . . . . . . . . . . . . . . . . . . 23
5.1.2
Breadth-ﬁrst search and Depth-ﬁrst search . . . . . . . . . 24
5.1.3
The statistical technique . . . . . . . . . . . . . . . . . . . . . . 25
5.1.4
The linguistic technique . . . . . . . . . . . . . . . . . . . . . . 28
5.1.5
The extended proposal of the query relaxation . . . . . . . 30
Chapter 6
Information Integration
32
6.1
Extract information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
6.2
Copyright problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
6.3
Searching based on Web page . . . . . . . . . . . . . . . . . . . . . . . . 34
6.4
Informational arrangement . . . . . . . . . . . . . . . . . . . . . . . . . . 35
Chapter 7
Prototype System
37
7.1
Implement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
7.2
Implemeting display . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
Chapter 8
Related work
41
Chapter 9
Conclusion
45
Acknowledgments
47
References
48
Chapter 1
Introduction
Since the Internet environment has spread increasingly, the number of Web
pages is increasing steadily. The quantity of multimedia contents on WWW has
been increasing by the spread of a broadband, digital cameras, etc. Thus, since
Web space is ﬂooded with various information, it is becoming very diﬃcult for
users to collect only useful information.
It is the search engine which a user uses for looking for a Web page needed
from a huge Web page. The conventional search engine shows the link to the
Web page of a reference result by inputting a reference question. However, there
is a limit in the amount of information which each search engine has, and since
it is decided like the text search engine and the image search engine what media
are searched for every search engine, there is a limitation also in the kind of
information which can be searched.
It is the meta search which is compensated with the ability not to do with one
search engine. A meta search increases the amount of information by using two
or more search engines collectively. Moreover meta search engine is used as a
means to search information eﬀectively and to unify it. In the conventional meta
search engine, if a user inputs a reference keyword group and performs reference,
user input keywords are passed to each search engine, each search engine carries
out retrieval separately, and collects Web pages. A meta search engine, to the
Web page which each search engine collected, removes duplication, operates a
classiﬁcation automatically, and outputs a reference result. But, meta search
engines have three problems.
1. The meta search engines uses same type search engines which search Web
pages based on text style.
In the existing meta search, only the text search engine is used mostly.
Thereby, since only the text document in a Web page is taken into consideration, it is thought on the Web page which has the present various media
that suﬃcient reference cannot be performed.
1
2. the same reference question is performed to every search engine which the
meta search engine uses.
When there were many reference keywords, or when unrelated for every
reference keyword, a reference result which a user desires does not come
out. In order to solve this problem, the given keyword is not used as it is,
but it is considered that it is necessary to make it change into a certain
form.
3. Finally, as uniﬁed reference results show the links to Web pages, users have
to browse one by one.
In almost all search engines, the link to a Web page is displayed as a
reference result. Therefore, when a user peruses the Web page of a reference
result, a user is very troublesome, in order to have to repeat operation of
perusing the Web page of each of reference results until it discovers the
Web page which has started the contents which can be judged to be useful
information. Moreover, since various contents are described in one Web
page, a user cannot collect only useful information eﬃciently.
Thus, it is not easy for users to acquire useful information in the conventional
search engines. It is important that users can acquire various information, such
as text, and pictures, to having inputted the reference keyword into one search
engine, and users can easily ﬁnd useful information. Now, the search engine is
highly eﬃcient that it is improved rapidly and is easy to use rapidly. I think
that meta search of using them well collectively is a very eﬀective means. If a
meta search which solves a problem which was described above exists, a user
will be considered that it can collect information very eﬃciently.
Then, we propose the Cross-media meta search. We deﬁne the Cross-media
meta search as being collecting reference results using a search engine of a
diﬀerent kind. By using a search engine of a diﬀerent kind simultaneously, the
problem that only the text search used mostly is solvable. Moreover, I think that
the Cross-media meta search needs to transform a reference question in order
to use a search engine of a diﬀerent kind. I think that informational processing
2
and informational integration are required, in order to show the reference result
of each media eﬃciently.
Information integration
Search result
Input keywords
The system
Return answer
Change keywords
Search engine
Collect Web pages
Web space
Figure 1: Conventional Meta Search
3
Fig.1 shows the whole system image. The system has the following func-
tions.
• the system uses search engines of a diﬀerent kind.
• the system changes user input keywords.
• the system integrations information which each search engine collects
The remainder of this paper is organized as follows: Section 2 explains
Cross-Media Meta search, Section 3 explains Query Relaxation, Section 4 discusses our experiments, Section 5 discusses improvement of query relaxation,
Section 6 explains Information Integration, Section 7 explains the prototype
system, Section 8 discusses related works, and we conclude in Section 9.
4
Chapter 2
Cross-Media Meta Search
We think there is much information on Web space. If we can use Web well,
we can receive a lot of beneﬁt. Especially, it is very eﬀective that users get only
the information which users want. Then, we think that Web can be expressed
like an encyclopedia. When we can only look up a word in an encyclopedia,
we can know the meaning, a photograph, a related word of the word. We put
this in for Web retrieval. Thus, users only inputted the reference keyword and
get various information. We use a meta search in order to realize this. Each of
conventional search engines cannot cover the huge information on Web space.
And, since each of search engines has a ﬁeld, such as text search and image
search, users cannot get the information on various forms in one search engine.
By using meta search engines, we can increase the amount of information and
can also receive several kinds of media.
In this paper, we propose Cross-media meta search. The Cross-media meta
search engine intuitively collects several types of information from the Web
based on user input keyword queries. The diﬀerences between conventional
Web meta-search engines and Cross-media meta search engines are shown in
Figs.2 and 3 are summarized as follows:
• Conventional meta search
Conventional Web meta search engines send the user input keyword query
Q (possibly with minor changes) directly to several search engines. Modiﬁcations are not made to a user-input keyword queries. To retrieve the
results conventional meta-search engines return a list of pertinent URLs
with duplicates removed and with ranking scores.
• Cross-Media meta search
The Cross-media meta search engine is designed to collect not only Web
pages, but also mixed types of multimedia contents (images, sounds etc.)
by sending, query Q to several search engines, each of which is dedicated
to a speciﬁc type of media content. On sending query Q, the search engine
5
may modify and/or relax the term into Q1, . . . , Qn , according to the characteristics of each media type. The output of the Cross-media meta search
engine is not simply a list of URLs, but a mixture of texts, images, and
sounds, edited like an encyclopedia.
Search engine E1
Text search
result
http://www....
result
Search engine E2
Text search
Q
Intersection
http://www....
...
Search engine Em
Text search
result
List of URL
Figure 2: Conventional Meta Search
Q1
Q
Q2
..
.
Qn
Search engine E1
Text search
Web
pages
I
Search engine E2
Image search
Web
pages
...
I
Search engine Em
Music, etc… search
Web
pages
Intersection
Web pages
of result
Information
Integration
Figure 3: Cross-Media Meta Search
6
There are two points where Cross-media meta search is characterized.
The ﬁrst is that Cross-media meta search uses the search engine of the type
of various media in order to collect various information at once. We think it
important that a system changes a user’s question into the form which was
adapted for each search engine without using as it is, in order to use the search
engine of diﬀerent media eﬃciently. Then, we propose the method of query
relaxation.
The second is that Cross-media meta search extracts one portion out of a
Web page, integrates those information and displays on users. Because it is bad
eﬃciency that a system displays URLs of Web pages collected with each search
engine. It is thought that the information which a user needs is a part in the
Web page of the collected result. Then, we propose the method of information
integration of Web pages collected of each search engines.
7
Chapter 3
Query Relaxation
In this chapter, we describe Query Relaxation of Cross-media meta search.
In the conventional meta search, the user’s input query is used for each search
engine as it is. However, in Cross-Media meta search, each search engine is used
by using the Query Relaxation method. So, in Cross-Media meta search, each
engine can be used eﬃciently.
The method of Query Relaxation is as follow.
1. User input query is divided into a subset.
2. Each subset is used for each search engine.
3. The reference results of each search engine are collected.
4. The common Web page of each reference result is result of Cross-Media
meta search.
The method of Query Relaxation is described in detail below.
3.1
Basic concept
We assume that users wish to have multimedia content (texts, images, sounds,
etc.) that is related to their keywords K1 ,K2 ,. . . ,Kn (n≥2). Conventionally, if
a user wants the text about these keywords, he will use a text search and if a
user wants the picture about these keywords, he will use a image search. These
actions are usual. However, since it is a thing relevant to these keywords, if
a text and a picture can be searched simultaneously and are able to peruse a
reference result simultaneously, we think that a user can peruse information
very eﬃciently. Then, our Cross-media meta search engine uses various search
engines. (namely, E1,E2 ,. . . ,Em (m≥2)). For example, E1 is Google[5], E2 is
AltaVista[6], E3 is Google image search[7], and so on. Thus, the key feature of
our system is that it can use a mixture of search engines with diﬀerent media
types.
We describe the search engines which Cross-media meta search uses.
• A text search engine is typiﬁed by Google and AltaVista.
By inputting a keyword, these search engines outputs a reference result
Web pages which contains all keywords that user inputs from the Web page
8
in the database which crawler collected. Many search engines correspond
to this and are called so-called robot type search engine. This robot type
search engine is putting many pages into the database, and attaches the
index. Since it will search out of a lot of pages if it searches, there is very
much hit number of cases. Moreover, by using algorithm called an original
page rank by each search engine, the optimal solution comes to a higher
rank. However, since there is much hit number of cases, there is a fault
that there will be many noises or a page which is unrelated and which does
not have validity will hit mostly.
• An image search engine is typiﬁed by Google image search.
When keywords with which a user is related to a picture needed is in-
putted, a system judges contents, by analyzing the text which adjoins a
picture, the caption of a picture, and many other factors. Moreover, duplication is eliminated using advanced algorithm and the picture of the
highest quality is displayed ﬁrst. Thus, an image search engine obtains a
picture in inputting a text. It is very eﬃcient for collecting pictures so as to
extract a picture, Since the image search engine is looking at only a picture
and its very near, its relevance for every keyword is high, and its relevance
of a keyword and a picture is high. However, the hit number of cases may
decrease very much, and a reference result may become zero.
• an animation and music search engine is typiﬁed by Naver.
This is the search engine which specialized in the animation ﬁle on the
Internet. The animation ﬁle of various form, such as rm, asf, mpg, mov,
and swf, can be searched. an animation and music search engine obtains
these ﬁles in inputting a text and displays these ﬁle in link form there is a
problem that we must visit each link and check ﬁles.
Fig.4, 5, 6 shows the search results of each search engine.
In this paper, We focus on two search engines, the text search engine and
image search engine considered to be eﬀective for acquiring information most.
9
A picture of image search
Title and Link
A part of text of Web page
Link of Web page
Image search engine
Text search engine
Figure 4: text search
Figure 5: image search
Link of movie or music file
Link of Web page
Type of file and file size
Movie and music search engine
Figure 6: each search engine
We describe how Cross-media meta search is realized using these two search
engines.
3.2
Query Relaxation method
Most users do not know which keywords are most appropriate for each search
engine. Therefore, we allow users to input their query in the form of a conjunctive query Q = K1 ∧K2 ∧. . . ∧Kn . Unfortunately, when we use an image search
engine, if the number of keywords exceeds 3, it is diﬃcult to obtain any images
for the query. but we cannot get eﬃcient reference result in most case. That is
query which consists of three or more words is severe as conditions for reference.
To solve the problem, we propose relaxing conditions method of queries to receive adequate results by sharing the keywords for some kinds of search engines.
10
For example, when three keywords, ”Mt. Fuji”, ”snow” and ”sunset”, are inputted Google image search engine, a referent result is nothing. But, inside
of a referent result when two keywords, ”Mt. Fuji” and ”snow”, are inputted
Google image search engine, four Web pages contain the text, ”sunset”. In this
case, we can receive the pages which are related to three keywords by relaxing
conditions.
Let Ans (Q) be the set of answers for a user-input conjunctive keyword query
Q. In our query relaxation approach, the conjunctive query Q is divided into a
set of tuples bound by sub-query:
φ，
｛K1 , . . . , Kn ｝
｛K1 ｝,｛K2 , . . . , Kn ｝
｛K2 ｝,｛K1 , K3 . . . , Kn ｝
.
..
｛K1 , K2 ｝,｛K3 , . . . , Kn ｝
｛K1 , K3 ｝,｛K2 , K4 . . . , Kn ｝
.
..
｛K1 , . . . , Kn−1 ｝,｛Kn ｝
｛K1 , . . . , Kn ｝，φ
In this case, the query is divided into two separate sub-queries, because two
kinds of search engines are required for the query. Generally, the number of
sub-queries is dependent on the number of search engines. The sub-query for
each set of tuples is translated for the image search engine. For example, the
ﬁrst tuple (i.e. [ φ , K1 ,. . . ,Kn ]) shows that there are no keywords for the text
search engine, and K1 ,K2 ,. . . ,Kn are the input keywords for the image search
engine. In general, the larger the number of keywords, the more diﬃcult the
image search, as shown by the latter sub-query. Therefore, more images can be
obtained by the latter half of the tuple, because there are fewer keywords in the
latter sub-query. Fig.7 shows two separate sub-queries.
11
Ǿ, {K 1‫ޓ‬
, K 2‫ޓ‬
,‫ޓ‬
...‫ޓ‬
, Kn}‫ޓ‬
{K 1}, {K 2 ‫ޓ‬
,‫ޓ‬
...‫ޓ‬
, Kn}
{‫ޓ‬
K 1‫ޓ‬
, K 2‫ޓ‬
}, {K 3 ‫ޓ‬
, K 4‫ޓ‬
,‫ޓ‬
...‫ޓ‬
, Kn}
䊶䊶䊶
䊶䊶䊶
䊶䊶䊶
{Kn}, {K 1‫ޓ‬
,‫ޓ‬
...‫ޓ‬
, Kn − 1}
{‫ޓ‬
Kn − 1‫ޓ‬
, Kn‫ޓ‬
}, {K 1, K 2 ‫ޓ‬
,‫ޓ‬
...‫ޓ‬
, Kn − 2}
䊶䊶䊶
䊶䊶䊶
{K 1‫ޓ‬
, K 2‫ޓ‬
,‫ޓ‬
...‫ޓ‬
, Kn}‫ޓ‬
,Ǿ
The former element (blue) is used for text search.
The latter element (pink) is used for image search.
Figure 7: sub-queries
3.3
Answers by Query Relaxation
Answers for query Q are retrieved as unions of all the sub-query tuples:
Ans(Q) = Ans(K1 ∧ . . . ∧ Kn , E2 )
∪ (Ans(K1 , E1 )
∩ Ans(K2 ∧ . . . ∧ Kn , E2 ))
∪ (Ans(K2 , E1 )
∩ Ans(K1 ∧ K3 ∧ . . . ∧ Kn , E2))
∪ ...
∪ (Ans(K1 ∧ K2 , E1)
∩ Ans(K3 ∧ . . . ∧ Kn , E2 ))
∪ (Ans(K1 ∧ K3 , E1)
∩ Ans(K2 ∧ K4 ∧ . . . ∧ Kn , E2))
∪ ...
∪ Ans(K1 ∧ . . . ∧ Kn , E1 )
12
where Ans(Q, Ei) means the answers of the query Q by the search engine i.
In this case, E1 is a text search engine and E2 is an image search engine. The
engines return URLs which match the queries.
To compute (Ans(K1 ∧ K2 , E1 ) ∩ Ans(K3 ∧ . . . ∧ Kn , E2)), Ans(K1 ∧ K2 , E1 )
and Ans(K3 ∧ . . . ∧ Kn , E2) are processed separately. Then the intersection of
the answers is calculated. The answers are retrieved in the same way for all of
the tuples. Finally, Ans(Q) can be retrieved as unions of all the answers.
{‫ޓ‬
K 1‫ޓ‬
, K 2 },
‫{ޓ‬K 3 ‫ޓ‬
,‫ޓ‬
...‫ޓ‬
, Kn}
K1 ∧ K 2
K 3 ∧ ... ∧ K n
input
input
text
search engine
Image
search engine
output
output
Common
web pages
Web pages of results
of a text search
Web pages contained picture
of results of an image search
Web pages of results
by Query relaxation
Figure 8: Answers by Query Relaxation
13
3.4
Example of Query Relaxation
We would now like to demonstrate the results obtained using the query re-
laxation approach with the following query: Q = q1 ∧ q2 , where q1 is ”Mt. Fuji”
and q2 is ”snow”. Fig. 9 shows venn diagrams of the results of query Q. The left
diagram shows the results from a conventional meta-search engine. The right
diagram shows the results using the query relaxation approach. In the left diagram, the hatched areas show the results for Ans(q1 ∧ q2, SEtext ), and the right
hatched areas show the results for Ans(q1 ∧ q2, SEimg ), where SEtext is the text
search engine and SEimg is an image search engine. The top hatched areas on
the right diagram shows the results for (Ans(q1, SEtext ) ∩ Ans(q2, SEimg )). The
bottom hatched areas shows the results for (Ans(q2, SEtext ) ∩ Ans(q1, SEimg )).
Therefore, using proposed query relaxation, the possibility for obtaining pertinent results is increased.
Conventional results
Query relaxation results
Figure 9: Venn diagram of results of results of query Q
Thus, in addition to results of the conventional meta search, the common
Web pages of text search and image search become results by using this technique. That is, the Web pages of Fig.10 are also collected.
14
Text 䇸snow䇹
Image 䇸Mt. Fuji䇹
Figure 10: the sample pages which the system collects
3.5
The degree of Query Relaxation
We deﬁne the degree of relaxation as the number of keywords in a user input
query Q that are actually used by a Web text search engine. Why does it call
it the degree of Query Relaxation? Because, text search engine has more hit
number of cases than image search engine. This shows that the way of image
search engine is severe. Since using text search engine has relaxed search, it
makes it the degree of relaxation for how many keywords to have used in text
search engine. That is, when three of the keywords are used for an image search
engine, the degree of relaxation is considered to be zero. When two keywords
are used for an image search engine and the third keyword is used for a text
search engine, the degree of relaxation is considered to be one. We compare the
results of 0 of relaxation, to 1 of relaxation, and 2 of relaxation respectively.
Fig.11 shows the example putted concrete keywords.
3.6
Merit of Query Relaxation
A user wants to peruse only eﬀective information. The index of the search
engine from which a type is diﬀerent can be used. The Web pages which have
15
0 of relaxation
Ǿ, {Mt.Fuji ∧ snow ∧ sunset‫ޓ‬
}
Using text search
Using image search
1 of relaxation
Mt .Fuji ∧ snow ‫ޓ‬
}{snow ‫ޓ‬
}, {Mt .Fuji ∧ sunset }
{Mt.Fuji}, {sunset ∧ snow} {sunset }, {‫ޓ‬
2 of relaxation
{Mt.Fuji ∧ snow}, {sunset‫ޓ‬
} {Mt .Fuji ∧ sunset }, {snow} {snow ∧ sunset }, {Mt .Fuji}
3 of relaxation
{Mt .Fuji ∧ snow ∧ sunset },Ǿ
Figure 11: Lattice of query Q (Q = Mt. Fuji ∧ snow ∧ sunset)
not been collected until now are also collected taking advantage of the diversity
of media. Moreover, if there are many questions inputted in order to extract
conditions, a reference result will not come out, If there are few questions in
order to search broadly, it will increase to an excessive page to a reference result.
It is diﬃcult to adjust this well. Since an image search outputs the picture a
user wants as a search result, it is not necessary to carry out text search, and it
does not need to look for a picture out of Web pages. Therefore, we think that
exact comprehensive and information will be collectable, because it uses mixing
the picture reference with this high accuracy, and the text reference which can
search many Web pages. We think that we create an encyclopedia using Web
by using this technique.
16
Chapter 4
4.1
Experiments and Evaluation
The outline of experiment
By using the method of Chapter 3, it is proved what result is obtained
actually. We investigated the hit number of pages and the eﬀective number
of pages of search results at the time of inputting various keywords in the
experiment. The keyword used for an experiment chooses what had become the
title of the report of the news page which mainly exists from various ﬁelds.
We use Google for text search, and Google image for image search.
The user input keyword is set to three. This time, the reference results of the
degree 0 of relaxation, the degree 1 of relaxation, and the degree 2 of relaxation
are compared. Here, the result of the degree 1 of relaxation also includes the
result of the degree 0 of relaxation. The result of the degree 2 of relaxation
includes the result of the degree 0 of relaxation, and the degree 1 of relaxation.
However, since we want the result using image search, we don’t use the result
of the degree 3 of relaxation of text search only.
4.2
Experiment results
A part of experiment result is shown below．The keywords of text search
are inputted into a text search. The keywords of image search are inputted
into an image search. The hit number shows the common web pages of a text
search result and an image search result. The number of pertinent pages shows
the page which we judge to be described the contents of a reference keyword
appropriately in the hit pages. Fig.12, 13, 14 shows table about experiments.
Fig.15, 16 shows graph about recall and precision for every reference keyword.
Precision and recall are displayed for every degree of relaxation. Precision is
the average precision to the result of every degree of relaxation. Since the
whole solution set on Web is not understood, recall shall consider total of the
eﬀective page for every experiment as the whole solution set. Therefore, it takes
cautions that the recall at the time of the degree 2 of relaxation is 100%. So
this experiment is the evaluation relatively.
17
keywords
“soccer”㺢 ” Nakata”㺢 “team of Japan”
The degree of
relaxation
Keywords for
image search
0
“soccer” 㺢 “Nakata”
㺢“team of Japan”
1
2
Keywords for
text search
Number of
hits
Number of
Pertinent pages
19
17
“soccer”㺢
“Nakata”
“team of Japan”
32
22
“soccer”㺢
“team of Japan”
“Nakata”
63
15
“Nakata” 㺢
“team of Japan”
“soccer”
31
13
“soccer”
“Nakata” 㺢
“team of Japan”
48
6
“Nakata”
“soccer”㺢
“team of Japan”
40
26
“team of Japan”
“soccer”㺢
“Nakata”
13
0
keywords
“Kyoto”㺢 “Autumnal leaves”㺢 “Koudai-ji”
The degree
of relaxation
Keywords for image
search
0
“Kyoto” 㺢“Koudai-ji”
㺢“Autumnal leaves”
1
2
Keywords for text
search
Number of
hits
Number of
Pertinent pages
6
6
“Kyoto”㺢
“Autumnal leaves”
“Koudai-ji”
38
13
“Kyoto” 㺢“Koudai-ji”
“Autumnal leaves”
24
23
“Autumnal leaves”
㺢“Koudai-ji”
“Kyoto”
10
8
“Kyoto”
“Autumnal leaves”
㺢“Koudai-ji”
7
1
“Koudai-ji”
“Kyoto” 㺢
“Autumnal leaves”
6
2
“Autumnal leaves”
“Kyoto” 㺢“Koudai-ji”
54
11
keywords
“musical”㺢 “Shiki theatre company”㺢 “performance”
The degree
of relaxation
Keywords for image
search
0
“musical”㺢“performance”
㺢“Shiki theatre company”
1
2
Keywords for text
search
Number of
hits
Number of
Pertinent pages
19
17
“musical”㺢
“Shiki theatre company”
“performance”
38
30
“musical”
㺢“performance”
“Shiki theatre company”
20
12
“Shiki theatre company”
㺢 “performance”
“musical
21
15
“musical”
“Shiki theatre company”
㺢 “performance”
24
14
“Shiki theatre company”
“musical”
㺢“performance”
73
35
“performance”
“musical”㺢
“Shiki theatre company”
8
2
Figure 12: table of type A
18
keywords
“Mt. Fuji”㺢 ”sunset”㺢 “snow”
Number of
hits
Number of
Pertinent pages
0
0
“snow”
8
6
“Mt.Fuji”㺢“snow”
“sunset”
12
12
“sunset”㺢“snow”
“Mt.Fuji”
3
3
“Mt.Fuji”
“sunset”㺢“snow”
4
3
“snow”
“Mt.Fuji”㺢“sunset”
1
1
“sunset”
“Mt.Fuji” 㺢“snow”
0
0
The degree
of relaxation
Keywords for
image search
0
“Mt.Fuji”㺢“sunset”
㺢“snow”
“Mt.Fuji”㺢“sunset”
1
2
Keywords for text
search
keywords
“typhoon” 㺢“flood”㺢“heavy rain”
The degree
of relaxation
Keywords for image
search
0
“typhoon” 㺢“flood”
㺢“heavy rain”
1
2
Keywords for text
search
Number of
hits
Number of
Pertinent pages
14
4
“typhoon”㺢
“heavy rain”
“flood”
33
20
“typhoon” 㺢“flood”
“heavy rain”
22
14
“heavy rain” 㺢“flood”
“typhoon”
34
14
“typhoon”
“heavy rain”
㺢“flood”
18
7
“heavy rain”
“typhoon” 㺢“flood”
43
19
“flood”
“typhoon”
㺢 “heavy rain”
22
5
Figure 13: table of type B
keywords
“SARS”㺢 ” Pneumonia”㺢 “Condition”
The degree
of relaxation
0
1
Keywords for image
search
SARS, Pneumonia,
Condition
Number of
hits
Number of
Pertinent pages
0
0
SARS, Pneumonia
condition
0
0
SARS, Condition
Pneumonia,
0
0
SARS
0
0
Pneumonia,
Condition
0
0
Pneumonia
SARS, Condition
1
0
Condition
SARS, Pneumonia
0
0
Pneumonia, Condition
SARS
2
Keywords for text
search
Figure 14: table of type C
19
Precision(%)
100
80
60
average
40
䃂 degrees 0 of relaxation
䂥 degrees 1 of relaxation
䂓 degrees 2 of relaxation
20
0
20
40
60
80
100
Recall(%)
Figure 15: graph of typep A
Precision(%)
100
80
60
average
40
䃂 degrees 0 of relaxation
䂥 degrees 1 of relaxation
䂓 degrees 2 of relaxation
20
0
20
40
60
80
100
Recall(%)
Figure 16: graph of type B
20
4.3
Consideration
The results of the experiment are shown in Fig.12, 13, 14. The recall and
precision graphs for each keyword are also shown in Fig.15, 16.
The recall ratio for the results increases from 0 to 1 for every degree of relax-
ation. That is, increasing the number of keywords for the text search engine is
an eﬀective means of obtaining pertinent Web pages. Moreover, compared with
change of the recall from the degree 0 of relaxation to the degree 1 of relaxation, the change to the degree 2 of relaxation from the degree 1 of relaxation
was small. In the graph, there are some patterns.
• Type (A) is normal.
The recall ratio is increased and the precision ratio is decreased according
to the degree of relaxation.
• Type (B) is especial.
Type (B) shows from 0 to 1 of relaxation with and increased precision
and recall ratio that is particularly apparent. No answers were obtained at
0 of relaxation, but by relaxing the degree, many answers were obtained.
We think that it is the pattern in which the validity by relaxing the query
is shown notably.
• Type (C) does not have any answers.
When there is no keyword which can express a concrete picture like Type
(C), even if it makes the degree of relaxation high, a reference result does
not come out.
When there are few especially results of the degree 0 of relaxation, this pat-
tern appears in many cases. This shows that it is eﬀective to relax a query, if
there are few results of the degree 0 of relaxation. By the result of an experiment, query relaxation can improve recall considerably, maintaining precision
to about 70% at one of degree.
Through an experiment, even if a user inputs a few keyword into image search
engine, it turns out that the poor ﬁeld that there is little hit number of cases
also exists. In the case of the words and phrases to which each keyword cannot
express a concrete picture, even if it raises the degree of relaxation, there is
very little hit number. It turns out that the remarkable diﬀerence is attached
21
to the number of hits, and the eﬀective page by the keyword relaxed. Depending on the keyword, the hit number of a result of the degree 2 of relaxation
become fewer than of the degree 1 of relaxation. When it leaves a keyword
more concrete than such a result to image search, it is thought that the number
of hits will increase. However, we think that the degree of relation of reference
keywords, the degree of coincidence, the frequency for which a keyword is used
as contents of the metadata of a picture, etc. are related.
However, in the present evaluation, cautions are required for it to be depen-
dent on the picture reference algorithm of Google too much.
22
Chapter 5
Improvement of Query relaxation
We described the query relaxation method in Chapter 3. However, in the
method of calculating all subsets, when a user inputs a lot of keywords, the eﬃciency of the query relaxation method will become very bad. When a question
increases, it is because the number of subsets increases explosively. Therefore,
we consider the increase in eﬃciency of the query relaxation method supposing
the case where the reference question Q consists of a lot of keywords. That is
the case where many keywords are inputted into a search engine in order that
a user may extract and look for his information needed, the case where one
sentence of a Web page is copied, etc.
We describe the approaches of the query relaxation method for lots of key-
words (N keywords) below.
5.1
5.1.1
Method of improvement
Execution order of subset
Many subsets are made by inputting many keywords. We focus on the point
whether it is good to somewhere perform the subset. The lattice structure of a
subset in the query relaxation method is shown in Fig.7. The number of search
engines is two, the number of keywords is N. The element of the former of each
subset in a ﬁgure is used for text search and the element of the latter is used
for image search. That is, they are the degree 0 of relaxation, and the degree 1
of relaxation sequentially from a top.
It is clear by looking at the lattice structure of a ﬁgure that the number of
subsets of query will become a very huge quantity, If the number of keywords
will be N even if the number of search engines is two. Since it is such, in order to
collect solutions eﬃciently by Cross-media meta search, we think it important
not to collect solution pages in all subsets, but to collect solutions sequentially.
We think each search engine. About text search engine, The number of
solutions decreases like the direction under a ﬁgure, that is, the degree of relaxation becomes large. Since a reference question increases and conditions are
extracted, this is natural. About image search engine, The number of solutions
23
decreases like the direction up a ﬁgure, that is, the degree of relaxation becomes
small. Thus, although it is clear about the solution of each search engine, since
the question easing method is used simultaneously, such a relation is not realized. This is the point which makes it diﬃcult to decide the execution order of
the query relaxation method.
Pruning of every search engine is considered by the easy method. The con-
dition of the solution which put keyword K1 ∧ K2 ∧ K3 into search engine E1 ,
that is Ans(K1 ∧ K2 ∧ K3 , E1 ), is more severe than of Ans(K1 ∧ K2 , E1 ). Since
the keyword is added, this is natural. Therefore, the number of the solutions
of Ans(K1 ∧ K2 ∧ K3 , E1) should become below Ans(K1 ∧ K2 , E1 ). When
Ans(K1 ∧ K2 , E1 ) does not have a solution, Ans(K1 ∧ K2 ∧ K3 , E1) does not
have a solution. If this is used, the subset with the element without a solution
of a subset does not have a solution. If it becomes so, it is not necessary to
search one of the two any longer. It can specify not performing beforehand
reference which does not have a solution or does not exceed a threshold with
the hit number. Thus, by being simpliﬁed, pruning of each search engine is
possible.
The argument on an execution order is described below.
5.1.2
Breadth-first search and Depth-first search
The foundations of the order of from which subset to perform are breadth-ﬁrst
search and depth-ﬁrst search.
• breadth-ﬁrst search
The degree 0 of relaxation is performed ﬁrst. Since what has high accu-
racy is good as for reference, Web pages which has high accuracy are showed
sequentially. Second, every one degree of relaxation is raised. Finally, a
query is relaxed until the contents which a user satisﬁes are obtained. By
performing breadth-ﬁrst search, it can see sequentially from what has high
accuracy. We think that wide range of information can be acquired by
seeing the sequence with the same degree of relaxation.
• depth-ﬁrst search
The degree 0 of relaxation is performed ﬁrst. Second, a user considers
which keyword it leaves and the subset which raised the degree of relaxation
24
is performed. For example, the case of the search ”Mt.F uji” ∧ ”snow” ∧
”sunset”, suppose that the user speciﬁed that he wanted to see the picture
which is Mt. Fuji. Then, after showing that ”Mt.F uji”∧”snow” is used for
image search, the solution by which ”Mt.F uji” is used for image search
is displayed. When the same keyword can see what is used for picture
reference, we think that it is suitable in the speciﬁc thing when you want
a user.
In order to perform these, the statistical technique is used in breadth-ﬁrst
search and the linguistic technique is used in depth-ﬁrst search. The technique
is shown below.
5.1.3
The statistical technique
By the present query relaxation method, all subsets are to be performed.
However, calculation time is in a starting result display, and they are stripes.
The method of performing from the optimal subset and showing a solution one
by one is required. In order to judge which subset is the optimal, there is
a method of asking the hit number of cases of each keyword for the optimal
subset. The reason using the hit number is that what it is easy to acquire as
information which it had the front in the stage which begins reference is the hit
number of cases.
First, the one where the degree of relaxation is smaller, that is many keywords
used for image search, has high accuracy. Image search is because the conditions
are severe. Second, generally, there is so much hit number of image search of
the word that there is much hit number of text search of a certain word. Third,
considering the ”And” search with two words, the way of a reference result which
combined the word with much reference number of cases increases. Therefore,
we think that the method of searching from a subset with the small degree of
relaxation and with much hit number of cases of each word used for picture
reference is the optimal.
25
1
0 of relaxation
1 of relaxation
Ǿ, {Mt .Fuji ∧ snow ∧ sunset ‫ޓ‬
}
3
2
4
{Mt.Fuji}, {sunset ∧ snow} {sunset }, {‫ޓ‬
Mt .Fuji ∧ snow }‫{ޓ‬snow },
‫{ޓ‬Mt .Fuji ∧ sunset }
2 of relaxation
7
6
5
{Mt.Fuji ∧ snow}, {sunset‫ޓ‬
} {Mt .Fuji ∧ sunset }, {snow} {snow ∧ sunset }, {Mt.Fuji}
3 of relaxation
{Mt .Fuji ∧ snow ∧ sunset },Ǿ
The number of hits of each keywords
Mt. Fuji > snow > sunset
Figure 17: Breadth-ﬁrst search of the stastistical technic
Fig.17 shows the execution order of the subset when performing breadthﬁrst search. As an example, the case where a keyword is inputted as ”Mt.
Fuji” and ”snow” and ”sunset” is considered. Here, the hit number of cases
presupposes that it is the order of the Mt. Fuji ＞ snow ＞ sunset. The number
in a ﬁgure is the number of an execution order. As shown in a ﬁgure, it carries
out from 0 with the lowest degree of relaxation, and, next, the degree 1 of
relaxation is performed. In the degree 1 of relaxation, it shall perform from
a subset with most scores using the following formulas. Scoresub is score of
each subset text hitsall is the sum total of the number of hits of all reference
results which put each word of a reference keyword into text search, respectively,
and was obtained. image hitsall is the sum total of the number of hits of all
reference results which put each word of a reference keyword into image search,
respectively, and was obtained. text hitssub is the sum total of the hit number
of the keyword used for text search and image hitssub is the sum total of the
26
hit number of the keyword used for image search of a subset to ask for a score.
Scoresub = α(
text hitssub
image hitssub
) + β(
)
text hitsall
image hitsall
(α + β = 1)
(1)
α and β are changed by to which reference importance is attached. This formula is a formula for choosing what has more hit number of cases based on
the idea about the above-mentioned hit number. Thus, the query relaxation
method shall be carried out to breadth-ﬁrst search using a statistical technique.
However, there is a possibility that accuracy will become bad and a noise will
increase more as a reference result increases. Moreover, the information which
the user meant may not come out. It becomes important to adjust by sorting
out in the stage of a display of a reference result.
The other statistical techniques that the technique of judging whether it uses
for image search or it uses for text search, based on the information which the
user judged in the past by saving the data of the past reference, is also considered.
We experimented statistical method. Fig. 18 is an example of an experiment.
Fig. 19 shows the graph as a result of an experiment.
keywords
“Kyoto”㺢 “Autumnal leaves”㺢 “Koudai-ji”
order
Keywords for image
search
1
“Kyoto” 㺢“Koudai-ji”
㺢“Autumnal leaves”
2
“Kyoto”㺢
“Autumnal leaves”
3
Keywords for text
search
Number
of hits
Number of
Precision Recall
Pertinent pages
6
6
100%
9%
“Koudai-ji”
38
13
43%
29%
“Kyoto” 㺢“Koudai-ji”
“Autumnal leaves”
24
23
62%
66%
4
“Autumnal leaves”
㺢“Koudai-ji”
“Kyoto”
10
8
67%
78%
5
“Kyoto”
“Autumnal leaves”
㺢“Koudai-ji”
7
1
60%
80%
6
“Autumnal leaves”
“Kyoto” 㺢“Koudai-ji”
6
2
58%
83%
7
“Koudai-ji”
“Kyoto” 㺢
“Autumnal leaves”
54
11
44%
100%
Figure 18: Experiment of the stastistical technic
27
Precision(%)
100
80
60
40
20
0
䃂 degrees 0 of relaxation
䂥 degrees 1 of relaxation
䂓 degrees 2 of relaxation
order
20
40
60
80
100
Recall(%)
Figure 19: Graph of the stastistical technic
We think from an experiment result that precision may fall only by judging
from the hit number of cases. However, recall is high also in an early stage. We
think that the statistical technique attaches importance to recall.
5.1.4
The linguistic technique
This is the method of considering the optimal subset paying attention to the
ornamentation relation of the word in a subset. It is thought that a picture
which user wants is a word modiﬁed from other words in the reference keyword
group. That is, it is thought that it is the word limited most. For example,
the case of the search ”Kyoto” ∧ ”autumnalleaves” ∧ ”Kodai − ji”, generally
it is thought that the relation of ”Kodai-ji where autumnal leaves are famous
in Kyoto” is realized. Thus, we think that it is that with which Kodai-ji is
embellished from other keywords. In this case, when ”Kodai-ji” is inputted into
a picture surely the reference number of cases increased so that it might understand also from Fig.20. It can be said that this is harnessing the characteristic
of each search engine. There is a word which is easy to search with image
search. In this case, the number of hits of proper nouns such as a person and a
28
1
0 of relaxation
Ǿ, {Mt .Fuji ∧ snow ∧ sunset ‫ޓ‬
}
3
2
1 of relaxation
{Mt.Fuji}, {sunset ∧ snow} {sunset }, {‫ޓ‬
Mt .Fuji ∧ snow ‫ޓ‬
} {snow ‫ޓ‬
}, {Mt .Fuji ∧ sunset }
4
2 of relaxation
{Mt.Fuji ∧ snow}, {sunset‫ޓ‬
} {Mt.Fuji ∧ sunset}, {snow} {snow ∧ sunset}, {Mt.Fuji}
3 of relaxation
{Mt .Fuji ∧ snow ∧ sunset },Ǿ
The keywords of image which a user wants
“ Mt. Fuji ”
Figure 20: Depth-ﬁrst search of the linguistic technic
building tends to increase in picture reference. Thus, the technique of judging
a subset can be considered by considering the ornamentation relation between
words. However, since this is a guess, the page diﬀerent from the intention of
a user may be judged to be the optimal page. Moreover, the linguistic thing is
diﬃcult for a system judging.
We experimented the linguistic method. Fig. 21 is an example of an experi-
ment. Fig. 22 shows the graph as a result of an experiment.
keywords
“Kyoto”㺢 “Autumnal leaves”㺢 “Koudai-ji”
order
Keywords for image
search
1
“Kyoto” 㺢“Koudai-ji”
㺢“Autumnal leaves”
2
“Autumnal leaves”
㺢“Koudai-ji”
3
4
Keywords for text
search
Number
of hits
Number of
Precision Recall
Pertinent pages
6
6
100%
9%
“Kyoto”
10
8
88%
21%
“Kyoto” 㺢“Koudai-ji”
“Autumnal leaves”
24
23
93%
58%
“Koudai-ji”
“Kyoto” 㺢
“Autumnal leaves”
54
11
51%
72%
Figure 21: Experiment of the linguistic technic
29
Precision(%)
100
80
60
䃂
䂥
䂓
40
20
0
degrees 0 of relaxation
degrees 1 of relaxation
degrees 2 of relaxation
order
20
40
60
80
100
Recall(%)
Figure 22: Graph of the linguistic technic
We think from an experiment result that precision is better than the statistical technique. The linguistic technique has high precision, maintaining moderate recall. We think that the linguistic technique attaches importance to
precision.
5.1.5
The extended proposal of the query relaxation
It is enumerated as follows what enhancing idea you exist besides the de-
scription of the above-mentioned.
• Duplication of a reference keyword is not permitted by query relaxation
method. That is, when dividing into a subset, it uses for either text
search or image search. By permitting duplication, a reference keyword
can be used by both of reference. All information can be covered by allowing duplication. The number of processing increases enormously so
that the number of subsets increases more enormously. When searching
for example, ”Mt.F uji” ∧ ”snow” ∧ ”sunset”, The solution set which the
common pages of ”Mt.F uji” ∧ ”sunset”inputted into image search and
”Mt.F uji” ∧ ”snow” inputted into text search increases. It is thought by
allowing duplication of a reference keyword that a reference result can be
30
extracted more. However, since calculation time increases, it is important
to perform in not all subsets, but to calculate only a suitable subset, taking
the hit number of a keyword into consideration.
• The concept of inTitle and inText is introduced into the partial question
to a text search engine.
As a role of each word in a subset, the role assignment of whether the word
is used by inTitle or inText is carried out. It can be decided the word in
the reference keyword, make which word main and is considered. However,
since a role divides a partial question further, the number of the groups of
a partial question will increase and eﬃciency becomes bad.
• An unnecessary keyword is ﬁltered and removed from query Q.
When the reference question consists of too much a lot of keywords, The
Web page of the solution to the reference question is very diﬃcult for ﬁnding
it. Then, an unimportant word is removed by setting importance as each
word of a question of a user. Only an important word is extracted and the
word is used as a reference keyword. As a method of deciding importance,
we consider how to feed back from the hit number of cases, and the method
of preparing a relevant dictionary as a database beforehand. In this way,
the eﬃciency of reference is improved by generating new query Q’.
• Adaptation of feedback.
The word which is easy to be used for a picture, and the word used for a
text are fed back judging from a reference result. That is, the history of
the past reference is stored in a database, a user proﬁle is created, and the
query relaxation method is performed based on it.
• Feedback of image search.
This is a picture-oriented method more. A user feeds back by choosing a
thing more needed and the similar thing from the picture of a reference
result. It is more desirable to perform this with a search engine which
analyzes and searches a picture. However, those search engines do not exist
in the present stage. Then, we consider how to extract a word including
the coincidence relation from the surrounding text of the picture which the
user chose, and to search again.
31
Chapter 6
Information Integration
In order that a user may collect information eﬃciently, a Web page is not
shown as it is, but only the information related from a Web page is extracted
and displayed. The information integration is described below.
6.1
Extract information
The Web page has collected as a solution set by the query relaxation method.
However, in one Web page, the subject which is unrelated to a reference keyword
is also included. By removing unrelated subject from a Web page, we think
that eﬃcient useful information is acquirable. Based on this, only the portion
relevant to a reference keyword is extracted from the inside of a Web page by
this paper. The frequency of the word in the text in a Web page is used in
considering the relevance to a reference keyword. Moreover, since a possibility
that many subjects about the picture are included is high, the surroundings of
the picture collected by picture reference also take a distance relation with a
picture into consideration.
A procedure is described below.
1. Calculate the importance of a word based on the frequency of a word.
2. Calculate the distance of a picture and a paragraph.
3. Calculate the importance of a paragraph by using both the importance of
a word and distance of paragraph
A formula is expressed below.
Is is importance of paragraph s.
wt is importance of word of t in paragraph s.
r is distance of paragraph s from picture.
Is = Scores ∗ r2
(2)
Scores =
(3)
wt
w∈s
A paragraph is extracted based on this importance.
Thus, only related subject is extracted from the Web page of a reference re-
sult. Fig.23 shows this method. The reference result is made into the paragraph
32
and picture which were extracted from the Web page instead of a Web page
unit. Thus, it uniﬁes automatically, new contents are generated and a user is
shown by making this into a reference result. It is the feature for a reference
result to be settled as contents with new not the display of a URL list but
picture of a reference result and text document unlike the conventional meta
search engine, and to be displayed.
↹௝ᬌ⚝䈱⚿ᨐ䈱↹௝
extraction
Web䊕䊷䉳ౝ䈱䊁䉨䉴䊃
䂾 ን჻ጊ䈦䈩䈬䉖䈭ጊ 䉂䈭䈘䉖䈗ሽ䈛䈱䈫䈍䉍䇮ን჻ጊ䈲
ᮡ㜞3,776m䇮ᣣᧄ䈪৻⇟㜞䈇ጊ䈪䈜䇯
ᵴἫጊ䈫䈚䈩䈱ን჻ጊ ን჻ጊⷰశ䉕ᭉ䈚䉁䉏䉎⊝䈘䉖䈲䇮
ን჻ጊ䈏ታ䈲ᵴἫጊ䈣䈫䈇䈉䈖䈫䉕䈗ሽ䈛䈪䈚䉊䈉䈎䇯 ᣣᧄ
৻䈱㜞䈘䇮ᧃᐢ䈏䉍䈱䈐䉏䈇䈭ᒻ䈲䇮ን჻ጊ䈏ㆊ෰䈮૗ᐲ䉅
ྃἫ䉕䈚䇮ṁጤ䈭䈬䈱Ἣጊྃ಴‛䈏ᐞ㊀䈮䉅ၸⓍ䈚䇮䈧䈒䉌䉏
䈢䉅䈱䈪䈜䇯䈖䈱䉋䈉䈮Ἣጊ䈣䈎䉌䈖䈠䇮੹䈱ᭉ䈚䉁䉏䉎ን
჻ጊ䈏䈅䉎䉒䈔䈪䈜䇯 ㆊ෰䈱ྃἫ ን჻ጊ䈲੹䈎䉌⚂70䌾
20ਁᐕ೨䈮ᵴേ䉕㐿ᆎ䈚䇮ྃἫ䉕➅䉍㄰䈜䈖䈫䈪⚂䋱ਁᐕ೨
䈮⃻࿷䈱䉋䈉䈭⟤䈚䈇౞㍙ᒻ䈱Ἣጊ䈫䈭䈦䈢䈫⠨䈋䉌䉏䈩䈇䉁
䈜䇯ฎᢥᦠ䈭䈬䈱ᱧผ⾗ᢱ䈮䉅ን჻ጊ䈱ྃἫ䈱⸥ㅀ䈏䈅䉍䉁
䈜䇯1707ᐕ䈱ቲ᳗ྃἫ䉕ᦨᓟ䈮䈖䉏䉁䈪䈱⚂300ᐕ㑆䇮ን჻
ጊ䈲㕒䈎䈭⁁ᘒ䈪䈜䇯
Figure 23: Extract information
As for the Web page of the reference result collected for each sub-queries,
the Web page unit does not serve as a solution. Only the portion relevant to the
reference keyword is extracted from each Web page. Thus, a reference result is
not a Web page unit but the portion extracted from the Web page. Here, the
information extracted from the Web page of the reference result of each partial
question is uniﬁed automatically, and new contents are generated. A user is
shown this as a reference result. Unlike the conventional meta-search engine, a
reference result is not the display of a URL list. In Cross-media meta search,
it is the feature that the picture and a text document of a reference result are
collected as new contents, and are displayed. Fig.24 shows this information
integration method.
33
Figure 24: information integration
6.2
Copyright problem
In Cross-media meta search, only one portion which is related on a page
unlike the conventional reference is extracted. Therefore, the system is adding
processing to the work of others who are called a Web page. Moreover, the
system shows Web page so that it may be made to peruse simultaneously with
other pages. These acts may also become violation of copyright.
However, in the conventional search engine such as Google, the contents of
some pages containing the reference keyword other than URL are displayed on
the display screen of a reference result. Moreover, in Google image search, only
the photograph is extracted and shown. These acts cannot become violation of
copyright.
Based on these, in Cross-media meta search, a system speciﬁes the source point.
6.3
Searching based on Web page
Thus, the Cross-media meta search which extended the usual reference can be
used for other uses. One of them is searching based on a Web page. When the
user needs reference most, it is a time of there being information which he does
not understand, and information knowing. It is not eﬃcient that user continue
34
to peruse a Web page after visiting and searching one by one at this time.
Therefore, it is the purpose of this method that a user will search simultaneously
with perusing Web pages without using Web search cite. When the user has
patrolled the Web page and the language regarded as not understanding comes
out, this is speciﬁed with the drug of a mouse etc. At this time, while a user
searches, Cross-media meta search moves by the background. It can be used
with the feeling of seeing as an encyclopedia by unifying a result and displaying
brieﬂy. Fig.25 shows this image.
䊘䉳䉲䊢䊮
ᄖ㊁ᚻ 䋨ฝᛩ䈕䊶Ꮐᛂ䈤䋩
↢ᐕ᦬ᣣ
㪈㪐㪎㪊ᐕ㪈㪇᦬㪉㪉ᣣ
಴り࿾
り㐳䊶૕㊀
䊒䊨౉䉍
㪉㪇㪇㪉ᐕ䈱ᚑ❣
৻⸒⚫੺
ᗲ⍮⋵⼾ጊ↸
㪈㪎㪌㪺㫄䇮㪎㪊㫂㪾
㪈㪐㪐㪈ᐕ 䊄䊤䊐䊃㪋૏ᜰฬ䈪䉥䊥䉾䉪䉴౉䉍
ᛂ₸ 㪅㪊㪉㪈䊶㪏ᧄ႗ᛂ䊶㪌㪈ᛂὐ
⿛᡹቞䈠䉐䈦䈢䊜䉳䊞䊷⃻ᓎᦨ㜞䊒䊧䊷
䊟䊷䈱㪈ੱ䈫䈱๭䈶ჿ䉅㜞䈇䇯㪈ᐕ⋡䈮ᣂ
ੱ₺䈫㪤㪭㪧䉕₪ᓧ䇮㪉ᐕ⋡䈱ᤓቄ䉅䉥䊷䊦
䉴䉺䊷ᛩ␿䈪㪉ᐕㅪ⛯䊃䉾䊒䈫ฬታ䈫䉅䈮౗
䈰஻䈋䈢⌀䈱ᄢ䉴䉺䊷䇯࿾ర䈪䉅䊙䊥䊅䊷
䉵䈱㗻䈫䈚䈩⹺⍮䈘䉏䈩䈇䉎䇯㪉㪇㪇㪊ᐕ䉅䉅
䈤䉐䉖ᦼᓙᄢ䋣
Figure 25: searching based on Web page
6.4
Informational arrangement
For information integration, an important thing is not displaying information
as mere enumeration, but is displaying in a form with a certain scenario. Since
we want to create a thing like the encyclopedia using Web as motivation, we
consider the display method like an encyclopedia. Then, we think that an
important thing is showing the contents considered to be the most eﬀective to
the portion which the users chose. As an element like an encyclopedia, there are
35
explanation of a term, a photograph, usage, related subject, etc. It is adapted
for this information integration in this. In order to decide the element like an
encyclopedia, it is determined from a reference keyword what is subject. The
example of the element in the subject of reference to display is raised to below.
• Name of a person
A proﬁle, career, and photograph.
• Name of a place
Scenery, sightseeing spot, and the special feature.
• Term
Meaning, relation term and photograph.
• News
A report, a related article, and characters.
We think it eﬀective to show such an element.
Moreover, it is required to also consider the degree of similar of a text. Since
there is little amount of information even if the same text is displayed, as for
the subject with the high degree of similar, it is good to display the subject
which was omitted and is diﬀerent. For example, a medical virus and the virus
of a computer mainly exist in a virus. Completely, since this is another subject,
it classiﬁes and it displays. However, the text with the high degree of similar
is summarized into a medical virus. For example, even if there are many texts
about an inﬂuenza virus, it does not indicate all, but what has the high degree
of similar is eliminated and displayed.
36
Chapter 7
Prototype System
The prototype system was implemented according to the above-mentioned
method.
In this chapter, we describe the prototype system of Cross-Media meta search
system.
And, implementing environment is as follows;
• OS : Windows XP
• CPU : Pentium4 2.00GHz
• Memory : 2GB
• Development Environment : Microsoft Visual Studio .Net/ C
7.1
Implement
First, a user inputs their query which is consisted of K1 ,K2,. . . ,Kn (n≥2).
That is query Q = K1 ∧K2 ∧. . . ∧Kn .
Second, we use Google text search engine as E1 . And, we use Google image
search engine as E2 . Ans(Q) is Web page set of common Web pages of the
result Web pages of E1 and E2 . The reference result of Question Q is shown to
a user by generating the new contents which uniﬁed information using this Ans
(Q). Contents are written by html and a system creates a html ﬁle dynamically
here according to a demand of a user. Dynamic integration was enabled when
a web browser read it at any time.
The system architecture of the prototype system of Cross-media meta search
is shown in Fig.26. Proto type system of Cross-media meta search consists 4
parts.
• Interface part
Interface part is a part that a user inputs a reference query and the screen
of a reference result is displayed after reference is completed. A reference
result is displayed by reading html generated by information integration
part.
37
Cross-Media meta search System
Interface part
Execute
query relaxation
part
Information
Integration
part
Collecting
Web pages
part
Google
Google image
Web
Webspace
space
Figure 26: The system architecture of the prototype system of Cross-media
meta search
• Executing query relaxation part
Executing query relaxation part is a part that the reference question which
the user inputted is made into a subset using the query relaxation method
and those subsets are inputted into Google and Google image of a search
engine to be used.
• Collecting Web pages part
Collecting Web pages part is a part that a system collects the reference
results of the question which each search engine inputted, and matches
and chooses a common Web page out of the Web page of the reference
38
result.
• Information integration part
Information integration part is a part that a system extracts only an important portion from a common Web page, generates the html ﬁle for displaying a reference result by interfacepart based on the extracted information.
The ﬂow of processing of a prototype is as follows.
1. The query which the user inputted, and the number of keywords are read.
A user input query can be carved for every word by using blank. Each
keywords are held as K1 ,K2 ,. . . ,Kn (n≥2).
2. K1 ,. . . ,Kn is inputted to Google image search and image’s URL and URL
of result Web pages are collected.
3. K1 is inputted to Google text search, URL of result Web pages is collected.
4. K2 ,. . . ,Kn is inputted to Google image search and image’s URL and URL
of result Web pages are collected.
5. Calculate common Web pages of both results. The page which is not common is canceled.
6. Title and text except tab and Javascript are extracted from source code of
common web pages
7. This procedure is in the same degree of relaxation.
8. The degree of relaxation is raised one and this is repeated.
9. In order to check tf value to each text document, a sentence is divided per
word using a chasen[8].
10. The importance of a paragraph is calculated by calculating the importance
of a word.
11. A paragraph is extracted based on this importance.
12. The picture of the reference result of image search is mixed and new contents are created in html form.
13. The created html ﬁle is read and a user is shown.
39
7.2
Implemeting display
Fig.27 shows implementing display. Thus, it is a display screen like an or-
dinary browser. However, the result displayed is not as a result of URL like
before. A user inputs a reference question and pushes a reference button. If it
does so, it will be displayed in the form where the text of one portion of the
picture as a result of image search and the Web page as a result of text search
was extracted as shown in a ﬁgure.
Retrieval Query
Reference keyword
Image search results
Text extracted from a web page
Figure 27: implementing display
40
Chapter 8
Related work
Naver[9] is a type of meta-search engine that simultaneously searches for
HTML pages, CG animations, images, and sounds etc., pertinent to the user
input keyword(s) and then merges the retrieved content. These results are only
a collection of the retrieved information for each search engine. Naver is a Crossmedia meta-search engine that can retrieve information from several diﬀerent
media-sources from a user-input keyword query.
But a keyword(s) is inputted like the conventional search engine, and The
domain of images of image search and the domain of the list of URL of text
search are divided by integration. Although various media are treated, it diﬀers
from this research at this point.
Reference result of image
Reference result of text
Figure 28: NAVER
MIRADOR-Search[10] is service which uniﬁed the semantic reference in a
text developed by FUJITSU, and the visual reference by the picture and which
oﬀers a crossing media reference function.
If a user inputs a reference keyword, Web Crawler will analyze the Web
page on the Internet, and a picture and explanation texts will be collected
automatically. And as a reference result, the collected picture is mapped and
displayed on three dimensions based on the picture features, such as a color of
a picture, and a form. This system is the reference which specialized in the
41
picture very much.
Specify a kind of feature
feature of text
(frequency of word)
feature
extraction
feature of picture
(color, figure)
Set of feature
Space of feature of N dimension
3D space
SOM
Arrange
the similar information
to near
Figure 29: MIRADOR-Search
Cyclone[11] is also a meta-search engine for searching ordinary Web pages.
From a user-input keyword query, it searches several search engines and assembles the results into an encyclopedia-like form. The point of showing only a
required portion, without displaying a Web page as a reference result is connected with this research.
However, in this system, if one word is not inputted, a suitable solution can-
not be obtained. That is, it is thought that it is a Japanese dictionary using
Web.
Title and field of Web page
extracted paragraph
Figure 30: CYCLONE
42
Mr. oyama[12] proposed the Web reference technique using the formation
of hierarchical structure of the reference question at the time of inputting two
or more keywords into a search engine.
Precision is raised by distinguishing a role for every keyword like the keyword
showing the theme, and the keyword showing the contents about the theme.
Moreover, the reference result set which the same keyword also compared the
reference result which changed and searched the role, and had a big diﬀerence
as the display method of a reference result is shown in parallel.
In this research, this technique is used and the role is given to the question
used for image search at a reference keyword, and the question used for text
search.
user input query
role of keywords
Figure 31: interface by Mr. oyama
KartOO[13] is visual meta search engine. KartOO launches the query to
a set of search engines, gathers the results, compiles them and represents them
in a series of interactive maps through a proprietary algorithm In this map, the
found sites are represented by more or less important size pages, depending on
their relevance. When you move the pointer over these pages, t he concerned
keywords are illuminated and a brief description of the site appears on the left
side of the screen.
43
Relation of a link
Web pages
Figure 32: KartOO
Web Montagewebmontage uniﬁes a page dynamically. Web access is a routine pattern. A user looks at a page in the same order, when the same every
day. Since what supports the task of such a routine and Web browsing is not
developed, it is going to support this. The score is attached to the always visited Web page from a user’s history. And contents are arranged so that the
sum total of the score of the contents arranged on a Web screen may become
the maximum.
a part of contents of web page
Figure 33: Web montage
44
Chapter 9
Conclusion
In this paper, we propose a Cross-media meta search method. The features
of Cross-media meta search. This is a system that collects not only a text
but a picture, and various contents, such as music, from the contents on the
present various Web space. Harnessing the capability of the existing search
engine by using a meta-search, the amount of information can be increased and
the characteristic of various search engines can be used. Moreover, information
is processed so that a user can collect information eﬃciently.
The points that a Cross-media meta search diﬀers from the conventional
meta-search engine are the following points.
• The Cross-media meta search engine can collect not only Web pages, but
also mixed type multimedia contents by using search engine of various
media.
• User input query Q is changed into the form of having been suitable for
the search engine of various media.
• What uniﬁed the contents which extracted only one portion of a Web page
instead of URL is displayed as a reference result.
Cross-media meta search has such a feature.
There are two methods to realize Cross-media meta search.
• Query relaxation
the system which uses the search engine of a diﬀerent kind eﬃciently. user
input query is divided into a subset. Each subset is used for each search
engine. The common Web pages of each reference result of search engine
are collected. The solution which was not obtained conventionally can be
obtained by using query relaxation method.
• Information integration
the system which uniﬁes information so that a user can acquire information
eﬃciently.
45
Evaluation of our query relaxation approach shows that it can dramat-
ically improve the recall ratio for image retrievals. By the result of an experiment, query relaxation can improve recall considerably, maintaining precision
to about 70%. We believe that the query relaxation approach is an eﬃcient
method for image searches.
By this system, only by inputting a reference keyword, even if it does not
carry out troublesome search, a user can peruse many useful information about
a reference keyword easily. In this way, a user can utilize Web eﬀectively.
46
Acknowledgments
First, the author would like to thank Professor Kastumi TANAKA and As-
sistant Professor Satoshi OYAMA for supporting at all levels of this research.
And the author would like to thank advisers, Associate Professor Mizuho IWAIHARA and Associate Professor Hayato YAMANA for supporting this research.
Moreover the author would like to thank all the members of TANAKA Laboratory for supporting across the varied activity in the TANAKA Laboratory.
47
References
[1] Akihiro KUWABARA, Katsumi TANAKA : ”RelaxImage: A Cross-Media
Meta-Search Engine for Searching Images from Web Based on Query Relaxation” IEEE ICDE’2005, Tokyo, Japan, April 2005. (to appear)
[2] Akihiro KUWABARA, Kazutoshi SUMIYA, Katsumi TANAKA : ”Query
Relaxation and Answer Integration For Cross-Media Meta-Searches”
IEEE Int’l Conference on Multimedia and Expo (IEEE ICME’2004).
Taipei, Taiwan, June 2004. Published in ICME2004 Proceedings (CD).
www.icme2004.org.
[3] Akihiro KUWABARA, Hiroya TANAKA, Kazutoshi SUMIYA, Katsumi
TANAKA : ”Cross-Media Meta-Search by Query Relaxation”, DBSJ Letters Vol.3，No.1, June 2004, pp.97–100
[4] Akihiro KUWABARA, Satoshi OYAMA, Kazutoshi SUMIYA, Katsumi
TANAKA : ”Query Translation and Answer Integration Towards Multimedia Meta-Search”, DBSJ Letters Vol.2，No.1, May 2003, pp.55–58
[5] Google
http://www.google.co.jp/
[6] Altavista
http://altavista.com/
[7] Google image
http://images.google.co.jp/
[8] NAIST Computational Linguistics Lab
http://chasen.naist.jp/chasen/distribution.html.ja
[9] NAVER Japan
http://www.naver.co.jp/
[10] Fujitsu : MIRADOR-Search,
http://www.labs.fujitsu.com/News/1999/Dec/9-2.html
[11] Cyclone
http://cyclone.slis.tsukuba.ac.jp/
[12] Satoshi OYAMA ，Katsumi TANAKA : ”Web Search Using Hierarchical
Structuring of Queries” DBSJ Letters Vol.1，No.1, pp.63-66, 2002
48
[13] KartOO
http://www.kartoo.com/
[14] Corin R. Anderson, Eric Horvitz : Web Montage: A Dynamic Personalized
Start Page, WWW2002, pp.468–469(2002).
[15] M.C. Schraefel, Yuxiang Zhu, David Modjeska, Daniel Wigdor, Shengdong
Zhao : Hunter Gatherer: Interaction Support for the Creation and Management of Within-Web-Page Collections, WWW2002, pp.130–131(2002)
49