[1]
N-gram IDF: A Global Term Weighting Scheme Based on Information Distance
Technical Papers 2
/
Shirakawa, Masumi
/
Hara, Takahiro
/
Nishio, Shojiro
Proceedings of the 2015 International Conference on the World Wide Web
2015-05-18
v.1
p.960-970
© Copyright 2015 ACM
Summary: This paper first reveals the relationship between Inverse Document Frequency
(IDF), a global term weighting scheme, and information distance, a universal
metric defined by Kolmogorov complexity. We concretely give a theoretical
explanation that the IDF of a term is equal to the distance between the term
and the empty string in the space of information distance in which the
Kolmogorov complexity is approximated using Web documents and the Shannon-Fano
coding. Based on our findings, we propose N-gram IDF, a theoretical extension
of IDF for handling words and phrases of any length. By comparing weights among
N-grams of any N, N-gram IDF enables us to determine dominant N-grams among
overlapping ones and extract key terms of any length from texts without using
any NLP techniques. To efficiently compute the weight for all possible N-grams,
we adopt two string processing techniques: maximal substring extraction
using an enhanced suffix array and document listing using a wavelet tree. We
conducted experiments on key term extraction and Web search query segmentation,
and found that N-gram IDF was competitive with state-of-the-art methods that
were designed for each application using additional resources and efforts. The
results exemplified the potential of N-gram IDF.
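Illustration (not from the paper): the abstract builds on classic IDF, which for a term t over a document set D is log(|D| / df(t)). The sketch below computes that quantity for word N-grams of any length over a toy corpus. It is plain IDF with a base-2 logarithm, not the N-gram IDF weighting proposed in the paper, whose formula the abstract does not give. The corpus and the helper names `ngrams` and `idf` are assumptions made for this example.

```python
import math

def ngrams(tokens, n):
    """Return the set of contiguous word n-grams in a token list."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def idf(term, docs):
    """Classic IDF: log2(|D| / df), where df counts documents containing the term.

    `term` is a tuple of words, so single words and phrases share one code path.
    """
    df = sum(1 for tokens in docs if term in ngrams(tokens, len(term)))
    if df == 0:
        raise ValueError("term does not occur in the corpus")
    return math.log2(len(docs) / df)

# Toy corpus (hypothetical), one tokenized document per entry.
docs = [
    "new york is a city".split(),
    "new ideas for the city".split(),
    "york is in england".split(),
    "the new york times".split(),
]

if __name__ == "__main__":
    for term in [("new",), ("york",), ("new", "york")]:
        print(" ".join(term), round(idf(term, docs), 3))
```

Because a phrase can never occur in more documents than any of its words, its plain IDF is always at least as high as theirs. Plain IDF therefore cannot be compared fairly across different N, which is the gap the abstract says N-gram IDF addresses.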
[2]
Probabilistic semantic similarity measurements for noisy short texts using
Wikipedia entities
KM track: clusters, topics and similarity
/
Shirakawa, Masumi
/
Nakayama, Kotaro
/
Hara, Takahiro
/
Nishio, Shojiro
Proceedings of the 2013 ACM Conference on Information and Knowledge
Management
2013-10-27
p.903-908
© Copyright 2013 ACM
Summary: This paper describes a novel probabilistic method of measuring semantic
similarity for real-world noisy short texts like microblog posts. Our method
adds related Wikipedia entities to a short text as its semantic representation
and uses the vector of entities for computing semantic similarity. Adding
related entities to texts is generally a compound problem that involves the
extraction of key terms, finding related entities for each key term, and the
aggregation of related entities. Explicit Semantic Analysis (ESA), a popular
Wikipedia-based method, solves these problems by summing the weighted vectors
of related entities. However, this heuristic weighting highly depends on the
rule of majority decision and is not suited to short texts that contain few key
terms but many noisy terms. The proposed probabilistic method synthesizes these
procedures by extending naive Bayes and achieves robust estimates of related
Wikipedia entities for short texts. Experimental results on short text
clustering using Twitter data indicated that our method outperformed ESA for
short texts containing noisy terms.
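Illustration (not from the paper): the ESA-style aggregation that the abstract contrasts itself with can be sketched as summing, for every term in a text, a weighted vector of related entities, and then comparing two texts by cosine similarity. The term-to-entity table and its weights below are made up for the example; real ESA derives them from Wikipedia article text. The paper's own naive-Bayes-based method is not reproduced here.

```python
import math
from collections import defaultdict

# Hypothetical term -> {entity: weight} table; real weights would come from Wikipedia.
TERM_ENTITIES = {
    "apple": {"Apple Inc.": 0.6, "Apple (fruit)": 0.4},
    "iphone": {"IPhone": 0.9, "Apple Inc.": 0.5},
    "pie": {"Pie": 0.8, "Apple (fruit)": 0.2},
}

def entity_vector(tokens):
    """Sum the weighted entity vectors of all known terms in the text."""
    vec = defaultdict(float)
    for tok in tokens:
        for entity, weight in TERM_ENTITIES.get(tok, {}).items():
            vec[entity] += weight
    return dict(vec)

def cosine(u, v):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(w * v.get(e, 0.0) for e, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

if __name__ == "__main__":
    a = entity_vector("apple iphone".split())
    b = entity_vector("apple pie".split())
    print(cosine(a, b))
```

Every term contributes to the sum in the same way, so a few informative terms can be outvoted by many uninformative ones. That is the "majority decision" weakness the abstract attributes to this scheme on noisy short texts.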
[3]
Decentralised approach for a reusable crowdsourcing platform utilising
standard web servers
Workshop: PUCAA: 1st international workshop on pervasive urban crowdsensing
architecture and applications
/
Hara, Tenshi
/
Springer, Thomas
/
Bombach, Gerd
/
Schill, Alexander
Adjunct Proceedings of the 2013 International Joint Conference on Pervasive
and Ubiquitous Computing
2013-09-08
v.2
p.1063-1074
© Copyright 2013 ACM
Summary: Crowdsourcing has gained increasing interest in recent years as a means
of solving complex tasks with the help of a flexible group of contributors.
The crowd can contribute by collecting data in the field, completing map
information, or voting for ideas or products. Even though the participation of
large numbers of users with heterogeneous devices in crowdsourcing is a highly
recurrent task, generic infrastructures for crowdsourcing can hardly be found.
In particular, the management of users, mobile devices and contributed data has
to be reimplemented in each new project. To ease the development of
crowdsourcing applications, in this paper we propose a generic platform for
crowdsourcing supporting diverse crowdsourcing scenarios, the ability to handle
large numbers of users and the involvement of heterogeneous mobile devices. The
evaluation is based on scalability and performance experiments in order to
demonstrate the feasibility of our approach.
[4]
Semi-transparent Augmented Reality System
Extended Abstracts
/
Tachikawa, Tomoya
/
Hara, Takenori
/
Toyono, Chiho
/
Motai, Goro
/
Iwazaki, Karin
/
Shuto, Keisuke
/
Uchiyama, Hiroko
/
Yoshimura, Sakuji
Proceedings of the 2012 International Conference on Advances in Computer
Entertainment
2012-11-03
p.569-572
Keywords: AR; Transparency; Exhibition
© Copyright 2012 Springer-Verlag
Summary: We have developed a new Semi-Transparent Augmented Reality (AR) system that
displays the inner structures of objects by making their surface
semi-transparent. In this system we combine the live video of the object of
interest and 3D computer graphics (3DCG) models with appropriate transparency
and in proper order using AR technology. This system shows the 3DCG models of
inner structures as if they existed inside the object.
[5]
Image registration for text-gaze alignment
Poster presentation
/
Martinez-Gomez, Pascual
/
Chen, Chen
/
Hara, Tadayoshi
/
Kano, Yoshinobu
/
Aizawa, Akiko
Proceedings of the 2012 International Conference on Intelligent User
Interfaces
2012-02-14
p.257-260
© Copyright 2012 ACM
Summary: Applications using eye-tracking devices need a higher accuracy in
recognition when the task reaches a certain complexity. Thus, more
sophisticated methods to correct eye-tracking measurement errors are necessary
to lower the penetration barrier of eye-trackers in unconstrained tasks. We
propose to take advantage of the content or the structure of textual
information displayed on the screen to build informed error-correction
algorithms that generalize well. The idea is to use feature-based image
registration techniques to perform a linear transformation of gaze coordinates
to find a good alignment with text printed on the screen. In order to estimate
the parameters of the linear transformation, three optimization strategies are
proposed to avoid the problem of local minima, namely Monte Carlo,
multi-resolution and multi-blur optimization. Experimental results show that a
more precise alignment of gaze data with words on the screen can be achieved by
using these methods, allowing a more reliable use of eye-trackers in complex
and unconstrained tasks.
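Illustration (not from the paper): the idea of searching for a linear transformation that aligns gaze coordinates with on-screen text can be sketched with a simple Monte Carlo (random) search. Everything below is an assumption made for the example: the cost function (squared distance from each gaze point to its nearest word centre) stands in for the paper's feature-based image registration, and the parameter ranges, trial count, and function names are hypothetical.

```python
import random

def transform(points, sx, sy, dx, dy):
    """Apply a per-axis scale and offset to a list of (x, y) points."""
    return [(sx * x + dx, sy * y + dy) for x, y in points]

def cost(points, word_centers):
    """Sum of squared distances from each point to its nearest word centre."""
    return sum(
        min((px - wx) ** 2 + (py - wy) ** 2 for wx, wy in word_centers)
        for px, py in points
    )

def monte_carlo_align(gaze, word_centers, trials=2000, seed=0):
    """Random search over (sx, sy, dx, dy); keeps the best candidate seen.

    Starts from the identity transform, so the result is never worse than
    leaving the gaze data uncorrected.
    """
    rng = random.Random(seed)
    best = (1.0, 1.0, 0.0, 0.0)
    best_cost = cost(gaze, word_centers)
    for _ in range(trials):
        cand = (
            rng.uniform(0.9, 1.1), rng.uniform(0.9, 1.1),   # scales (assumed range)
            rng.uniform(-30, 30), rng.uniform(-30, 30),     # offsets in pixels (assumed range)
        )
        c = cost(transform(gaze, *cand), word_centers)
        if c < best_cost:
            best, best_cost = cand, c
    return best, best_cost

if __name__ == "__main__":
    words = [(100.0, 100.0), (200.0, 100.0), (300.0, 100.0)]
    gaze = [(x + 5.0, y + 12.0) for x, y in words]  # simulated systematic drift
    print(monte_carlo_align(gaze, words))
```

Sampling candidates independently, rather than following a gradient from one starting point, is one way to sidestep the local minima that the abstract mentions.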
[6]
EDITED BOOK
Human -- Computer Systems Interaction: Backgrounds and Applications 2: Part
2
Advances in Intelligent and Soft Computing 99
/
Hippe, Zdzislaw S.
/
Kulikowski, Juliusz L.
/
Mroczek, Teresa
2012
n.31
p.495
Springer Berlin Heidelberg
DOI: 10.1007/978-3-642-23172-8
== Part IV: Environment Monitoring and Robotic Systems ==
SSVEP-Based Brain-Computer Interface: On the Effect of Stimulus Parameters on VEPs Spectral Characteristics (3-14)
+ Byczuk, M.
+ Poryzala, P.
+ Materka, A.
Design and Development of a Guideline for Ergonomic Haptic Interaction (15-29)
+ Muñoz, L. M.
+ Ponsa, P.
+ Casals, A.
Partner Robots -- From Development to Business Implementation (31-39)
+ Ota, Y.
Goal Understanding and Self-generating Will for Autonomous Humanoid Robots (41-55)
+ Nauth, P.
A Talking Robot and Its Singing Performance by the Mimicry of Human Vocalization (57-73)
+ Kitani, M.
+ Hara, T.
+ Hanada, H.
+ Sawada, H.
An Orthopedic Surgical Robotic System-OrthoRoby (75-90)
+ Barkana, D. Erol
Methods for Reducing Operational Forces in Force-Sensorless Bilateral Control with Thrust Wires for Two-Degree-of-Freedom Remote Robots (91-107)
+ Sato, T.
+ Sakaino, S.
+ Yakoh, T.
== Part V: Diagnostic Systems ==
Applications of Neural Networks in Semantic Analysis of Skin Cancer Images (111-124)
+ Przystalski, K.
+ Nowak, L.
+ Ogorzalek, M.
+ Surówka, G.
Further Research on Automatic Estimation of Asymmetry of Melanocytic Skin Lesions (125-129)
+ Cudek, P.
+ Grzymala-Busse, J. W.
+ Hippe, Z. S.
Multispectral Imaging for Supporting Colonoscopy and Gastroscopy Diagnoses (131-145)
+ Switonski, A.
+ Bieda, R.
+ Wojciechowski, K.
A Machine Learning Approach to Mining Brain Stroke Data (147-158)
+ Mroczek, T.
+ Grzymala-Busse, J. W.
+ Hippe, Z. S.
+ Jurczak, P.
Using Eye-Tracking to Study Reading Patterns and Processes in Autism with Hyperlexia Profile (159-174)
+ Pazzaglia, R.
+ Ravarelli, A.
+ Balestra, A.
+ Orio, S.
+ Zanetti, M. A.
Ontology Design for Medical Diagnostic Knowledge (175-189)
+ Jaszuk, M.
+ Szostek, G.
+ Walczak, A.
Rule-Based Analysis of MMPI Data Using the Copernicus System (191-203)
+ Gomula, J.
+ Paja, W.
+ Pancerz, K.
+ Szkola, J.
Application of 2D Anisotropic Wavelet Edge Extractors for Image Interpolation (205-222)
+ Adamczyk, K.
+ Walczak, A.
Experimental Results of Model-Based Fuzzy Control Solutions for a Laboratory Antilock Braking System (223-234)
+ Precup, R. E.
+ Spataru, S. V.
+ Radac, M. B.
+ Petriu, E. M.
+ Preitl, S.
+ Dragos, C. A.
+ et al
== Part VI: Educational Systems ==
Remote Teaching and New Testing Method Applied in Higher Education (237-259)
+ Pyzik, L.
Points of View on Magnetic Levitation System Laboratory-Based Control Education (261-275)
+ Dragos, C. A.
+ Preitl, S.
+ Precup, R. E.
+ Petriu, E. M.
2D and 3D Visualizations of Creative Destruction for Entrepreneurship Education (277-294)
+ Noyes, E.
+ Deligiannidis, L.
Employing a Biofeedback Method Based on Hemispheric Synchronization in Effective Learning (295-309)
+ Kaszuba, K.
+ Kostek, B.
== Part VII: General Problems ==
Comparison of Fuzzy and Neural Systems for Implementation of Nonlinear Control Surfaces (313-324)
+ Xie, T. T.
+ Yu, H.
+ Wilamowski, B. M.
Hardware Implementation of Fuzzy Default Logic (325-343)
+ Pulka, A.
+ Milik, A.
Dwulit's Hull as Means of Optimization of kNN Algorithm (345-358)
+ Dwulit, M. P.
+ Szymanski, Z.
OWiki: Enabling an Ontology-Led Creation of Semantic Data (359-374)
+ Di Iorio, A.
+ Musetti, A.
+ Peroni, S.
+ Vitali, F.
Fuzzy Genetic Object Identification: Multiple Inputs/Multiple Outputs Case (375-394)
+ Rotshtein, A. P.
+ Rakytyanska, H. B.
Server-Side Query Language for Protein Structure Similarity Searching (395-415)
+ Malysiak-Mrozek, B.
+ Kozielski, S.
+ Mrozek, D.
A New Kinds of Rules for Approximate Reasoning Modeling (417-428)
+ Palasinski, M.
+ Fryc, B.
+ Machnicka, Z.
Technical Evaluation of Boolean Recommenders (429-441)
+ Chojnacki, S.
+ Klopotek, M. A.
Interval Uncertainty in CPL Models for Computer Aided Prognosis (443-461)
+ Bobrowski, L.
Neural Network Training with Second Order Algorithms (463-476)
+ Yu, H.
+ Wilamowski, B. M.
Complex Neural Models of Dynamic Complex Systems: Study of the Global Quality Criterion and Results (477-495)
+ Dralus, G.
[7]
"interacumos": an interactive digital cubic kaleidoscope
Creative showcase & interactive art
/
Kanebako, Junichi
/
Oda, Masafumi
/
Matsuo, Kanako
/
Shuto, Keisuke
/
Hara, Takenori
/
Motai, Goro
/
Yamazaki, Minori
/
Uchiyama, Hiroko
Proceedings of the 2011 International Conference on Advances in Computer
Entertainment Technology
2011-11-08
p.84
© Copyright 2011 ACM
Summary: We developed a new digital kaleidoscope called "interacumos," which
generates interactive videos with depth perception based on the infinite
reflections of a six-sided mirrored cubic kaleidoscope. We installed OLED
panels and pressure sensors in a cubic kaleidoscope; "interacumos" analyzes
movements of the hand with pressure sensors and generates interactive videos in
real time. As a result, the user can enjoy interactive videos as if playing a
tune on an instrument.
[8]
Story tube "Sto-tu"
Creative showcase
/
Uchiyama, Hiroko
/
Sato, Akiko
/
Takai, Mai
/
Shibasaki, Mina
/
Takeda, Yuki
/
Hara, Takenori
/
Ookura, Masahiro
/
Tanaka, Mina
/
Komatsubara, Shigeru
Proceedings of the 2009 International Conference on Advances in Computer
Entertainment Technology
2009-10-29
p.447
© Copyright 2009 ACM
Summary: Recently, several Augmented Reality (AR) Systems have been proposed [Yoshida
et al. 2008]. We developed a new system which is a combination of a
"trompe-l'oeil" image and AR technology which allows a user to experience and
enjoy multi-media content in an unprecedented manner. This system creates a
composite of CG and video taken from a camera which is inserted into a tube,
with the "trompe-l'oeil"-style story pictures drawn inside the tube. The user
then advances through the story by moving the camera back and forth inside the
tube. This means the user can enjoy the story contents in a tangible and
interactive environment with an intuitive interface. Based on our
experiments, we maintain that our system enables the user to appreciate and
enjoy a typical Japanese fantasy story.
[9]
SHI KI BAKO "Box of Four Seasons"
Creative showcase
/
Uchiyama, Hiroko
/
Sato, Akiko
/
Takai, Mai
/
Shibasaki, Mina
/
Ookura, Masahiro
/
Imai, Masataka
/
Hara, Takenori
/
Takeda, Yuki
/
Tanaka, Mina
/
Komatsubara, Shigeru
Proceedings of the 2009 International Conference on Advances in Computer
Entertainment Technology
2009-10-29
p.450
© Copyright 2009 ACM
Summary: By opening the drawers of "Box of Four Seasons", you can enjoy the nature
and culture of Japan. The nature and culture of each season are stowed away in
a drawer and come vividly to life through AR technology. You can find the joy
of living with a culture nurtured by the nature of the earth.
[10]
User Profiling for Web Search Based on Biological Fluctuation
Agents, Avatars and Personalisation
/
Arase, Yuki
/
Hara, Takahiro
/
Nishio, Shojiro
HCI International 2009: 13th International Conference on Human-Computer
Interaction, Part III: Ambient, Ubiquitous and Intelligent Interaction
2009-07-19
v.3
p.239-247
Keywords: User profile; Web search; biological fluctuation
© Copyright 2009 Springer-Verlag
Summary: Because of the information flood on the Web, it has become difficult to
search for necessary information. Although Web search engines assign authority
values to Web pages and show ranked results, this is not enough to find
information of interest easily, as users have to comb through information that
is reliable but off topic. In this situation, personalization of Web search
results is effective. Personalization requires a user profiling technique.
However, since users' interests are unstable and varied, the technique should
be flexible and tolerant of changes in the environment.
In this paper, we propose a user profiling method based on the model of the
organisms' flexibility and environmental tolerance. We review the previous user
profiling methods and discuss the adequacy of applying this model to user
profiling.
[11]
A game based approach to assign geographical relevance to web images
User interfaces and mobile web/session: mobile web
/
Arase, Yuki
/
Xie, Xing
/
Duan, Manni
/
Hara, Takahiro
/
Nishio, Shojiro
Proceedings of the 2009 International Conference on the World Wide Web
2009-04-20
p.811-820
Keywords: geographical relevance, human computation, image annotation, image search
© Copyright 2009 International World Wide Web Conference Committee (IW3C2)
Summary: Geographical context is very important for images. Millions of images on the
Web have already been assigned latitude and longitude information. Due to the
rapid proliferation of such images with geographical context, it is still
difficult to effectively search and browse them, since we do not have ways to
decide their relevance. In this paper, we focus on the geographical relevance
of images, which is defined as to what extent the main objects in an image
match landmarks at the location where the image was taken. Recently,
researchers have proposed to use game based approaches to label large scale
data such as Web images. However, previous works have not shown the quality of
collected game logs in detail and how the logs can improve existing
applications. To answer these questions, we design and implement a Web-based
multi-player game to collect human knowledge while people are enjoying the
game. Then we thoroughly analyze the game logs obtained during a three-week
study with 147 participants and propose methods to determine the geographical
relevance of images. In addition, we conduct an experiment to compare our
methods with a commercial search engine. Experimental results show that our
methods dramatically improve image search relevance. Furthermore, we show that
we can derive geographically relevant objects and their salient portion in
images, which is valuable for a number of applications such as image location
recognition.
[12]
Association thesaurus construction methods based on link co-occurrence
analysis for wikipedia
KM: semantic techniques
/
Ito, Masahiro
/
Nakayama, Kotaro
/
Hara, Takahiro
/
Nishio, Shojiro
Proceedings of the 2008 ACM Conference on Information and Knowledge
Management
2008-10-26
p.817-826
© Copyright 2008 ACM
Summary: Wikipedia, a huge-scale Web-based encyclopedia, attracts great attention as
an invaluable corpus for knowledge extraction because it has various impressive
characteristics such as a huge number of articles, live updates, a dense link
structure, brief anchor texts and URL identification for concepts. We have
already shown that Wikipedia can be used to construct a huge-scale, accurate
association thesaurus. The association thesaurus we constructed covers almost
1.3 million concepts, and its accuracy has been confirmed in detailed experiments.
However, we still need scalable methods to analyze the huge number of Web pages
and hyperlinks among articles in the Web-based encyclopedia.
In this paper, we propose a scalable method for constructing an association
thesaurus from Wikipedia based on link co-occurrences. Link co-occurrence
analysis is more scalable than link structure analysis because it is a one-pass
process. We also propose a method that integrates tfidf with link co-occurrence
analysis. Experimental results show that both our proposed methods are more
accurate and scalable than conventional methods. Furthermore, the integration
of tfidf achieved higher accuracy than using only link co-occurrences.
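Illustration (not from the paper): a one-pass link co-occurrence count can be sketched as follows. Each article is visited once, and every pair of distinct link targets appearing in it has its counter incremented. Treating the whole article as the co-occurrence window, using raw counts as the association score, and the function names are all assumptions made for this example; the abstract does not specify the paper's windowing or weighting.

```python
from collections import Counter
from itertools import combinations

def link_cooccurrence(articles):
    """Count, in a single pass, how often two link targets share an article.

    `articles` is an iterable of lists of link targets (concept names).
    """
    pair_counts = Counter()
    for links in articles:
        # Sort so each unordered pair is always stored under the same key.
        for a, b in combinations(sorted(set(links)), 2):
            pair_counts[(a, b)] += 1
    return pair_counts

def related(concept, pair_counts, k=3):
    """Return the k concepts that co-occur most often with `concept`."""
    scores = Counter()
    for (a, b), n in pair_counts.items():
        if a == concept:
            scores[b] += n
        elif b == concept:
            scores[a] += n
    return scores.most_common(k)

if __name__ == "__main__":
    # Toy link lists (hypothetical), one per article.
    articles = [
        ["Tokyo", "Japan", "Sushi"],
        ["Tokyo", "Japan"],
        ["Sushi", "Rice"],
    ]
    counts = link_cooccurrence(articles)
    print(related("Tokyo", counts))
```

The counting loop needs only the outgoing links of the article at hand, which is what makes a single pass over the corpus sufficient.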
[13]
OPA browser: a web browser for cellular phone users
Web
/
Arase, Yuki
/
Hara, Takahiro
/
Uemukai, Toshiaki
/
Nishio, Shojiro
Proceedings of the 2007 ACM Symposium on User Interface Software and
Technology
2007-10-07
p.71-80
© Copyright 2007 ACM
Summary: Cellular phones are widely used to access the WWW. However, most available
Web pages are designed for desktop PCs. Cellular phones only have small screens
and poor interfaces, and thus, it is inconvenient to browse such large sized
pages. In addition, cellular phone users browse Web pages in various
situations, so that appropriate presentation styles for Web pages depend on
users' situations. In this paper, we propose a novel Web browsing system for
cellular phones that assigns various Web browsing functions to the
numerical keys of a cellular phone. Users can browse Web pages comfortably,
selecting appropriate functions according to their situations by pushing a
single button.
[14]
A web browsing system for cellular-phone users based on adaptive
presentation
/
Arase, Yuki
/
Maekawa, Takuya
/
Hara, Takahiro
/
Uemukai, Toshiaki
/
Nishio, Shojiro
Universal Access in the Information Society
2007
v.6
n.3
p.259-271
© Copyright 2007 Springer-Verlag
Summary: Cellular phones are widely used to access the Web. However, most available
Web pages are designed for desktop PCs, and it is inconvenient to browse these
large Web pages on a cellular phone with a small screen and poor interfaces.
Users who browse a Web page on a cellular phone have to scroll through the
whole page to find the desired content, and must then search and scroll within
that content in detail to get useful information. This paper describes the
design and implementation of a novel Web browsing system for cellular phones.
This system includes a Web page overview to reduce scrolling operations when
finding the desired content within the page. Furthermore, it adaptively presents
content according to its characteristics to reduce burdensome operations when
searching within content.
[15]
Image classification for mobile web browsing
Adaptivity & mobility
/
Maekawa, Takuya
/
Hara, Takahiro
/
Nishio, Shojiro
Proceedings of the 2006 International Conference on the World Wide Web
2006-05-23
p.43-52
Keywords: mobile computing, web browsing, web images
© Copyright 2006 International World Wide Web Conference Committee (IW3C2)
Summary: It is difficult for users of mobile devices such as cellular phones equipped
with a small screen and a poor input interface to browse Web pages designed for
desktop PCs with large displays. Many studies and commercial products have
tried to solve this problem. Web pages include images that have various roles
such as site menus, line headers for itemization, and page titles. However,
most studies of mobile Web browsing haven't paid much attention to the roles of
Web images. In this paper, we define eleven Web image categories according to
their roles and use these categories for proper Web image handling. We manually
categorized 3,901 Web images collected from forty Web sites and extracted image
features of each category according to the classification. By making use of the
extracted features, we devised an automatic Web image classification method.
Furthermore, we evaluated the automatic classification of real Web pages and
achieved up to 83.1% classification accuracy. We also implemented an automatic
Web page scrolling system as an application of our automatic image
classification method.
[16]
A web browsing system based on adaptive presentation of web contents for
cellular phones
Mobile web/accessibility overlaps
/
Arase, Yuki
/
Maekawa, Takuya
/
Hara, Takahiro
/
Uemukai, Toshiaki
/
Nishio, Shojiro
Proceedings of the 2006 International Cross-Disciplinary Conference on Web
Accessibility (W4A)
2006-05-22
p.86-89
© Copyright 2006 ACM
Summary: Cellular phones have already been widely used to access the Web. However,
most existing Web pages are designed for desktop PCs, and thus, it is
inconvenient to browse these large Web pages on a cellular phone with a small
screen and poor interfaces. Users who browse a Web page on a cellular phone
have to scroll through the whole page to find the desired content, and then have to
scroll within that content in detail to get useful information. In this paper,
we propose a novel browsing system that reduces these burdensome operations by
adaptively presenting Web content according to its characteristics.
[17]
An agent system for ontology sharing on WWW
Posters
/
Nakayama, Kotaro
/
Hara, Takahiro
/
Nishio, Shojiro
Proceedings of the 2005 International Conference on the World Wide Web
2005-05-10
v.2
p.964-965
Keywords: agent technologies, ontology, semantic web, web services
© Copyright 2005 International World Wide Web Conference Committee (IW3C2)
Summary: Semantic Web Services (SWS), a new generation WWW technology, will
facilitate the automation of Web service tasks, including automated Web service
discovery, execution, composition and mediation by using XML based metadata and
ontology. There have been several efforts to build knowledge representation
languages for Web Services. However, only a few attempts have so far been made to
develop applications based on SWS. In particular, front-end agent systems for
users are an urgent research area. The purpose of this paper is to
introduce our new integrated front-end agent system for ontology management and
SWS management.