HCI Bibliography : Search Results skip to search form | skip to results |
Database updated: 2016-05-10 Searches since 2006-12-01: 32,235,920
director@hcibib.org
Hosted by ACM SIGCHI
The HCI Bibliogaphy was moved to a new server 2015-05-12 and again 2016-01-05, substantially degrading the environment for making updates.
There are no plans to add to the database.
Please send questions or comments to director@hcibib.org.
Query: hu_y* Results: 38 Sorted by: Date  Comments?
Help Dates
Limit:   
<<First <Previous Permalink Next> Last>> Records: 1 to 25 of 38 Jump to: 2016 | 15 | 14 | 13 | 12 | 11 | 10 | 09 | 08 | 07 | 06 | 05 | 03 |
[1] Access: A Mobile Application to Improve Accessibility Video Showcase Presentations / Yang, Yi / Hu, Yunqi / Hong, Yidi / Joshi, Varun / Kolathumani, Radhika Extended Abstracts of the ACM CHI'16 Conference on Human Factors in Computing Systems 2016-05-07 v.2 p.5
ACM Digital Library Link
Summary: This video introduces Access, a mobile application to provide information about accessibility information of local establishment and public spaces. We partnered with a local nonprofit in Jackson, MI which allowed us to access existing data they had collected in the three county areas of Jackson, Hillsdale and Lenawee in Michigan. The application is focused on providing information on accessibility of local establishments and public spaces, and how the mechanism works will be clearly explained in the video. Additionally, the video demonstrates details of user interfaces of Access and information flow, and also provides user scenarios depicting how the solution fits in the life of wheelchair users and solves problems.

[2] Video driven pedestrian visualization with characteristic appearances Crowds & human / Hu, Yu / Wu, Wei / Zhou, Zhong Proceedings of the 2015 ACM Symposium on Virtual Reality Software and Technology 2015-11-13 p.183-186
ACM Digital Library Link
Summary: Augmented virtual environment (AVE) could visualize plausible live views from videos by projecting dynamic imagery to the 3D environment. Static objects in a video can be rendered in new views since they are easily modeled beforehand, while moving ones that don't have exact online models will be distorted from different views without proper depth. To cope with the problem, we introduce a novel method to visualize pedestrians, which are common in outdoor surveillance. Our method detects pedestrians and produces their trajectories. Then such pedestrian characteristic appearances as geometric information, texture and walking animation are transferred to a stand-in 3D animation model in the virtual environment. Experiments show our visualization can reveal person's characteristic appearances.

[3] Collaborative Fashion Recommendation: A Functional Tensor Factorization Approach Session 3: Emotional and Social Signals in Multimedia / Hu, Yang / Yi, Xi / Davis, Larry S. Proceedings of the 2015 ACM International Conference on Multimedia 2015-10-26 p.129-138
ACM Digital Library Link
Summary: With the rapid expansion of online shopping for fashion products, effective fashion recommendation has become an increasingly important problem. In this work, we study the problem of personalized outfit recommendation, i.e. automatically suggesting outfits to users that fit their personal fashion preferences. Unlike existing recommendation systems that usually recommend individual items, we suggest sets of items, which interact with each other, to users. We propose a functional tensor factorization method to model the interactions between user and fashion items. To effectively utilize the multi-modal features of the fashion items, we use a gradient boosting based method to learn nonlinear functions to map the feature vectors from the feature space into some low dimensional latent space. The effectiveness of the proposed algorithm is validated through extensive experiments on real world user data from a popular fashion-focused social network.

[4] Deep Multimodal Speaker Naming Poster Session 2 / Hu, Yongtao / Ren, Jimmy SJ. / Dai, Jingwen / Yuan, Chang / Xu, Li / Wang, Wenping Proceedings of the 2015 ACM International Conference on Multimedia 2015-10-26 p.1107-1110
ACM Digital Library Link
Summary: Automatic speaker naming is the problem of localizing as well as identifying each speaking character in a TV/movie/live show video. This is a challenging problem mainly attributes to its multimodal nature, namely face cue alone is insufficient to achieve good performance. Previous multimodal approaches to this problem usually process the data of different modalities individually and merge them using handcrafted heuristics. Such approaches work well for simple scenes, but fail to achieve high performance for speakers with large appearance variations. In this paper, we propose a novel convolutional neural networks (CNN) based learning framework to automatically learn the fusion function of both face and audio cues. We show that without using face tracking, facial landmark localization or subtitle/transcript, our system with robust multimodal feature extraction is able to achieve state-of-the-art speaker naming performance evaluated on two diverse TV series. The dataset and implementation of our algorithm are publicly available online.

[5] Digital Rights Strategies in a Virtual World Marketplace Electronic, Mobile and Ubiquitous Commerce / Hu, Yuanrong / Fan, Si / Wang, Qiuhong HCIB 2015: 2nd International Conference on HCI in Business 2015-08-02 p.300-311
Keywords: Copyright strategy; Digital products; Virtual products
Link to Digital Content at Springer
Summary: This paper adopts the Heckman two-step model to analyze the impact of copyright strategy on sales performance of digital product, using the panel data comes from online virtual goods transaction website Xstreet.com. The results show that (1) significant relationship exists between sales performance and copyright strategy of digital product, but the influence of each copyright strategy on sales performance are different; (2) A seller's copyright structure within same product line can also affect sales performance of a digital product, thus in order to optimize the copyright combinations, a seller should fully consider the copyright strategy within the whole product line.

[6] Energy and Performance of Smartphone Radio Bundling in Outdoor Environments Technical Papers 2 / Nika, Ana / Zhu, Yibo / Ding, Ning / Jindal, Abhilash / Hu, Y. Charlie / Zhou, Xia / Zhao, Ben Y. / Zheng, Haitao Proceedings of the 2015 International Conference on the World Wide Web 2015-05-18 v.1 p.809-819
ACM Digital Library Link
Summary: Most of today's mobile devices come equipped with both cellular LTE and WiFi wireless radios, making radio bundling (simultaneous data transfers over multiple interfaces) both appealing and practical. Despite recent studies documenting the benefits of radio bundling with MPTCP, many fundamental questions remain about potential gains from radio bundling, or the relationship between performance and energy consumption in these scenarios. In this study, we seek to answer these questions using extensive measurements to empirically characterize both energy and performance for radio bundling approaches. In doing so, we quantify potential gains of bundling using MPTCP versus an ideal protocol. We study the links between traffic partitioning and bundling performance, and use a novel componentized energy model to quantify the energy consumed by CPUs (and radios) during traffic management. Our results show that MPTCP achieves only a fraction of the total performance gain possible, and that its energy-agnostic design leads to considerable power consumption by the CPU. We conclude that not only there is room for improved bundling performance, but an energy-aware bundling protocol is likely to achieve a much better tradeoff between performance and power consumption.

[7] GalaxyExplorer: Influence-Driven Visual Exploration of Context-Specific Social Media Interactions Demonstrations / Liu, Xiaotong / Parthasarathy, Srinivasan / Shen, Han-Wei / Hu, Yifan Companion Proceedings of the 2015 International Conference on the World Wide Web 2015-05-18 v.2 p.215-218
ACM Digital Library Link
Summary: The ever-increasing size and complexity of social networks place a fundamental challenge to visual exploration and analysis tasks. In this paper, we present GalaxyExplorer, an influence-driven visual analysis system for exploring users of various influence and analyzing how they influence others in a social network. GalaxyExplorer reduces the size and complexity of a social network by dynamically retrieving theme-based graphs, and analyzing users' influence and passivity regarding specific themes and dynamics in response to disaster events. In GalaxyExplorer, a galaxy-based visual metaphor is introduced to simplify the visual complexity of a large graph with a focus+context view. Various interactions are supported for visual exploration. We present experimental results on real-world datasets that show the effectiveness of GalaxyExplorer in theme-aware influence analysis.

[8] Iterative Multi-View Hashing for Cross Media Indexing Multimedia Search and Indexing / Hu, Yao / Jin, Zhongming / Ren, Hongyi / Cai, Deng / He, Xiaofei Proceedings of the 2014 ACM International Conference on Multimedia 2014-11-03 p.527-536
ACM Digital Library Link
Summary: Cross media retrieval engines have gained massive popularity with rapid development of the Internet. Users may perform queries in a corpus consisting of audio, video, and textual information. To make such systems practically possible for large mount of multimedia data, two critical issues must be carefully considered: (a) reduce the storage as much as possible; (b) model the relationship of the heterogeneous media data. Recently academic community have proved that encoding the data into compact binary codes can drastically reduce the storage and computational cost. However, it is still unclear how to integrate multiple information sources properly into the binary code encoding scheme.
    In this paper, we study the cross media indexing problem by learning the discriminative hashing functions to map the multi-view datum into a shared hamming space. Not only meaningful within-view similarity is required to be preserved, we also incorporate the between-view correlations into the encoding scheme, where we map the similar points close together and push apart the dissimilar ones. To this end, we propose a novel hashing algorithm called Iterative Multi-View Hashing (IMVH) by taking these information into account simultaneously. To solve this joint optimization problem efficiently, we further develop an iterative scheme to deal with it by using a more flexible quantization model. In particular, an optimal alignment is learned to maintain the between-view similarity in the encoding scheme. And the binary codes are obtained by directly solving a series of binary label assignment problems without continuous relaxation to avoid the unnecessary quantization loss. In this way, the proposed algorithm not only greatly improves the retrieval accuracy but also performs strong robustness. An extensive set of experiments clearly demonstrates the superior performance of the proposed method against the state-of-the-art techniques on both multimodal and unimodal retrieval tasks.

[9] Matrix Completion for Cross-view Pairwise Constraint Propagation Posters 1 / Yang, Zheng / Hu, Yao / Liu, Haifeng / Chen, Huajun / Wu, Zhaohui Proceedings of the 2014 ACM International Conference on Multimedia 2014-11-03 p.897-900
ACM Digital Library Link
Summary: As pairwise constraints are usually easier to access than label information, pairwise constraint propagation attracts more and more attention in semi-supervised learning. Most existing pairwise constraint propagation methods are based on canonical graph propagation model, which heavily depends on the edge weights in the graph and cannot preserve local and global consistency simultaneously. In order to address this drawback, we cast cross-view pairwise constraint propagation into a problem of low rank matrix completion and propose a Matrix Completion method for cross-view Pairwise Constraint Propagation (MCPCP). With low rank requirement and graph regularization, our MCPCP can preserve local and global consistency simultaneously. We develop an algorithm based on alternating direction method of multipliers (ADMM) to solve the optimization problem. Finally, the effectiveness of MCPCP is demonstrated in cross-view multimedia retrieval.

[10] CONR: A Novel Method for Sentiment Word Identification KM Track Posters / Liang, Jiguang / Zhou, Xiaofei / Hu, Yue / Guo, Li / Bai, Shuo Proceedings of the 2014 ACM Conference on Information and Knowledge Management 2014-11-03 p.1943-1946
ACM Digital Library Link
Summary: Sentiment word identification (SWI) is of high relevance to sentiment analysis technologies and applications. Currently most SWI methods heavily rely on sentiment seed words that have limited sentiment information. Even though there emerge non-seed approaches based on sentiment labels of documents, but in which the context information has not been fully considered. In this paper, based on matrix factorization with co-occurrence neighbor regularization which is derived from context, we propose a novel non-seed model called CONR for SWI. Instead of seed words, CONR exploits two important factors: sentiment matching and sentiment consistency for sentiment word identification. Experimental results on four publicly available datasets show that CONR can outperform the state of-the-art methods.

[11] Evaluation of hands-on clinical exam performance using marker-less video tracking Health Care: HC13 -- Methodologies for Evaluating Health-Care Performance / Azari, David P. / Pugh, Carla M. / Laufer, Shlomi / Cohen, Elaine / Kwan, Calvin / Chen, Chia-Hsiung (Eric) / Yen, Thomas Y. / Hu, Yu Hen / Ray, Rebecca D. / Radwin, Robert G. Proceedings of the Human Factors and Ergonomics Society 2014 Annual Meeting 2014-10-27 p.793-797
doi 10.1177/1541931214581145
Link to HFES Digital Content
Summary: This study investigates the potential of using marker-less video tracking for evaluating hands-on clinical skills. Experienced family practitioners attending a national conference were recruited and asked to conduct a breast examination on a simulator that presents different clinical pathologies. Videos were taken of the clinician's hands during the exam. Video processing software for tracking and quantifying hand motion kinematics was used. Videos were divided into two segments: a general search segment and a mass exploration segment. The general exploration segments exhibited motion patterns which included 72% faster movement and 73% higher acceleration across clinical pathologies. The most complex pathology exhibited 14% greater displacement for pressing/rubbing than for general exploration. Marker-less video kinematic tracking shows promise in discriminating between different examination procedures, clinicians, and pathologies.

[12] An Equation for Estimating Hand Activity Level Based on Measured Hand Speed and Duty Cycle Occupational Ergonomics (formerly Industrial Ergonomics): OE4 -- Ergonomics Exposure Assessment / Akas, Oguz / Azari, David / Chen, Chia-Hsiung (Eric) / Hu, Yu Hen / Armstrong, Thomas J. / Ulin, Sheryl S. / Radwin, Robert G. Proceedings of the Human Factors and Ergonomics Society 2014 Annual Meeting 2014-10-27 p.1600-1604
doi 10.1177/1541931214581333
Link to HFES Digital Content
Summary: We are developing video processing algorithms for automatically measuring the ACGIH TLV® hand activity level (HAL) using marker-less tracking of hand movements. An equation for computing HAL ratings directly from tracked kinematics, rather than using a frequency-duty cycle (DC) look-up table, more readily lends itself to automated processing. Videos from the 33 Latko et al. (1997) jobs were digitized and analyzed using marker-less tracking, and hand root mean square (RMS) speed (S) was measured. A linear regression model was developed for predicting the average observer rated HAL based on the measured hand RMS speed and DC. Since the videos did not contain distance calibration, speed was quantified in pixels/s and normalized by the distance of each worker's hand breadth, measured in pixels. A Monte Carlo simulation was performed using the US Army (1991) hand anthropometry data to determine how variation is introduced in the equation as hand breadth varies. The resulting equation was HAL= -1.06 + 0.0047 S + 0.053 DC and it predicted HAL ratings within ±1. The development of an accurate equation for estimating HAL ratings should enable use of automated and objective measurement in practice. While expert observer HAL ratings offer speed and efficiency, use of objective measurements based on worker hand kinematics should provide greater reliability, as well as offering specific engineering aspects of the job that may be addressed for reducing exposures and the risk of musculoskeletal disorders. Furthermore, automated videos analysis may help improve the speed and efficiency of making objective measurements in practice.

[13] Developing early warning systems to predict students' online learning performance / Hu, Ya-Han / Lo, Chia-Lun / Shih, Sheng-Pao Computers in Human Behavior 2014-07 v.36 n.0 p.469-478
Keywords: Learning management system
Keywords: e-Learning
Keywords: Early warning system
Keywords: Data-mining
Keywords: Learning performance prediction
Link to Article at sciencedirect
Summary: An early warning system can help to identify at-risk students, or predict student learning performance by analyzing learning portfolios recorded in a learning management system (LMS). Although previous studies have shown the applicability of determining learner behaviors from an LMS, most investigated datasets are not assembled from online learning courses or from whole learning activities undertaken on courses that can be analyzed to evaluate students' academic achievement. Previous studies generally focus on the construction of predictors for learner performance evaluation after a course has ended, and neglect the practical value of an "early warning" system to predict at-risk students while a course is in progress. We collected the complete learning activities of an online undergraduate course and applied data-mining techniques to develop an early warning system. Our results showed that, time-dependent variables extracted from LMS are critical factors for online learning. After students have used an LMS for a period of time, our early warning system effectively characterizes their current learning performance. Data-mining techniques are useful in the construction of early warning systems; based on our experimental results, classification and regression tree (CART), supplemented by AdaBoost is the best classifier for the evaluation of learning performance investigated by this study.

[14] How to Improve Your Search Engine Ranking: Myths and Reality / Su, Ao-Jan / Hu, Y. Charlie / Kuzmanovic, Aleksandar / Koh, Cheng-Kok ACM Transactions on The Web 2014-03 v.8 n.2 p.8
ACM Digital Library Link
Summary: Search engines have greatly influenced the way people access information on the Internet, as such engines provide the preferred entry point to billions of pages on the Web. Therefore, highly ranked Web pages generally have higher visibility to people and pushing the ranking higher has become the top priority for Web masters. As a matter of fact, Search Engine Optimization (SEO) has became a sizeable business that attempts to improve their clients' ranking. Still, the lack of ways to validate SEO's methods has created numerous myths and fallacies associated with ranking algorithms.
    In this article, we focus on two ranking algorithms, Google's and Bing's, and design, implement, and evaluate a ranking system to systematically validate assumptions others have made about these popular ranking algorithms. We demonstrate that linear learning models, coupled with a recursive partitioning ranking scheme, are capable of predicting ranking results with high accuracy. As an example, we manage to correctly predict 7 out of the top 10 pages for 78% of evaluated keywords. Moreover, for content-only ranking, our system can correctly predict 9 or more pages out of the top 10 ones for 77% of search terms. We show how our ranking system can be used to reveal the relative importance of ranking features in a search engine's ranking function, provide guidelines for SEOs and Web masters to optimize their Web pages, validate or disprove new ranking features, and evaluate search engine ranking results for possible ranking bias.

[15] EDITED BOOK Handbook of Human Centric Visualization / Huang, Weidong 2014 n.29 p.743 Springer New York
DOI: 10.1007/978-1-4614-7485-2
ISBN: 978-1-4614-7484-5 (print), 978-1-4614-7485-2 (online)
Online Access
== Part I: Visual Communication ==
Visualizing Thought (3-40)
	+ Tversky, Barbara
Gryphon: A 'Little' Domain-Specific Programming Language for Diffusion MRI Visualizations (41-61)
	+ Chen, Jian
	+ Cai, Haipeng
	+ Auchus, Alexander P.
	+ Laidlaw, David H.
Viewing Abstract Data as Maps (63-89)
	+ Gansner, Emden R.
	+ Hu, Yifan
	+ Kobourov, Stephen G.
== Part II: Theory and Science ==
Individual Differences and Translational Science in the Design of Human-Centered Visualizations (93-113)
	+ Green, Tera Marie
	+ Arias-Hernandez, Richard
	+ Fisher, Brian
Evaluating Visualization Environments: Cognitive, Social, and Cultural Perspectives (115-145)
	+ Hundhausen, Christopher D.
On the Prospects for a Science of Visualization (147-175)
	+ Rensink, Ronald A.
== Part III: Principles, Guidelines and Recommendations ==
Toward a Better Understanding and Application of the Principles of Visual Communication (179-201)
	+ Bae, Juhee
	+ Watson, Benjamin
Pep Up Your Time Machine: Recommendations for the Design of Information Visualizations of Time-Dependent Data (203-225)
	+ Kriglstein, Simone
	+ Pohl, Margit
	+ Smuc, Michael
Using Textbook Illustrations to Extract Design Principles for Algorithm Visualizations (227-249)
	+ Velázquez-Iturbide, J. Ángel
== Part IV: Methods ==
Conceptual Design for Sensemaking (253-283)
	+ Blandford, Ann
	+ Faisal, Sarah
	+ Attfield, Simon
An Introduction and Guide to Evaluation of Visualization Techniques Through User Studies (285-313)
	+ Forsell, Camilla
	+ Cooper, Matthew
User-Centered Evaluation of Information Visualization Techniques: Making the HCI-InfoVis Connection Explicit (315-336)
	+ Freitas, Carla M. D. S.
	+ Pimenta, Marcelo S.
	+ Scapin, Dominique L.
Eye Tracking on Visualizations: Progressive Extraction of Scanning Strategies (337-372)
	+ Goldberg, Joseph H.
	+ Helfman, Jonathan I.
Evaluating Overall Quality of Graph Visualizations Indirectly and Directly (373-390)
	+ Huang, Weidong
Visual Analysis of Eye Tracking Data (391-409)
	+ Raschke, Michael
	+ Blascheck, Tanja
	+ Burch, Michael
User Studies in Visualization: A Reflection on Methods (411-426)
	+ Tory, Melanie
== Part V: Perception and Cognition ==
On the Benefits and Drawbacks of Radial Diagrams (429-451)
	+ Burch, Michael
	+ Weiskopf, Daniel
Measuring Memories for Objects and Their Locations in Immersive Virtual Environments: The Subjective Component of Memorial Experience (453-471)
	+ Coxon, Matthew
	+ Mania, Katerina
Human-Centric Chronographics: Making Historical Time Memorable (473-511)
	+ Korallo, Liliya
	+ Davis, Stephen Boyd
	+ Foreman, Nigel
	+ Moar, Magnus
Visualizing Multiple Levels and Dimensions of Social Network Properties (513-525)
	+ McGrath, Cathleen
	+ Blythe, Jim
	+ Krackhardt, David
== Part VI: Dynamic Visualization ==
Adaptive Diagrams: A Research Agenda to Explore How Learners Can Manipulate Online Diagrams to Self-Manage Cognitive Load (529-550)
	+ Agostinho, Shirley
	+ Tindall-Ford, Sharon
	+ Bokosmaty, Sahar
Dynamic Visualisations and Motor Skills (551-580)
	+ Castro-Alonso, Juan Cristobal
	+ Ayres, Paul
	+ Paas, Fred
Dynamic Visualizations: A Two-Edged Sword?  (581-604)
	+ Lowe, Richard K.
Simultaneous and Sequential Presentation of Realistic and Schematic Instructional Dynamic Visualizations (605-622)
	+ Nugteren, Michelle L.
	+ Tabbers, Huib K.
	+ Scheiter, Katharina
	+ Paas, Fred
How Do You Connect Moving Dots? Insights from User Studies on Dynamic Network Visualizations (623-650)
	+ Smuc, Michael
	+ Federico, Paolo
	+ Windhager, Florian
	+ Aigner, Wolfgang
	+ Zenk, Lukas
	+ Miksch, Silvia
== Part VII: Interaction ==
Interaction Taxonomy for Tracking of User Actions in Visual Analytics Applications (653-670)
	+ von Landesberger, Tatiana
	+ Fiebig, Sebastian
	+ Bremm, Sebastian
	+ Kuijper, Arjan
	+ Fellner, Dieter W.
Common Visualizations: Their Cognitive Utility (671-691)
	+ Parsons, Paul
	+ Sedig, Kamran
Distribution of Information Processing While Performing Complex Cognitive Activities with Visualization Tools (693-715)
	+ Parsons, Paul
	+ Sedig, Kamran
Human-Centered Interactivity of Visualization Tools: Micro- and Macro-level Considerations (717-743)
	+ Sedig, Kamran
	+ Parsons, Paul
	+ Dittmer, Mark
	+ Haworth, Robert

[16] Efficient reconstruction, decomposition and editing for spatially-varying reflectance data Image-based rendering and tracking / Hu, Yong / Wang, Shan / Qi, Yue Proceedings of the 2013 Conference on Graphics Interface 2013-05-29 p.55-62
ACM Digital Library Link
Summary: We present a new method for modeling real-world surface reflectance, described with non-parametric spatially-varying bidirectional reflectance distribution functions (SVBRDF). Our method seeks to achieve high reconstruction accuracy, compactness and "editability" of representation meanwhile speeding up the SVBRDF modeling processes. For a planar surface, we 1) design a capturing device to acquire reflectance samples at dense surface locations; 2) propose a Laplacian-based angular interpolation scheme for a 2D slice of BRDF at a given surface location, and then a Kernel Nyström method for SVBRDF data matrix reconstruction; 3) propose a practical algorithm to extract linear-independent basis BRDFs, and to calculate blending weights through projecting reconstructed reflectance onto these bases. Results demonstrate that our approach models real-world reflectance with both high accuracy and high visual fidelity for real-time virtual environment rendering.

[17] Whoo.ly: facilitating information seeking for hyperlocal communities using social media Papers: tensions in social media / Hu, Yuheng / Farnham, Shelly D. / Monroy-Hernández, Andrés Proceedings of ACM CHI 2013 Conference on Human Factors in Computing Systems 2013-04-27 v.1 p.3481-3490
ACM Digital Library Link
Summary: Social media systems promise powerful opportunities for people to connect to timely, relevant information at the hyper local level. Yet, finding the meaningful signal in noisy social media streams can be quite daunting to users. In this paper, we present and evaluate Whoo.ly, a web service that provides neighborhood-specific information based on Twitter posts that were automatically inferred to be hyperlocal. Whoo.ly automatically extracts and summarizes hyperlocal information about events, topics, people, and places from these Twitter posts. We provide an overview of our design goals with Whoo.ly and describe the system including the user interface and our unique event detection and summarization algorithms. We tested the usefulness of the system as a tool for finding neighborhood information through a comprehensive user study. The outcome demonstrated that most participants found Whoo.ly easier to use than Twitter and they would prefer it as a tool for exploring their neighborhoods.

[18] A novel local patch framework for fixing supervised learning models KM track: classification and semantic methods / Wang, Yilei / Wei, Bingzheng / Yan, Jun / Hu, Yang / Deng, Zhi-Hong / Chen, Zheng Proceedings of the 2012 ACM Conference on Information and Knowledge Management 2012-10-29 p.1233-1242
ACM Digital Library Link
Summary: In the past decades, machine learning models, especially supervised learning algorithms, have been widely used in various real world applications. However, no matter how strong a learning model is, it will suffer from the prediction errors when it is applied to real world problems. Due to the black box nature of supervised learning models, it is a challenging problem to fix the supervised learning models by further learning from the failure cases it generates. In this paper, we propose a novel Local Patch Framework (LPF) to locally fix supervised learning models by learning from its predicted failure cases. Since the learning models are generally globally optimized during training process, our proposed LPF assumes that most of the learning errors are led by local errors in the model. Thus we aim to break the black boxes of learning models by identifying and fixing the local errors of various models automatically. The proposed LPF has two key steps, which are local error region subspace learning and local patch model learning. Through this way, we aim to fix the errors of learning models locally and automatically with certain generalization ability on unseen testing data. Experiments on both classification and ranking problems show that the proposed LPF is effective and outperforms the original algorithms and the incremental learning model.

[19] Personalized document clustering with dual supervision Search and sensemaking / Hu, Yeming / Milios, Evangelos E. / Blustein, James / Liu, Shali Proceedings of the 2012 ACM Symposium on Document Engineering 2012-09-04 p.161-170
ACM Digital Library Link
Summary: The potential for semi-supervised techniques to produce personalized clusters has not been explored. This is due to the fact that semi-supervised clustering algorithms used to be evaluated using oracles based on underlying class labels. Although using oracles allows clustering algorithms to be evaluated quickly and without labor intensive labeling, it has the key disadvantage that oracles always give the same answer for an assignment of a document or a feature. However, different human users might give different assignments of the same document and/or feature because of different but equally valid points of view. In this paper, we conduct a user study in which we ask participants (users) to group the same document collection into clusters according to their own understanding, which are then used to evaluate semi-supervised clustering algorithms for user personalization. Through our user study, we observe that different users have their own personalized organizations of the same collection and a user's organization changes over time. Therefore, we propose that document clustering algorithms should be able to incorporate user input and produce personalized clusters based on the user input. We also confirm that semi-supervised algorithms with noisy user input can still produce better organizations matching user's expectation (personalization) than traditional unsupervised ones. Finally, we demonstrate that labeling keywords for clusters at the same time as labeling documents can improve clustering performance further compared to labeling only documents with respect to user personalization.

[20] Mining query subtopics from search log data User intent / Hu, Yunhua / Qian, Yanan / Li, Hang / Jiang, Daxin / Pei, Jian / Zheng, Qinghua Proceedings of the 35th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2012-08-12 p.305-314
ACM Digital Library Link
Summary: Most queries in web search are ambiguous and multifaceted. Identifying the major senses and facets of queries from search log data, referred to as query subtopic mining in this paper, is a very important issue in web search. Through search log analysis, we show that there are two interesting phenomena of user behavior that can be leveraged to identify query subtopics, referred to as 'one subtopic per search' and 'subtopic clarification by keyword'. One subtopic per search means that if a user clicks multiple URLs in one query, then the clicked URLs tend to represent the same sense or facet. Subtopic clarification by keyword means that users often add an additional keyword or keywords to expand the query in order to clarify their search intent. Thus, the keywords tend to be indicative of the sense or facet. We propose a clustering algorithm that can effectively leverage the two phenomena to automatically mine the major subtopics of queries, where each subtopic is represented by a cluster containing a number of URLs and keywords. The mined subtopics of queries can be used in multiple tasks in web search and we evaluate them in aspects of the search result presentation such as clustering and re-ranking. We demonstrate that our clustering algorithm can effectively mine query subtopics with an F1 measure in the range of 0.896-0.956. Our experimental results show that the use of the subtopics mined by our approach can significantly improve the state-of-the-art methods used for search result clustering. Experimental results based on click data also show that the re-ranking of search result based on our method can significantly improve the efficiency of users' ability to find information.

[21] Genre classification for million song dataset using confidence-based classifiers combination Poster abstracts / Hu, Yajie / Ogihara, Mitsunori Proceedings of the 35th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2012-08-12 p.1083-1084
ACM Digital Library Link
Summary: We proposed a method to classify songs in the Million Song Dataset according to song genre. Since songs have several data types, we trained sub-classifiers by different types of data. These sub-classifiers are combined using both classifier authority and classification confidence for a particular instance. In the experiments, the combined classifier surpasses all of these sub-classifiers and the SVM classifier using concatenated vectors from all data types. Finally, the genre labels for the Million Song Dataset are provided.

[22] Combining machine learning and human judgment in author disambiguation Science, the past, and the future / Qian, Yanan / Hu, Yunhua / Cui, Jianling / Zheng, Qinghua / Nie, Zaiqing Proceedings of the 2011 ACM Conference on Information and Knowledge Management 2011-10-24 p.1241-1246
ACM Digital Library Link
Summary: Author disambiguation in digital libraries becomes increasingly difficult as the number of publications and consequently the number of ambiguous author names keep growing. The fully automatic author disambiguation approach could not give satisfactory results due to the lack of signals in many cases. Furthermore, human judgment on the basis of automatic algorithms is also not suitable because the automatically disambiguated results are often mixed and not understandable for humans. In this paper, we propose a Labeling Oriented Author Disambiguation approach, called LOAD, to combine machine learning and human judgment together in author disambiguation. LOAD exploits a framework which consists of high precision clustering, high recall clustering, and top dissimilar clusters selection and ranking. In the framework, supervised learning algorithms are used to train the similarity functions between publications and a clustering algorithm is further applied to generate clusters. To validate the effectiveness and efficiency of the proposed LOAD approach, comprehensive experiments are conducted. Comparing to conventional author disambiguation algorithms, the LOAD yields much more accurate results to assist human labeling. Further experiments show that the LOAD approach can save labeling time dramatically.

[23] Semantics-Enabled Policies for Information Sharing and Protection in the Cloud / Hu, Yuh-Jong / Wu, Win-Nan / Yang, Jiun-Jan Proceedings of the 2011 International Conference on Social Informatics 2011-10-06 p.198-211
Keywords: semantics-enabled policies; information sharing; data protection; national security; cloud computing; privacy for social network cloud
Link to Digital Content at Springer
Summary: The cloud computing platform provides utility computing allowing people to have convenient and flexible information sharing services on the web. We investigate the inter-disciplinary area of information technology and law and use semantics-enabled policies for modeling legal regulations in the cloud. The semantics-enabled policies of information sharing and protection are represented as a combination of ontologies and rules to capture the concept of security and privacy laws. Ontologies are abstract knowledge representations of information sharing and protection which extracted manually from the data sharing and protection laws. Rules provide further enforcement power after ontologies have been constructed. The emerging challenges of legalizing semantics-enabled policies for laws in the cloud include mitigating the gap between semantics-enabled policy and laws to avoid any ambiguity in the policy representation, and resolving possible conflicts among policies when they are required to integrate the laws from multiple jurisdictions.

[24] Modeling spatially-varying reflectance based on Kernel Nyström Poster session 1 / Hu, Yong / Qi, Yue / Shen, Fangyang Proceedings of the 2010 ACM Symposium on Virtual Reality Software and Technology 2010-11-22 p.91-92
ACM Digital Library Link
Summary: We present a new method for modeling real-world surface reflectance, described with non-parametric spatially-varying bidirectional reflectance distribution functions (SVBRDF). Our method seeks to achieve high reconstruction accuracy, compactness and "editability" of representation meanwhile speeding up both the SVBRDF capturing and modeling processes. For a planar surface, we 1) design a fast capturing device to acquire reflectance samples at dense surface locations; 2) propose a Laplacian-based angular interpolation scheme for a 2D slice of BRDF at a given surface location, and then a Kernel Nyström method for SVBRDF data matrix reconstruction; 3) propose a practical algorithm to extract linear-independent basis BRDFs, and to calculate blending weights through projecting reconstructed reflectance onto these bases. Results demonstrate that our approach models real-world reflectance with both high accuracy and high visual fidelity for real-time virtual environment rendering.

[25] Emotion detection via discriminative kernel method Cognitive systems and machine learning for assistive environments / Wang, Hua / Huang, Heng / Hu, Yanzi / Anderson, Mindi / Rollins, Pamela / Makedon, Fillia Proceedings of the 3rd International Conference on PErvasive Technologies Related to Assistive Environments 2010-06-23 p.7
Keywords: discriminative learning, emotion detection, facial expression recognition, facial feature, kernel
ACM Digital Library Link
Summary: Human emotion detection is of substantial importance in diverse pervasive applications in assistive environments. Because facial expressions provide a key mechanism for understanding and conveying emotion, automatic emotion detection through facial expression recognition has attracted increased attention in both scientific research and practical applications in recent years. Traditional facial expression recognition methods normally use only one type of facial expression data, either static data extracted from one single face image or motion dependent data obtained from dynamic face image sequences, but seldom employ both. In this work, we propose a novel Discriminative Kernel Facial Emotion Recognition (DKFER) method to integrate these two types of facial expression data using a hybrid kernel, such that the advantages of both of them are exploited. In addition, by using Linear Discriminant Analysis (LDA) to transform the two types of original facial expression data into two more discriminative lower-dimensional subspaces, the succeeding classification for emotion detection can be carried out in a more efficient and effective way. Encouraging experimental results in empirical studies demonstrate the practical usage of the proposed DKFER method for emotion detection.
<<First <Previous Permalink Next> Last>> Records: 1 to 25 of 38 Jump to: 2016 | 15 | 14 | 13 | 12 | 11 | 10 | 09 | 08 | 07 | 06 | 05 | 03 |