Automatic classification of users’ health information need context : logistic regression analysis of mouse-click and eye-tracker data

Background: Users searching for health information on the Internet may be searching for their own health issue, searching for someone else’s health issue, or browsing with no particular health issue in mind. Previous research has found that these three categories of users focus on different types of...

Full description

Saved in:
Bibliographic Details
Main Authors: Pian, Wenjing, Khoo, Christopher Soo Guan, Chi, Jianxing
Other Authors: Wee Kim Wee School of Communication and Information
Format: Article
Language:English
Published: 2018
Subjects:
Online Access:https://hdl.handle.net/10356/88379
http://hdl.handle.net/10220/45739
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-88379
record_format dspace
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Library and information science::Libraries
Information-seeking Behavior
Social Media
spellingShingle DRNTU::Library and information science::Libraries
Information-seeking Behavior
Social Media
Pian, Wenjing
Khoo, Christopher Soo Guan
Chi, Jianxing
Automatic classification of users’ health information need context : logistic regression analysis of mouse-click and eye-tracker data
description Background: Users searching for health information on the Internet may be searching for their own health issue, searching for someone else’s health issue, or browsing with no particular health issue in mind. Previous research has found that these three categories of users focus on different types of health information. However, most health information websites provide static content for all users. If the three types of user health information need contexts can be identified by the Web application, the search results or information offered to the user can be customized to increase its relevance or usefulness to the user. Objective: The aim of this study was to investigate the possibility of identifying the three user health information contexts (searching for self, searching for others, or browsing with no particular health issue in mind) using just hyperlink clicking behavior; using eye-tracking information; and using a combination of eye-tracking, demographic, and urgency information. Predictive models are developed using multinomial logistic regression. Methods: A total of 74 participants (39 females and 35 males) who were mainly staff and students of a university were asked to browse a health discussion forum, Healthboards.com. An eye tracker recorded their examining (eye fixation) and skimming (quick eye movement) behaviors on 2 types of screens: summary result screen displaying a list of post headers, and detailed post screen. The following three types of predictive models were developed using logistic regression analysis: model 1 used only the time spent in scanning the summary result screen and reading the detailed post screen, which can be determined from the user’s mouse clicks; model 2 used the examining and skimming durations on each screen, recorded by an eye tracker; and model 3 added user demographic and urgency information to model 2. Results: An analysis of variance (ANOVA) analysis found that users’ browsing durations were significantly different for the three health information contexts (P<.001). The logistic regression model 3 was able to predict the user’s type of health information context with a 10-fold cross validation mean accuracy of 84% (62/74), followed by model 2 at 73% (54/74) and model 1 at 71% (52/78). In addition, correlation analysis found that particular browsing durations were highly correlated with users’ age, education level, and the urgency of their information need. Conclusions: A user’s type of health information need context (ie, searching for self, for others, or with no health issue in mind) can be identified with reasonable accuracy using just user mouse clicks that can easily be detected by Web applications. Higher accuracy can be obtained using Google glass or future computing devices with eye tracking function.
author2 Wee Kim Wee School of Communication and Information
author_facet Wee Kim Wee School of Communication and Information
Pian, Wenjing
Khoo, Christopher Soo Guan
Chi, Jianxing
format Article
author Pian, Wenjing
Khoo, Christopher Soo Guan
Chi, Jianxing
author_sort Pian, Wenjing
title Automatic classification of users’ health information need context : logistic regression analysis of mouse-click and eye-tracker data
title_short Automatic classification of users’ health information need context : logistic regression analysis of mouse-click and eye-tracker data
title_full Automatic classification of users’ health information need context : logistic regression analysis of mouse-click and eye-tracker data
title_fullStr Automatic classification of users’ health information need context : logistic regression analysis of mouse-click and eye-tracker data
title_full_unstemmed Automatic classification of users’ health information need context : logistic regression analysis of mouse-click and eye-tracker data
title_sort automatic classification of users’ health information need context : logistic regression analysis of mouse-click and eye-tracker data
publishDate 2018
url https://hdl.handle.net/10356/88379
http://hdl.handle.net/10220/45739
_version_ 1681046965624766464
spelling sg-ntu-dr.10356-883792020-03-07T12:15:50Z Automatic classification of users’ health information need context : logistic regression analysis of mouse-click and eye-tracker data Pian, Wenjing Khoo, Christopher Soo Guan Chi, Jianxing Wee Kim Wee School of Communication and Information DRNTU::Library and information science::Libraries Information-seeking Behavior Social Media Background: Users searching for health information on the Internet may be searching for their own health issue, searching for someone else’s health issue, or browsing with no particular health issue in mind. Previous research has found that these three categories of users focus on different types of health information. However, most health information websites provide static content for all users. If the three types of user health information need contexts can be identified by the Web application, the search results or information offered to the user can be customized to increase its relevance or usefulness to the user. Objective: The aim of this study was to investigate the possibility of identifying the three user health information contexts (searching for self, searching for others, or browsing with no particular health issue in mind) using just hyperlink clicking behavior; using eye-tracking information; and using a combination of eye-tracking, demographic, and urgency information. Predictive models are developed using multinomial logistic regression. Methods: A total of 74 participants (39 females and 35 males) who were mainly staff and students of a university were asked to browse a health discussion forum, Healthboards.com. An eye tracker recorded their examining (eye fixation) and skimming (quick eye movement) behaviors on 2 types of screens: summary result screen displaying a list of post headers, and detailed post screen. The following three types of predictive models were developed using logistic regression analysis: model 1 used only the time spent in scanning the summary result screen and reading the detailed post screen, which can be determined from the user’s mouse clicks; model 2 used the examining and skimming durations on each screen, recorded by an eye tracker; and model 3 added user demographic and urgency information to model 2. Results: An analysis of variance (ANOVA) analysis found that users’ browsing durations were significantly different for the three health information contexts (P<.001). The logistic regression model 3 was able to predict the user’s type of health information context with a 10-fold cross validation mean accuracy of 84% (62/74), followed by model 2 at 73% (54/74) and model 1 at 71% (52/78). In addition, correlation analysis found that particular browsing durations were highly correlated with users’ age, education level, and the urgency of their information need. Conclusions: A user’s type of health information need context (ie, searching for self, for others, or with no health issue in mind) can be identified with reasonable accuracy using just user mouse clicks that can easily be detected by Web applications. Higher accuracy can be obtained using Google glass or future computing devices with eye tracking function. Published version 2018-08-29T08:11:05Z 2019-12-06T17:01:58Z 2018-08-29T08:11:05Z 2019-12-06T17:01:58Z 2017 Journal Article Pian, W., Khoo, C. S. G., & Chi, J. (2017). Automatic classification of users’ health information need context : logistic regression analysis of mouse-click and eye-tracker data. Journal of Medical Internet Research, 19(12), e424-. doi:10.2196/jmir.8354 1439-4456 https://hdl.handle.net/10356/88379 http://hdl.handle.net/10220/45739 10.2196/jmir.8354 en Journal of Medical Internet Research © 2017 Wenjing Pian, Christopher SG Khoo, Jianxing Chi. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 21.12.2017. This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. 16 p. application/pdf