ResumeNet : a learning-based framework for automatic resume quality assessment

Recruitment of appropriate people for certain positions is critical for any companies or organizations. Manually screening to select appropriate candidates from large amounts of resumes can be exhausted and time-consuming. However, there is no public tool that can be directly used for automatic resu...

Full description

Saved in:
Bibliographic Details
Main Authors: Luo, Yong, Zhang, Huaizheng, Wang, Yongjie, Wen, Yonggang, Zhang, Xinwen
Other Authors: School of Computer Science and Engineering
Format: Conference or Workshop Item
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/143040
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-143040
record_format dspace
spelling sg-ntu-dr.10356-1430402020-07-22T07:48:25Z ResumeNet : a learning-based framework for automatic resume quality assessment Luo, Yong Zhang, Huaizheng Wang, Yongjie Wen, Yonggang Zhang, Xinwen School of Computer Science and Engineering 2018 IEEE International Conference on Data Mining (ICDM) Engineering::Computer science and engineering Resume Quality Assessment Dataset and Features Recruitment of appropriate people for certain positions is critical for any companies or organizations. Manually screening to select appropriate candidates from large amounts of resumes can be exhausted and time-consuming. However, there is no public tool that can be directly used for automatic resume quality assessment (RQA). This motivates us to develop a method for automatic RQA. Since there is also no public dataset for model training and evaluation, we build a dataset for RQA by collecting around 10K resumes, which are provided by a private resume management company. By investigating the dataset, we identify some factors or features that could be useful to discriminate good resumes from bad ones, e.g., the consistency between different parts of a resume. Then a neural-network model is designed to predict the quality of each resume, where some text processing techniques are incorporated. To deal with the label deficiency issue in the dataset, we propose several variants of the model by either utilizing the pair/triplet-based loss, or introducing some semi-supervised learning technique to make use of the abundant unlabeled data. Both the presented baseline model and its variants are general and easy to implement. Various popular criteria including the receiver operating characteristic (ROC) curve, F-measure and ranking-based average precision (AP) are adopted for model evaluation. We compare the different variants with our baseline model. Since there is no public algorithm for RQA, we further compare our results with those obtained from a website that can score a resume. Experimental results in terms of different criteria demonstrate effectiveness of the proposed method. We foresee that our approach would transform the way of future human resources management. National Research Foundation (NRF) Accepted version This work is supported by Singapore NRF2015ENCGDCR01001-003, administrated via IMDA and NRF2015ENCGBICRD001-012, administrated via BCA. 2020-07-22T07:48:25Z 2020-07-22T07:48:25Z 2018 Conference Paper Luo, Y., Zhang, H., Wang, Y., Wen, Y., & Zhang, X. (2018). ResumeNet : a learning-based framework for automatic resume quality assessment. Proceedings of 2018 IEEE International Conference on Data Mining (ICDM), 307-316. doi:10.1109/icdm.2018.00046 978-1-5386-9160-1 https://hdl.handle.net/10356/143040 10.1109/icdm.2018.00046 2-s2.0-85061375946 307 316 en © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/icdm.2018.00046 application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Resume Quality Assessment
Dataset and Features
spellingShingle Engineering::Computer science and engineering
Resume Quality Assessment
Dataset and Features
Luo, Yong
Zhang, Huaizheng
Wang, Yongjie
Wen, Yonggang
Zhang, Xinwen
ResumeNet : a learning-based framework for automatic resume quality assessment
description Recruitment of appropriate people for certain positions is critical for any companies or organizations. Manually screening to select appropriate candidates from large amounts of resumes can be exhausted and time-consuming. However, there is no public tool that can be directly used for automatic resume quality assessment (RQA). This motivates us to develop a method for automatic RQA. Since there is also no public dataset for model training and evaluation, we build a dataset for RQA by collecting around 10K resumes, which are provided by a private resume management company. By investigating the dataset, we identify some factors or features that could be useful to discriminate good resumes from bad ones, e.g., the consistency between different parts of a resume. Then a neural-network model is designed to predict the quality of each resume, where some text processing techniques are incorporated. To deal with the label deficiency issue in the dataset, we propose several variants of the model by either utilizing the pair/triplet-based loss, or introducing some semi-supervised learning technique to make use of the abundant unlabeled data. Both the presented baseline model and its variants are general and easy to implement. Various popular criteria including the receiver operating characteristic (ROC) curve, F-measure and ranking-based average precision (AP) are adopted for model evaluation. We compare the different variants with our baseline model. Since there is no public algorithm for RQA, we further compare our results with those obtained from a website that can score a resume. Experimental results in terms of different criteria demonstrate effectiveness of the proposed method. We foresee that our approach would transform the way of future human resources management.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Luo, Yong
Zhang, Huaizheng
Wang, Yongjie
Wen, Yonggang
Zhang, Xinwen
format Conference or Workshop Item
author Luo, Yong
Zhang, Huaizheng
Wang, Yongjie
Wen, Yonggang
Zhang, Xinwen
author_sort Luo, Yong
title ResumeNet : a learning-based framework for automatic resume quality assessment
title_short ResumeNet : a learning-based framework for automatic resume quality assessment
title_full ResumeNet : a learning-based framework for automatic resume quality assessment
title_fullStr ResumeNet : a learning-based framework for automatic resume quality assessment
title_full_unstemmed ResumeNet : a learning-based framework for automatic resume quality assessment
title_sort resumenet : a learning-based framework for automatic resume quality assessment
publishDate 2020
url https://hdl.handle.net/10356/143040
_version_ 1681056667180990464