On aggregating salaries of occupations from job post and review data

The popularity of job websites has significantly changed the way people learn about different occupations. Among the insights offered by these websites are the statistics of occupation salaries which are useful information for job seekers, career coaches, graduating students, and labor related gover...

Full description

Saved in:
Bibliographic Details
Main Authors: HUNG, Chih-Chieh, LIM, Ee-peng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2021
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6636
https://ink.library.smu.edu.sg/context/sis_research/article/7639/viewcontent/09380460.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7639
record_format dspace
spelling sg-smu-ink.sis_research-76392022-01-14T03:35:23Z On aggregating salaries of occupations from job post and review data HUNG, Chih-Chieh LIM, Ee-peng The popularity of job websites has significantly changed the way people learn about different occupations. Among the insights offered by these websites are the statistics of occupation salaries which are useful information for job seekers, career coaches, graduating students, and labor related government agencies. Such statistics include the distribution of job salaries of each occupation, such as average or quantiles. However, significant variability in salary (and review salary) can be found among jobs of the same occupation as we gather job post and review data from job websites. Such variability shows the existence of biases, including salary competitiveness in job posts and salary inflation in job reviews. Based on the observation, we aim at developing an approach to derive occupation salary for a job market, named unbiased salary, by aggregating offer salaries from job posts and review salaries from review data and at the same time removing their biases. To achieve this goal, we proposed COC-model to learn unbiased salaries of occupations, competitiveness of companies and inflation of companies efficiently. COC here is an abbreviation of ‘‘Company, Occupation, Company’’, which represents two different connections between companies and occupations from job posting site and job review site. COC-model represents the dependency of salary information between companies and occupations in job post data and job review data. It begins with defining three latent variables, say competitiveness, inflation, and unbiased salary, based on their dependencies. Instead of computing these variables iteratively, we formulate the interaction among these three latent variables into a matrix form so that these values could be then efficiently learned in a unified way by a series of matrix operations. Extensive experiments are conducted, including empirical studies about competitiveness and inflation of companies using real dataset and performance testing by synthetic dataset. The experimental results show that COC-model can not only derive unbiased salaries effectively but also help us to understand latent biases in job post and job review data. 2021-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6636 info:doi/10.1109/ACCESS.2021.3066204 https://ink.library.smu.edu.sg/context/sis_research/article/7639/viewcontent/09380460.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Salary aggregation unbiased occupation salary bias modeling Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Salary aggregation
unbiased occupation salary
bias modeling
Databases and Information Systems
spellingShingle Salary aggregation
unbiased occupation salary
bias modeling
Databases and Information Systems
HUNG, Chih-Chieh
LIM, Ee-peng
On aggregating salaries of occupations from job post and review data
description The popularity of job websites has significantly changed the way people learn about different occupations. Among the insights offered by these websites are the statistics of occupation salaries which are useful information for job seekers, career coaches, graduating students, and labor related government agencies. Such statistics include the distribution of job salaries of each occupation, such as average or quantiles. However, significant variability in salary (and review salary) can be found among jobs of the same occupation as we gather job post and review data from job websites. Such variability shows the existence of biases, including salary competitiveness in job posts and salary inflation in job reviews. Based on the observation, we aim at developing an approach to derive occupation salary for a job market, named unbiased salary, by aggregating offer salaries from job posts and review salaries from review data and at the same time removing their biases. To achieve this goal, we proposed COC-model to learn unbiased salaries of occupations, competitiveness of companies and inflation of companies efficiently. COC here is an abbreviation of ‘‘Company, Occupation, Company’’, which represents two different connections between companies and occupations from job posting site and job review site. COC-model represents the dependency of salary information between companies and occupations in job post data and job review data. It begins with defining three latent variables, say competitiveness, inflation, and unbiased salary, based on their dependencies. Instead of computing these variables iteratively, we formulate the interaction among these three latent variables into a matrix form so that these values could be then efficiently learned in a unified way by a series of matrix operations. Extensive experiments are conducted, including empirical studies about competitiveness and inflation of companies using real dataset and performance testing by synthetic dataset. The experimental results show that COC-model can not only derive unbiased salaries effectively but also help us to understand latent biases in job post and job review data.
format text
author HUNG, Chih-Chieh
LIM, Ee-peng
author_facet HUNG, Chih-Chieh
LIM, Ee-peng
author_sort HUNG, Chih-Chieh
title On aggregating salaries of occupations from job post and review data
title_short On aggregating salaries of occupations from job post and review data
title_full On aggregating salaries of occupations from job post and review data
title_fullStr On aggregating salaries of occupations from job post and review data
title_full_unstemmed On aggregating salaries of occupations from job post and review data
title_sort on aggregating salaries of occupations from job post and review data
publisher Institutional Knowledge at Singapore Management University
publishDate 2021
url https://ink.library.smu.edu.sg/sis_research/6636
https://ink.library.smu.edu.sg/context/sis_research/article/7639/viewcontent/09380460.pdf
_version_ 1770576014146011136