On aggregating salaries of occupations from job post and review data
The popularity of job websites has significantly changed the way people learn about different occupations. Among the insights offered by these websites are the statistics of occupation salaries which are useful information for job seekers, career coaches, graduating students, and labor related gover...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2021
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6636 https://ink.library.smu.edu.sg/context/sis_research/article/7639/viewcontent/09380460.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-7639 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-76392022-01-14T03:35:23Z On aggregating salaries of occupations from job post and review data HUNG, Chih-Chieh LIM, Ee-peng The popularity of job websites has significantly changed the way people learn about different occupations. Among the insights offered by these websites are the statistics of occupation salaries which are useful information for job seekers, career coaches, graduating students, and labor related government agencies. Such statistics include the distribution of job salaries of each occupation, such as average or quantiles. However, significant variability in salary (and review salary) can be found among jobs of the same occupation as we gather job post and review data from job websites. Such variability shows the existence of biases, including salary competitiveness in job posts and salary inflation in job reviews. Based on the observation, we aim at developing an approach to derive occupation salary for a job market, named unbiased salary, by aggregating offer salaries from job posts and review salaries from review data and at the same time removing their biases. To achieve this goal, we proposed COC-model to learn unbiased salaries of occupations, competitiveness of companies and inflation of companies efficiently. COC here is an abbreviation of ‘‘Company, Occupation, Company’’, which represents two different connections between companies and occupations from job posting site and job review site. COC-model represents the dependency of salary information between companies and occupations in job post data and job review data. It begins with defining three latent variables, say competitiveness, inflation, and unbiased salary, based on their dependencies. Instead of computing these variables iteratively, we formulate the interaction among these three latent variables into a matrix form so that these values could be then efficiently learned in a unified way by a series of matrix operations. Extensive experiments are conducted, including empirical studies about competitiveness and inflation of companies using real dataset and performance testing by synthetic dataset. The experimental results show that COC-model can not only derive unbiased salaries effectively but also help us to understand latent biases in job post and job review data. 2021-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6636 info:doi/10.1109/ACCESS.2021.3066204 https://ink.library.smu.edu.sg/context/sis_research/article/7639/viewcontent/09380460.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Salary aggregation unbiased occupation salary bias modeling Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Salary aggregation unbiased occupation salary bias modeling Databases and Information Systems |
spellingShingle |
Salary aggregation unbiased occupation salary bias modeling Databases and Information Systems HUNG, Chih-Chieh LIM, Ee-peng On aggregating salaries of occupations from job post and review data |
description |
The popularity of job websites has significantly changed the way people learn about different occupations. Among the insights offered by these websites are the statistics of occupation salaries which are useful information for job seekers, career coaches, graduating students, and labor related government agencies. Such statistics include the distribution of job salaries of each occupation, such as average or quantiles. However, significant variability in salary (and review salary) can be found among jobs of the same occupation as we gather job post and review data from job websites. Such variability shows the existence of biases, including salary competitiveness in job posts and salary inflation in job reviews. Based on the observation, we aim at developing an approach to derive occupation salary for a job market, named unbiased salary, by aggregating offer salaries from job posts and review salaries from review data and at the same time removing their biases. To achieve this goal, we proposed COC-model to learn unbiased salaries of occupations, competitiveness of companies and inflation of companies efficiently. COC here is an abbreviation of ‘‘Company, Occupation, Company’’, which represents two different connections between companies and occupations from job posting site and job review site. COC-model represents the dependency of salary information between companies and occupations in job post data and job review data. It begins with defining three latent variables, say competitiveness, inflation, and unbiased salary, based on their dependencies. Instead of computing these variables iteratively, we formulate the interaction among these three latent variables into a matrix form so that these values could be then efficiently learned in a unified way by a series of matrix operations. Extensive experiments are conducted, including empirical studies about competitiveness and inflation of companies using real dataset and performance testing by synthetic dataset. The experimental results show that COC-model can not only derive unbiased salaries effectively but also help us to understand latent biases in job post and job review data. |
format |
text |
author |
HUNG, Chih-Chieh LIM, Ee-peng |
author_facet |
HUNG, Chih-Chieh LIM, Ee-peng |
author_sort |
HUNG, Chih-Chieh |
title |
On aggregating salaries of occupations from job post and review data |
title_short |
On aggregating salaries of occupations from job post and review data |
title_full |
On aggregating salaries of occupations from job post and review data |
title_fullStr |
On aggregating salaries of occupations from job post and review data |
title_full_unstemmed |
On aggregating salaries of occupations from job post and review data |
title_sort |
on aggregating salaries of occupations from job post and review data |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2021 |
url |
https://ink.library.smu.edu.sg/sis_research/6636 https://ink.library.smu.edu.sg/context/sis_research/article/7639/viewcontent/09380460.pdf |
_version_ |
1770576014146011136 |