Ellie Li

Master's | Data Science, WPI

Selected Projects

Talent.AI

Internal Job Recommendation System

Duration:
Sep 2018 - Dec 2018
Tools:
Python, nltk, gensim, TensorFlow, ElasticSearch
Goal:
Automatically extract skills from job descriptions and resumes using Natural Language Processing (NLP) techniques, then match resumes to jobs and recommend openings. The goal is to reduce the resource cost of the hiring process and relieve pressure on hiring managers across the organization.
Sponsor:
United Technologies (UTC)
Team:
3 data scientists from UTC HR analytics + 5 students from WPI data science
Challenges:
  • There is no existing dictionary to identify which words or phrases in job descriptions are professional skills. (solution: combined TextRank, TF-IDF, NER, and Random Forest to filter out professional skills)
  • Long computation time. (solution: ElasticSearch)
  • Hard to define the number and names of the skill sets generated by K-means. (solution: removed the clustering step and introduced KNN to retrieve the skills most similar to a target skill)
Contribution:
  • Designed algorithms to clean up irrelevant noisy words and extract skills from over 20,000 UTC’s job descriptions
  • Trained Word2Vec with UTC’s job descriptions to obtain n-grams semantic embedding for extracted skills
  • Adopted KNN to find each skill’s synonyms to generate extended skill sets and built and stored inverted index through ElasticSearch
  • Automated the job recommendation system allowing users to upload resumes and receive job recommendations
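The KNN synonym step above can be sketched as follows. This is a minimal toy example with hypothetical skill names and hand-made three-dimensional vectors; in the real project the embeddings came from a Word2Vec model trained on UTC's job descriptions, and the index lived in ElasticSearch.

```python
from math import sqrt

# Toy skill embeddings (hypothetical vectors; the real ones came from a
# Word2Vec model trained on UTC job descriptions).
skills = {
    "python":        [0.9, 0.1, 0.0],
    "programming":   [0.8, 0.2, 0.1],
    "java":          [0.7, 0.3, 0.0],
    "negotiation":   [0.0, 0.1, 0.9],
    "communication": [0.1, 0.0, 0.8],
}

def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = sqrt(sum(x * x for x in a))
    nb = sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def nearest_skills(target, k=2):
    """Return the k skills most similar to `target` (KNN by cosine similarity)."""
    others = [(s, cosine(skills[target], v))
              for s, v in skills.items() if s != target]
    return [s for s, _ in sorted(others, key=lambda p: p[1], reverse=True)[:k]]
```

With these toy vectors, `nearest_skills("python")` returns the two technical skills rather than the soft skills, which is exactly the "extended skill set" behavior the project relied on.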
Feedback:

"Very cool. Thank you for sharing. The first match sounds like it aligned best with me!"

“I think this is a very powerful tool! I think with the resume I sent to you I would most definitely apply to this job as it touches on a lot of the skills that I listed…"

"All are jobs that would fit my background, but I personally know I wouldn’t want to work in those roles."

"I think the first 3 JD match my profile but not so much the fourth one."

"I went through the 5 recommendations and have some feedback : 1 - Poor Match, 2 - Good Match, 3 - Poor Match, 4 - Worst Match, 5 - Best Match."

  WHAT DID I LEARN?

  • Clarify the client's goal before doing anything, then stick to it.
  • Be ready to make changes during implementation. In this project, we initially planned to cluster the skill sets and label each class. However, when we showed the clustering results to our sponsors, they found it impossible to give each skill class a proper name, so they changed their requirements and we changed our methodology accordingly.
  • Stay “whelmed” - neither overwhelmed nor underwhelmed. We used to say yes whenever clients came up with new ideas and wanted us to try them, because we wanted to be agreeable. The rewarding part is that when you are open to new challenges, people reach out to you; but these 'bonus' tasks can disrupt the original timeline. It is better to spend some meeting time discussing the feasibility of a new idea and then decide whether to invest time in it.

Rap Maker

An LSTM model with an information retrieval component for rap lyrics generation

Duration:
Mar 2019 - Apr 2019
Tools:
Python, nltk, gensim, keras, tkinter, GitHub
Team:
4 students from WPI data science
Goal:
We first collected and cleaned the lyrics of existing hip-hop songs into a single data set. Next, we extracted keywords and ran sentiment analysis on each piece of lyrics, recording both to build an inverted index. Finally, we trained an LSTM model to generate new rap lyrics based on the user's query.
Challenges:
  • Handling the rhyme scheme (solution: collected the last two letters of each line in the training set when building the LSTM)
  • Automatic keyword extension must respect both grammar and the emotional context (solution: added an information retrieval component)
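The rhyme-scheme trick above can be sketched in a few lines. This is a simplified illustration with made-up lyric lines; `rhyme_token` is a hypothetical helper name, not the project's actual function.

```python
def rhyme_token(line):
    """Extract the last two letters of a lyric line as a crude rhyme feature,
    mirroring the per-line suffixes collected for the LSTM training set."""
    # Keep only letters and spaces so punctuation doesn't pollute the suffix.
    cleaned = "".join(ch for ch in line.lower()
                      if ch.isalpha() or ch == " ").strip()
    return cleaned[-2:] if len(cleaned) >= 2 else cleaned

lyrics = ["Started from the bottom now we here",
          "Money in my pocket, never fear"]
tokens = [rhyme_token(l) for l in lyrics]
```

Feeding these two-letter suffixes to the model alongside the words lets it learn which line endings tend to rhyme.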
Contribution:
  • Designed the framework of our model
  • Wrote the Python script for data preprocessing, including keyword extraction (TextRank, TF-IDF, LDA) and sentiment analysis
  • Implemented the query and searching function
  • Designed and built the UI
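As a rough sketch of the TF-IDF keyword extraction mentioned above (one of the three methods we combined), here is a minimal from-scratch version on toy lyric snippets; the document set, tokenization, and `top_keywords` helper are all illustrative simplifications.

```python
import math
from collections import Counter

# Toy "lyrics" corpus; the real corpus was the cleaned hip-hop lyric data set.
docs = [
    "love and heartbreak in the city lights",
    "money and hustle in the city streets",
    "love conquers heartbreak every time",
]

def top_keywords(doc_idx, docs, k=2):
    """Rank one document's words by TF-IDF score, highest first."""
    tokenized = [d.split() for d in docs]
    n = len(docs)
    tf = Counter(tokenized[doc_idx])
    scores = {}
    for word, count in tf.items():
        df = sum(1 for d in tokenized if word in d)  # document frequency
        scores[word] = (count / len(tokenized[doc_idx])) * math.log(n / df)
    return [w for w, _ in sorted(scores.items(), key=lambda p: -p[1])[:k]]
```

Words that appear in only one document ("lights") score above words shared across the corpus ("city", "the"), which is the property that makes TF-IDF useful for keyword extraction.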
More Info:
evakli11.github.io/CS525

  WHAT DID I LEARN?

  • Good lines of communication. Everyone needs to know what the goal is and have some kind of agreed-upon idea of how the goal will be accomplished. Having everyone review the framework before the actual implementation can save the team a lot of time because it can help the team prioritize some steps and identify if we are on the right track.
  • Positive interdependence. Groups often divide the workload among its members to move closer to the final goal. I have learned that if one person does not complete their individual part, the whole group suffers, and therefore it is important that each person fulfill their role according to the group's established timeline.
  • For presenting an NLP project, a more interactive format works better because it gives the audience space to engage.

Air Discount!

Database System

Duration:
Oct 2017 - Dec 2017
Tools:
SQLite, Python, Django, HTML, CSS, JavaScript
Team:
3 students from WPI data science
Goal:
Our aim was to build a website, backed by a database, where people can search for air tickets. Users search by departure city, arrival city, and departure time to get matching flights. When a search returns multiple results, users can sort them by departure time, arrival time, or ticket price.
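The search-and-sort behavior described above can be sketched with SQLite directly. The table name, columns, and sample rows below are hypothetical; the real project used a richer 3NF schema behind Django.

```python
import sqlite3

# Hypothetical single-table schema; the real design was normalized to 3NF.
conn = sqlite3.connect(":memory:")
conn.execute("""CREATE TABLE flights (
    id INTEGER PRIMARY KEY,
    depart_city TEXT, arrive_city TEXT,
    depart_time TEXT, arrive_time TEXT,
    price REAL)""")
conn.executemany(
    "INSERT INTO flights (depart_city, arrive_city, depart_time, arrive_time, price) "
    "VALUES (?, ?, ?, ?, ?)",
    [("Boston", "Chicago", "2017-12-01 09:00", "2017-12-01 11:30", 220.0),
     ("Boston", "Chicago", "2017-12-01 14:00", "2017-12-01 16:30", 180.0)])

def search(depart, arrive, date, order_by="price"):
    """Find flights by cities and date, sorted by one of the allowed columns."""
    # Whitelist the sort column; user data goes through `?` placeholders.
    assert order_by in ("depart_time", "arrive_time", "price")
    cur = conn.execute(
        f"SELECT depart_time, price FROM flights "
        f"WHERE depart_city = ? AND arrive_city = ? AND depart_time LIKE ? "
        f"ORDER BY {order_by}",
        (depart, arrive, date + "%"))
    return cur.fetchall()
```

Because column names cannot be bound as SQL parameters, the sort key is checked against a whitelist while the city and date values use placeholders, which keeps the query safe from injection.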
Challenges:
  • Everyone in our group was new to web development; we were unfamiliar with web design tools and with linking the front end to the database. (solution: we learn fast!)
  • As the discussion deepened, more and more ideas emerged. (solution: we agreed to start with the simplest version)
Contribution:
  • Designed the ER diagram and clarified the 3NF Relations
  • Built the database in SQLite
  • Took charge of the front-end implementation
Demo:
evakli11.github.io/airdiscount/

  WHAT DID I LEARN?

  • By finishing this project, I learned a more advanced way to use a database system.
  • I got used to working under pressure. My schedule went crazy the first week because of all the new tools, but that forced me to focus on what I was learning, assess my priorities, and come up with a plan. Overall, I think of pressure as a form of motivation rather than an obstacle.
  • It is important to define the scope before implementation. Otherwise, new ideas keep coming up during discussion, and we got stuck in the database design.

Who’s Paying?

A Joint Classification-Regression Model of Predicting Spenders and Revenue for Google Store

Duration:
Nov 2018 - Dec 2018
Tools:
Python, PyTorch, scikit-learn
Team:
3 students from data science + 1 student from computer science
Goal:
The ‘Google Analytics Customer Revenue Prediction’ is a Kaggle competition to predict the revenue generated per customer from data of the Google Merchandise Store (GStore). We aimed to implement a machine learning system that accurately predicts customer-generated revenue.
Challenges:
  • Some customers visit the GStore multiple times, which produces sequential data. (solution: introduced an RNN model to improve predictions for customers with more than one visit)
  • The target variable is heavily skewed: fewer than 1.3% of customer visits generate non-zero revenue. (solution: applied a log transform and undersampling to the dataset, and added a classification module before regression to filter out non-revenue customers)
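The log transform and undersampling from the second challenge can be sketched as below. The revenue values are made up to mimic the skew (mostly zeros); the real pipeline operated on the Kaggle data set.

```python
import math
import random

random.seed(0)

# Hypothetical skewed revenue data: most visits generate zero revenue.
revenues = [0.0] * 98 + [125.0, 4300.0]

# Log-transform targets to tame the heavy right tail; log1p keeps zeros at 0.
log_targets = [math.log1p(r) for r in revenues]

# Undersample the majority (zero-revenue) class to balance the training set.
zeros = [t for t in log_targets if t == 0.0]
nonzeros = [t for t in log_targets if t > 0.0]
sampled_zeros = random.sample(zeros, len(nonzeros))
balanced = sampled_zeros + nonzeros
```

After this step the classifier trains on a 50/50 split instead of a 98/2 split, which keeps it from trivially predicting "no revenue" for every visit.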
Contribution:
  • Wrote the Python script for the pre-classifier part of our pre-classified regression model to do visit-based prediction, trying five classic classification algorithms: Decision Tree, Random Forest, SVM, Logistic Regression, and KNN
  • Tried stacked regression algorithms to improve the performance of regression parts
  • Implemented the Vanilla RNN via PyTorch to do revenue prediction as baseline model of our customer based prediction
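The overall shape of the pre-classified regression model above can be sketched as follows. The feature tuples, threshold classifier, and linear regressor are trivial stand-ins for illustration only; the real pipeline used the five classifiers listed above plus stacked regressors.

```python
def pre_classified_predict(visits, classifier, regressor):
    """Predict revenue per visit: a classifier first filters out visits
    predicted to generate no revenue; the regressor runs only on the rest."""
    return [regressor(v) if classifier(v) else 0.0 for v in visits]

# Hypothetical per-visit features: (pageviews, hits).
visits = [(1, 2), (30, 80), (2, 3), (45, 120)]

# Stand-ins for the trained classification and regression modules.
is_spender = lambda v: v[0] >= 10   # plays the role of DT/RF/SVM/LogReg/KNN
revenue_of = lambda v: 1.5 * v[1]   # plays the role of the regression module

preds = pre_classified_predict(visits, is_spender, revenue_of)
```

Splitting the problem this way means the regressor never has to learn the mass of zeros, which is the point of putting the classification module first.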
More Info:
evakli11.github.io/Google-Store-Revenue-Prediction

  WHAT DID I LEARN?

  • Keep a healthy relationship within the team. For example, couch feedback on teammates' work in positive terms to avoid defensiveness when a misunderstanding or gap exists, and offer modification suggestions kindly.
  • Be careful about the distribution of the data. In real-world problems, datasets are rarely normally distributed and tend to be skewed.

Scheduling Assistant

Automate the Scheduling Process of Campus Event with Artificial Intelligence

Duration:
Mar 2018 - Apr 2018
Team:
2 students from WPI data science + 1 student from WPI computer science + 1 student from WPI robotics
Goal:
This project is a scheduling assistant for a campus organization known as Engineering Ambassadors (EA). In EA, student-led events are held each week where participants are scheduled based on their availability. The person in charge of scheduling must satisfy a number of requirements, and doing this by hand can take up to a week, so this project aims to automate the process with hill-climbing and genetic algorithms.
Tool:
Python
Challenges:
  • Too many constraints. These constraints add complexity to the scheduling because the schedule must be fair and each event has required roles.
  • We were given a corner case: 30 students and only 20 events, yet our constraints prevent any one student from taking charge of too many roles or events.
Contribution:
  • Defined the data structure of whole project and drew the class diagram
  • Wrote the Python script of hill-climbing algorithm
  • Documented the report of our outcomes
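The hill-climbing script mentioned above can be sketched generically. The scheduling problem here is a toy (6 events, 3 students, fairness-only scoring); the real project had role requirements and many more constraints, and `hill_climb`, `score`, and `neighbor` are illustrative names.

```python
import random

random.seed(1)

def hill_climb(initial, neighbor, score, iters=200):
    """Generic hill climbing: propose a random neighbor each iteration and
    move to it only if it strictly improves the score."""
    current, best = initial, score(initial)
    for _ in range(iters):
        cand = neighbor(current)
        s = score(cand)
        if s > best:
            current, best = cand, s
    return current, best

# Toy scheduling: assign 6 events to 3 students, penalizing unfair workloads.
def score(assignment):
    loads = [assignment.count(s) for s in range(3)]
    return -sum((l - 2) ** 2 for l in loads)  # perfectly fair schedule -> 0

def neighbor(assignment):
    """Reassign one random event to a random student."""
    a = list(assignment)
    a[random.randrange(len(a))] = random.randrange(3)
    return a

best, s = hill_climb([0] * 6, neighbor, score)  # start: one student does all
```

Starting from the worst schedule (one student assigned every event, score -24), the climber quickly converges toward a balanced assignment, though as the lessons below note, hill climbing offers no guarantee of a perfect schedule on the real constraint set.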
More Info:
evakli11.github.io/schedulingAssistant/

  WHAT DID I LEARN?

  • Establishing good coding habits will enhance design factors like modularity, and your code will be easier to understand. It's important when collaborating with others.
  • Neither algorithm was wholly successful at creating a perfect schedule, because of how complex the data was and how much time it took to iterate through and test the algorithms. However, the schedules created were improvements from a completely random schedule.

  WHAT WOULD I HAVE DONE DIFFERENTLY?

For evaluation, we currently use only counts such as the number of people with too many or too few hours and the number of people with too many roles filled, but these do not capture how well a schedule actually works. For example, a schedule where 10 people each exceed their expected hours by 2 is clearly better than one where 10 people each exceed them by 10. By improving the strategy for evaluating schedules, we could collect more data for tuning the heuristic and produce even better outcomes.