Sze Hang Wong (Timothy Wong)



A dedicated, challenge-loving data analyst with 10+ years of commercial experience and a master's degree in Data Science from the University of Bristol. Skilled in ETL and data analytics, I helped my team build our own data warehouse and automate KPI reporting, so we could focus on other ad-hoc research projects.

I am currently keen to gain experience in data science and data engineering work.

Visit my LinkedIn Profile

View my CV

Technical Skills: Python, R, SQL, SPSS, Alteryx, Tableau, AWS

Previous projects in data science, machine learning and NLP


NLP project for Identifying Context-Specific Values in Arguments

Human values are broad motivational goals: when we think about values, we think of what is important to us in life. We find an argument persuasive or not depending on whether it promotes the values we prefer. For example, a person who values freedom may find the argument “loneliness and isolation are a bigger killer than corona” persuasive in favour of lifting Covid-19 restrictions. Value preferences, however, are context-specific. A person who prefers freedom over safety in the Covid-19 context may prefer safety over freedom in another context. This project seeks to create a highly automated method to identify context-specific values in arguments.

This GitHub repository contains all the data, models and Python code needed to recreate my MSc Data Science project, “Identifying Context-Specific Values in Arguments”. The project relies heavily on BERT (Bidirectional Encoder Representations from Transformers) models, a type of neural network model, to identify human values in text documents.

View code on Github


Analysing the best places for young people to live in England and Wales

University students, myself included, graduate every September. Before that, they need to decide where to live and work. This project analyses house prices, salaries, residents’ ages, suicide rates and personal well-being estimates for every local authority in England and Wales. The results show that, if a higher salary is desired but housing affordability is limited:

View code on Colab


Geospatial and cluster analysis of the best places for Data Science graduates to live in England and Wales

This data visualisation work in Tableau is a follow-up to the previous project. It uses geospatial visualisation and cluster analysis to give its users (MSc Data Science students) the relevant information they need when deciding on better areas to live. The data visualisations aim to achieve three goals:

The cluster of local authorities shown in red performs best overall, apart from a higher unemployment rate.

The results show that:

Download Tableau visualisation

View report (pdf)

View code on Colab


Previous commercial projects

A selection of analytical projects on the company’s internal data.


ETL, Data warehousing and KPI reporting automation:

Achievements:


Forecasting (simulation):

Achievements: