About Me

Hi there, my name is Arpan Mishra and I currently work as a Data Scientist at ZS Associates. I graduated in July 2021 with a Bachelor of Science in Statistics from KMC, University of Delhi.

I’m passionate about creating data driven solutions for real world problems, my main interests lie in the field of machine learning, core statistics and natural language processing.

If i’m not coding then you can find me playing my ukulele or crushing someone on chess.com, challenges are accepted ♚

Experience

This is how my professional journey has been until now


Data Science Associate at ZS Associates
November 2021 – Present | New Delhi, India


Research Intern at Inria
June 2021 – September 2021 | Lille, France

  • Worked with medical data for mental health patients with a history of suicide attempts.
  • The objective was to model the recurrence of a suicide attempt from demographic as well as medical survey data by VigilanS using parametric as well as non parametric statistical methods.
  • We also conduct spatial analysis of the patients and use geostatistical models to include the effect of spatial autocorrelation.
  • A paper detailing the process and the conclusions will be published soon.

Machine Learning Engineer at Omdena
August 2020 – December 2020 | Remote

  • Worked with satellite imagery and survey data from Census and DHS.
  • The objective was to use satellite imagery in order to predict those socio-economic indicators of regions in India, which can act as a proxy for economic well-being.
  • Used Landsat 7 & 8 Satellite Images and census data to create a model which predicts district level census variables using a multi modal - multi task learning approach. Further we used DHS data and sentinel images to classify the Asset Wealth Index of clusters across India.
  • This project was hosted by World Resources Institute (WRI) and is under UN´s Sustainable Development Goal 8 (Decent Work & Economic Growth).

Data Science Intern at AAIT
July 2020 – September 2020 | Remote

  • Worked in predicting stock price trend, up/down close price prediction and change forecasting.
  • Used classical machine learning and deep learning with major focus being on techniques like ANN, LSTM, CONV-LSTM etc.

Data Analyst Intern at Mindler
June 2019 – July 2019 | New Delhi, India

  • Worked with the sales and advertisement teams in order to draw insights from the website traffic data as well as the data created by the sales team.
  • Analysed various metrics such as User Traffic, Bounce Rate, Exit Percentage etc. to help the organisation plan their ad campaigns efficiently.

Skills

Projects

These are some of the personal projects that I have built in the past.


ross

Rossman Sales Prediction

Created a tool to predict the daily sales of any store of the Rossmann drug store chain which is the 2nd largest drug store chain in Germany.

anime

Sentiment Extraction using Bert

Used Bert to detect the sentiment of a given text and further extract the words that best conveys the detected sentiment.

svm

Generating Anime Synopsis using Deep Learning

I used two techniques, LSTMs and then a fine tuned GPT2 for comparing their language modeling capabilities and the results were astounding!

pred

Global Suicide Analysis EDA

I analyzed the global suicide data for 90+ countries from the year 1985 - 2015 in R. Various statistical tests and data visualization techniques were used to explain the data.

ross

Text Analysis Webapp

The purpose of this app is to offer anyone starting off an NLP projects a fast and convenient means of exploring the text data cutting down the time between EDA and Modelling.

anime

Rubik’s Cube Rotation Prediction

Predicting the X-Axis Rotation for a give rubik’s cube using Resnet-50. This was part of the AI Blitz Challenge, a hackathon hosted by AI Crowd.

anime

Selfie Filter using CNN

I used a CNN architecture for facial keypoint detection and further used openCV to achieve the desired effect of a sunglass filter which works real time with a webcam.

Blog

Here are few of the blogs that I have written related to machine learning, data science and the projects that I have built.


SAT

Faster Machine Learning Using Hub by Activeloop

A code walkthrough of using the hub package for satellite imagery

anime

Let’s make some Anime using Deep Learning

Comparing text generation methods: LSTM vs GPT2

svm

Decoding Support Vector Machines

Intuitively understand how Support Vector Machines work

pred

Predicting HR Attrition using Support Vector Machines

Learn to train an SVM model following best practices

Contact