Gender-based analysis of Breast Cancer - Part II

Posted on Thu 09 July 2020 in Data Science • Tagged with Breast Cancer, SEER, Survival Analysis, Cox Proportional-Hazards Model, Data Analysis

In the previous blog post, I ventured into gender based differences in breast cancer. In this post, I attempt to find out key predictors of breast cancer survival in the dataset obtained after propensity score matching in the previous blog post as this will help us understand better, the results …


Continue reading

Gender-based analysis of Breast Cancer - Part I

Posted on Thu 14 May 2020 in Data Science • Tagged with Breast Cancer, Survival Analysis, Kaplan Meier Curve, Propensity Score Matching, Data Analysis

Breast cancer is one of the two most common cancers reported worldwide in 2018 among both the genders. Along with lung cancer, breast cancer accounted for 24.6% of all the cancers 1. As the name suggests, breast cancer is cancerous growth of cells in the breast. As both women …


Continue reading

We are not alone in our body : a home to trillions of tiny friends and foes

Posted on Mon 19 August 2019 in Data Science • Tagged with Biomedical Science, Data Analysis, Microbiology

Each one of us harbors ten times more microbial cells than our own human cells. These commensals (organisms that live with you) govern a range of functions in health, disease and our behaviour. Some of the functions they are involved in are food digestion, drug metabolism, regulating metabolic rate, immunity …


Continue reading

Analyzing Hidden Themes in Retracted Biomedical Literature

Posted on Sun 12 May 2019 in Data Science • Tagged with Biomedical Science, Life Science, PubMed, Data Analysis, LDA, Topic Modeling, pyLDAvis, NLP

In my previous blog post, I discussed various players involved in retractions in biomedical and life science literature. One question that intrigued me and was left unanswered was — what are the research topics/themes that are present in these retracted publications? I attempt to answer this question in this blog …


Continue reading

On Retractions in Biomedical Literature

Posted on Sat 23 March 2019 in Data Science • Tagged with Biomedical Science, Biology, Research, PubMed, Data Analysis

The fierce competition in academia and the rush to publish, many times lead to flawed results and conclusions in scientific publications. While some of these are honest mistakes, others are deliberate scientific misconduct. According to one study, 76% of retractions were due to scientific misconduct in papers retracted from a …


Continue reading

Last Ten Years Of Cinema

Posted on Sat 23 February 2019 in Data Science • Tagged with Movies, Data Analysis

In the last blog post, I talked about how I built a Content-Based Recommender System that could make movie recommendations catering to my taste. The data was collected from IMDb and contained information about 48000 movies. While working on that project, I took a journey to the cinema landscape that …


Continue reading

A Simple Movie Recommender System

Posted on Sun 20 January 2019 in Data Science • Tagged with Movies, Data Analysis, Recommender Systems

A few days back when I was sifting through movies on Netflix, I decided to have some fun building my own movie recommender system. With the number of choices for movies, books, restaurants, news items as well as people to follow or become friends with on the Internet increasing every …


Continue reading