Kidney transplantation is often the treatment of choice for people with end-stage kidney disease. Comparing the characteristics of stable patients with those of patients who experienced rejection is one way to learn more about kidney transplant rejection. In this assignment, you will select a microarray or RNA-seq dataset that measures gene expression in patients' cells and use the gene expression to predict patient outcome (i.e., stable versus rejection) using a selected machine learning approach.
This multi-media discipline report will consist of three components:
An executive summary
A shiny app
A video presentation (worth 10%)
Some suggested gene expression datasets are:
Microarray: GSE14346, GSE15296, GSE21374, GSE46474
RNA-seq: GSE120396 (provided in lab), GSE131179 (provided in lab), GSE120649 (provided in lab), GSE86884
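For orientation only, the workflow described above (an expression matrix goes in, a stable/rejection prediction comes out, and performance is estimated on held-out patients) can be sketched as follows. This is a minimal illustration under stated assumptions, not a required or recommended method: the data are simulated rather than taken from any of the GEO series listed, the nearest-centroid classifier is just a stand-in for whichever single method you select, and all function names and parameter values are hypothetical.

```python
import random
from statistics import mean


def make_synthetic_data(n_per_class=20, n_genes=50, n_informative=10, seed=0):
    """Simulate log-expression values. ASSUMPTION: this is NOT real GEO data."""
    rng = random.Random(seed)
    X, y = [], []
    for label in ("stable", "rejection"):
        # Hypothetical effect: the first few genes are up-regulated in rejection.
        shift = 2.0 if label == "rejection" else 0.0
        for _ in range(n_per_class):
            row = [rng.gauss(8.0, 1.0) for _ in range(n_genes)]
            for g in range(n_informative):
                row[g] += shift
            X.append(row)
            y.append(label)
    return X, y


def fit_centroids(X, y):
    """Compute the mean expression profile (centroid) of each outcome class."""
    centroids = {}
    for label in set(y):
        rows = [x for x, lab in zip(X, y) if lab == label]
        centroids[label] = [mean(col) for col in zip(*rows)]
    return centroids


def predict(centroids, x):
    """Assign a patient to the class whose centroid is closest (squared Euclidean)."""
    def sq_dist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda lab: sq_dist(centroids[lab]))


def cross_validate(X, y, k=5, seed=0):
    """Estimate accuracy with k-fold cross-validation.

    The model is refit on the training folds only, so held-out patients
    never influence the centroids used to classify them.
    """
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    correct = 0
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        centroids = fit_centroids([X[i] for i in train], [y[i] for i in train])
        correct += sum(predict(centroids, X[i]) == y[i] for i in fold)
    return correct / len(X)


X, y = make_synthetic_data()
accuracy = cross_validate(X, y)
print(f"5-fold CV accuracy on synthetic data: {accuracy:.2f}")
```

With a real dataset, any preprocessing that looks at the outcome labels (for example, selecting genes by differential expression) would need to be repeated inside each training fold; otherwise the cross-validated accuracy will be optimistic.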
Part A) Prepare an executive summary of no more than 750 words that gives an overview highlighting a particular analytical approach to the question of interest, together with a brief guide to the shiny app. Points to include are:
(1) Provide a clear statement of the question you intend to address or the topic you intend to focus on in your multi-media discipline report.
(2) What is your approach to addressing the question stated in (1), and what is the key technique in your approach (e.g., random forest, lasso, Bayesian network)? Select ONE method and provide a concise technical description.
(3) Identify potential shortcomings or issues associated with the data analytics that you have performed and discuss a possible strategy to address them. Here, a strategy doesn't necessarily refer to a model, but it must address the issue.
(4) Create and describe an interactive graphic (or shiny app) that illustrates one aspect of your report, and provide the link to the shiny app (this can be either a web page or a GitHub link).
Sample Solution