Year 2021

Instruction

  • You will set up a question you want to address from the dataset. If you have one, please send it TA via Slack DM.

  • The question should not be overlapped with other students. So if you made a question, please send me as soon as possible. First come first serve.

  • Your assignment has to be written in the R notebook (as your daily assignment), including plot(s), the codes for data munging and plot, and description about your question, aim, finding. It would be great if you add some explanations on terms you have learnt over the assignment.

  • The number of plots in your assignment is limited to 2. Multi-panel plots (like facet_wrap or cowplot) are allowed.

  • Language: English + R.

  • Submission has to be made via Slack DM.

  • Document file format: PDF (you can knit it or convert PDF from html)

  • Please submit your assignment by 12th Nov, 10pm. You can have only 70% score for the late submission

Evaluation

  • Accuracy. How well you describe the dataset related to your question in the R notebook?

  • Flowing Data. Whether/how many times you use tidyverse functions for data munging? Whether the code is briefly and efficiently written for your task (e.g. %>%)?

  • Storytelling with Data. Whether you fully utilize the dataset for your question? Whether you understand the aim and methods of the dataset?

  • Readability of your plot. How much information you summarize into your plot?

  • Right visual format. Whether you properly use color, scale, and data type for your data visualization.

Topic

Chen et al. (2020), Cell, Proteogenomics of Non-smoking Lung Cancer in East Asia Delineates Molecular Signatures of Pathogenesis and Progression

Please read this article and understand the dataset provided in the supplementary table. Set up your question from the dataset and visualize information for your question.

MISC.

Examples from previous studies

Here are some examples, which made me impressed.

Last updated