GitHub

DoSStoolkit

The DoSS Toolkit is a collection of self-paced materials to help you learn and use R.

We all know that R is a critical part of applied statistics and data science these days, but it can have a steep learning curve and be intimidating to get started with.

The Department of Statistical Sciences (DoSS) toolkit is a free series of open source online modules by undergraduates, that their fellow students and the public can use to learn the essentials of R.

--

Style Guide

Based on: https://github.com/UBC-DSCI/introduction-to-datascience

General

Use Canadian spelling
Expand all contractions
80 character line limit.
Numbers in text should be english words ("four common mistakes" not "4 common mistakes") unless there are units (40km, not forty km)
Use Oxford commas ("a, b, and c" not "a, b and c")
Functions in text should have parentheses (read_csv() not read_csv)
Remove all references to "course" and "student"; replace with "reader" or "you" where necessary
Remove all references to "clicking on things" in the HTML version of the book (e.g. "click this link to ...")
Book titles in the text should be typeset in italics and then the reference (e.g. R for Data Science [thebibtexkey])

Code blocks

Always use |> pipe, not %>%
Do not end code blocks with head(dataframe); just use dataframe to print
set.seed once at the beginning of each chapter
Use "double quotes" for strings, not 'single quotes'
Make sure all lines of code are at most 80 characters (for LaTeX PDF output typesetting)
Pass code blocks through styler (although must obey the 80ch limit)

Section headings

All (sub)section headings should be sentence case ("Loading a tabular data set", not "Loading a Tabular Data Set")
Make sure that subsections occur in 1-step hierarchies (no subsubsection directly below subsection, for example)

Learning objectives

When saying that students will do things in code, always say "in R"
"you will be able to" (not "students will be able to", "the reader will be able to")

Captions

Captions should be sentence formatted and end with a period

Equations

Make sure all equations get capitalized labels ("Equation \@ref(blah)", not "equation below" or "equation above")

Figures

Make sure all figures get (capitalized) labels ("Figure \@ref(blah)", not "figure below" or "figure above")
Make sure all figures get captions
Specify image widths of pngs and jpegs in terms of linewidth percent (e.g. out.width="70%"), for plots we create in R use fig.width and fig.height.
Center align all images via fig.align = "center"

Tables

Make sure all tables get capitalized labels ("Table \@ref(blah)", not "table below" or "table above")
Make sure all tables get captions
Make sure the row + column spacing is reasonable
Do not put links in table captions, it breaks pdf rendering
Do not put underscores in table captions, it breaks pdf rendering

Note boxes

Note boxes should be typeset as quote boxes using > and start with Note:

Bibliography

Do not put "et al" or "and others"; always use the full list of authors, BibTeX will choose how to abbreviate
Read https://trevorcampbell.me/html/bibtex.html and make sure our bib follows this convention

Naming conventions

K-means (not $K$-*, K means, Kmeans)
K-nearest neighbors (not $K$-*, K nearest neighbors, K nearest neighbor, use US spelling neighbor not neighbour). Note that "K-nearest neighbor" is not the singular form; "K-nearest neighbors" is
K-NN (not $K$-*, KNN, K NN, $K$NN, K-nn)
Local repository (not local computer)
Package (not library, meta package, meta-package)
data science (not Data Science)
dataframe (not data frame)
dataset (not data set)
scatterplot (not scatter plot)
bar plot (not bar chart)
Capitalize all initialisms and acronyms (URL not url, API not api, K-NN not k-nn)
Response variable (not target, output, label)
Predictor variable (not explanatory, feature)
Numerical variable (not quantitative variable)
Categorical variable (not class variable)

Punctuation

emdashes should have no surrounding spaces. This kind of typesetting—which is awesome—is correct! and Typesetting with spaces around em-dashes — which is bad — is not correct
make sure \index commands don't break punctuation spacing. E.g. This is an item \index{item}; it is good will typeset with an erroneous space after item, i.e. This is an item ; it is good

Whitespace

We need a line of whitespace before and after code fences (code surrounded by three backticks above and below). This is for readability, and it is essential for figure captions.

PDF Output

These are absolute last steps when rendering the PDF output:

Look for and fix bad line breaks (e.g. with only one word on the next line, orphans, and widows)
Look for and fix bad line wraps in code and text
Look for and fix bad figure placement (falling off page, going over the side)
Look for and fix large whitespace sections where LaTeX doesn't want to break the next paragraph (usually \allowdisplaybreaks helps)
Fix incorrect indenting. LaTeX will indent for a new paragraph if there is an extra whitespace line, so these should be deleted if no paragraph break is desired.
Look for ?? in the PDF (broken refs)
Look in the index for near-duplicates, and merge if needed
Look for / fix raw LaTeX code (search for backslash and curly brace in the final PDF)
Make sure the 3D figures (and the text around them that refers to clicking and dragging) are properly modified for the PDF output
Make sure all markdown label-replaced URLs (of the form [blah](url)) will make sense in the hardcopy book version (i.e. nothing like "click this"). Many links appear in the additional resources: make sure the text-replacement of the URL contains enough information for someone to find the resource (without being able to click the link)

HTML Output

Look for broken references
Look for uncentered images

Repository Organization / Important Files

The files index.Rmd and ##-name.Rmd are R-markdown chapter contents to be parsed by Bookdown
_bookdown.yml sets the output directory (docs/) and default chapter name
img/ contains custom images to be used in the text; note this is not all of the images as some are generated by R code when compiling
data/ stores datasets processed during compile
docs/.nojekyll tells github's static site builder not to run Jekyll. This avoids Jekyll deleting the folder docs/_main_files (as it starts with an underscore)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DoSStoolkit

Style Guide

General

Code blocks

Section headings

Learning objectives

Captions

Equations

Figures

Tables

Note boxes

Bibliography

Naming conventions

Punctuation

Whitespace

PDF Output

HTML Output

Repository Organization / Important Files

About

Uh oh!

Releases

Packages

Contributors 12

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
_bookdown_files		_bookdown_files
data		data
docs		docs
images		images
.gitignore		.gitignore
.nojekyll		.nojekyll
000-hello_world-introduction.Rmd		000-hello_world-introduction.Rmd
001-hello_world-setup.Rmd		001-hello_world-setup.Rmd
002-hello_world-what_is_what.Rmd		002-hello_world-what_is_what.Rmd
003-hello_world-hello_world_1.Rmd		003-hello_world-hello_world_1.Rmd
004-hello_world-hello_world_2.Rmd		004-hello_world-hello_world_2.Rmd
005-hello_world-hello_world_3.Rmd		005-hello_world-hello_world_3.Rmd
006-hello_world-community.Rmd		006-hello_world-community.Rmd
007-operating_in_an-introduction.Rmd		007-operating_in_an-introduction.Rmd
008-operating_in_an-learning_to_learn.Rmd		008-operating_in_an-learning_to_learn.Rmd
009-operating_in_an-using_google_and_so.Rmd		009-operating_in_an-using_google_and_so.Rmd
010-operating_in_an-stack_overflow.Rmd		010-operating_in_an-stack_overflow.Rmd
011-operating_in_an-when_your_code_wont_work.Rmd		011-operating_in_an-when_your_code_wont_work.Rmd
013-operating_in_an-reprex.Rmd		013-operating_in_an-reprex.Rmd
014-operating_in_an-making_the_most.Rmd		014-operating_in_an-making_the_most.Rmd
015-holding_the_chaos_at_bay.Rmd		015-holding_the_chaos_at_bay.Rmd
016-holding_the_chaos_at_bay-rprojects_setwd.Rmd		016-holding_the_chaos_at_bay-rprojects_setwd.Rmd
017-holding_the_chaos_at_bay-folder_setup.Rmd		017-holding_the_chaos_at_bay-folder_setup.Rmd
018-holding_the_chaos_at_bay-comments.Rmd		018-holding_the_chaos_at_bay-comments.Rmd
019-holding_the_chaos_at_bay-install_packages.Rmd		019-holding_the_chaos_at_bay-install_packages.Rmd
020-holding_the_chaos_at_bay-install_github.Rmd		020-holding_the_chaos_at_bay-install_github.Rmd
021-holding_the_chaos_at_bay-library.Rmd		021-holding_the_chaos_at_bay-library.Rmd
022-holding_the_chaos_at_bay-update_packages.Rmd		022-holding_the_chaos_at_bay-update_packages.Rmd
023-holding_the_chaos_at_bay-read_csv.Rmd		023-holding_the_chaos_at_bay-read_csv.Rmd
024-holding_the_chaos_at_bay-read_table.Rmd		024-holding_the_chaos_at_bay-read_table.Rmd
025-hand_me_my_plyrs.Rmd		025-hand_me_my_plyrs.Rmd
026-hand_me_my_plyrs-tidyverse.Rmd		026-hand_me_my_plyrs-tidyverse.Rmd
027-hand_me_my_plyrs-pipe.Rmd		027-hand_me_my_plyrs-pipe.Rmd
028-hand_me_my_plyrs-select.Rmd		028-hand_me_my_plyrs-select.Rmd
029-hand_me_my_plyrs-filter.Rmd		029-hand_me_my_plyrs-filter.Rmd
030-hand_me_my_plyrs-group_by.Rmd		030-hand_me_my_plyrs-group_by.Rmd
031-hand_me_my_plyrs-summarise.Rmd		031-hand_me_my_plyrs-summarise.Rmd
032-hand_me_my_plyrs-arrange.Rmd		032-hand_me_my_plyrs-arrange.Rmd
033-hand_me_my_plyrs-mutate.Rmd		033-hand_me_my_plyrs-mutate.Rmd
034-hand_me_my_plyrs-pivotwider.Rmd		034-hand_me_my_plyrs-pivotwider.Rmd
035-hand_me_my_plyrs-rename.Rmd		035-hand_me_my_plyrs-rename.Rmd
036-hand_me_my_plyrs-count_and_uncount.Rmd		036-hand_me_my_plyrs-count_and_uncount.Rmd
037-hand_me_my_plyrs-slice.Rmd		037-hand_me_my_plyrs-slice.Rmd
038-hand_me_my_plyrs-c_matrix_dataframe_tibble.Rmd		038-hand_me_my_plyrs-c_matrix_dataframe_tibble.Rmd
039-hand_me_my_plyrs-length_nrow_ncol_dim.Rmd		039-hand_me_my_plyrs-length_nrow_ncol_dim.Rmd
041-totally_addicted_to_base.Rmd		041-totally_addicted_to_base.Rmd
042-totally_addicted_to_base-mean_median_sd_lm_summary.Rmd		042-totally_addicted_to_base-mean_median_sd_lm_summary.Rmd
043-totally_addicted_to_base-glm.Rmd		043-totally_addicted_to_base-glm.Rmd
043-totally_addicted_to_base-lme4.Rmd		043-totally_addicted_to_base-lme4.Rmd
044-totally_addicted_to_base-function.Rmd		044-totally_addicted_to_base-function.Rmd
045-totally_addicted_to_base-for_while.Rmd		045-totally_addicted_to_base-for_while.Rmd
046-totally_addicted_to_base-if_ifelse_casewhen.Rmd		046-totally_addicted_to_base-if_ifelse_casewhen.Rmd
047-totally_addicted_to_base-c_seq_seqalong_rep.Rmd		047-totally_addicted_to_base-c_seq_seqalong_rep.Rmd
048-totally_addicted_to_base-hist_plot_boxplot.Rmd		048-totally_addicted_to_base-hist_plot_boxplot.Rmd
049-totally_addicted_to_base-apply_sapply_lapply.Rmd		049-totally_addicted_to_base-apply_sapply_lapply.Rmd
050-totally_addicted_to_base-basename_rm_fileexists_dircreate.Rmd		050-totally_addicted_to_base-basename_rm_fileexists_dircreate.Rmd
051-totally_addicted_to_base-sum_dim_round.Rmd		051-totally_addicted_to_base-sum_dim_round.Rmd
052-totally_addicted_to_base-isna_which_unique.Rmd		052-totally_addicted_to_base-isna_which_unique.Rmd
053-totally_addicted_to_base-rownames_colnames.Rmd		053-totally_addicted_to_base-rownames_colnames.Rmd
054-totally_addicted_to_base-floor_ceiling_round_abs.Rmd		054-totally_addicted_to_base-floor_ceiling_round_abs.Rmd
055-he_was_a_d8er_boi.Rmd		055-he_was_a_d8er_boi.Rmd
056-he_was_a_d8er_boi-head_tail_glimpse_summary.Rmd		056-he_was_a_d8er_boi-head_tail_glimpse_summary.Rmd
057-he_was_a_d8er_boi-paste_glue_stringr.Rmd		057-he_was_a_d8er_boi-paste_glue_stringr.Rmd
058-he_was_a_d8er_boi-names_rbind_cbind.Rmd		058-he_was_a_d8er_boi-names_rbind_cbind.Rmd
059-he_was_a_d8er_boi-leftjoin_antijoin_fulljoin.Rmd		059-he_was_a_d8er_boi-leftjoin_antijoin_fulljoin.Rmd
061-he_was_a_d8er_boi-missing_data.Rmd		061-he_was_a_d8er_boi-missing_data.Rmd
062-he_was_a_d8er_boi-setseed_runif_sample.Rmd		062-he_was_a_d8er_boi-setseed_runif_sample.Rmd
063-he_was_a_d8er_boi-simulating_datasets_for_regression.Rmd		063-he_was_a_d8er_boi-simulating_datasets_for_regression.Rmd
064-he_was_a_d8er_boi-conditional_mutate_and_summarise.Rmd		064-he_was_a_d8er_boi-conditional_mutate_and_summarise.Rmd
065-he_was_a_d8er_boi-tidying_up_datasets.Rmd		065-he_was_a_d8er_boi-tidying_up_datasets.Rmd
066-he_was_a_d8er_boi-pull_pluck_unnest.Rmd		066-he_was_a_d8er_boi-pull_pluck_unnest.Rmd
067-he_was_a_d8er_boi-forcats_and_factors.Rmd		067-he_was_a_d8er_boi-forcats_and_factors.Rmd
068-he_was_a_d8er_boi-more_on_strings.Rmd		068-he_was_a_d8er_boi-more_on_strings.Rmd
069-he_was_a_d8er_boi-dates.Rmd		069-he_was_a_d8er_boi-dates.Rmd
069-he_was_a_d8er_boi-regular_expressions.Rmd		069-he_was_a_d8er_boi-regular_expressions.Rmd
070-he_was_a_d8er_boi-janitor.Rmd		070-he_was_a_d8er_boi-janitor.Rmd
072-to_ggplot_or_not_to_ggplot.Rmd		072-to_ggplot_or_not_to_ggplot.Rmd
073-to_ggplot_or_not_to_ggplot-overview.Rmd		073-to_ggplot_or_not_to_ggplot-overview.Rmd
074-to_ggplot_or_not_to_ggplot-barcharts.knit.md		074-to_ggplot_or_not_to_ggplot-barcharts.knit.md
075-to_ggplot_or_not_to_ggplot-histograms.Rmd		075-to_ggplot_or_not_to_ggplot-histograms.Rmd
076-to_ggplot_or_not_to_ggplot-scatterplots.Rmd		076-to_ggplot_or_not_to_ggplot-scatterplots.Rmd
077-to_ggplot_or_not_to_ggplot-various_options.Rmd		077-to_ggplot_or_not_to_ggplot-various_options.Rmd
078-to_ggplot_or_not_to_ggplot-saving_graphs.Rmd		078-to_ggplot_or_not_to_ggplot-saving_graphs.Rmd
079-to_ggplot_or_not_to_ggplot-gganimate.Rmd		079-to_ggplot_or_not_to_ggplot-gganimate.Rmd
080-to_ggplot_or_not_to_ggplot-some_other_geom.Rmd		080-to_ggplot_or_not_to_ggplot-some_other_geom.Rmd
081-to_ggplot_or_not_to_ggplot-some_other_other_geom.Rmd		081-to_ggplot_or_not_to_ggplot-some_other_other_geom.Rmd
082-r_marky_markdown.Rmd		082-r_marky_markdown.Rmd
083-r_marky_markdown-introduction.Rmd		083-r_marky_markdown-introduction.Rmd
085-r_marky_markdown-top_matter.Rmd		085-r_marky_markdown-top_matter.Rmd
086-r_marky_markdown-tables.Rmd		086-r_marky_markdown-tables.Rmd
088-r_marky_markdown-patchwork.Rmd		088-r_marky_markdown-patchwork.Rmd
089-r_marky_markdown-references.Rmd		089-r_marky_markdown-references.Rmd
090-r_marky_markdown-pdfs.Rmd		090-r_marky_markdown-pdfs.Rmd
091-r_marky_markdown-here.Rmd		091-r_marky_markdown-here.Rmd
092-git_outta_here.Rmd		092-git_outta_here.Rmd
093-git_outta_here-what_is_version_control.Rmd		093-git_outta_here-what_is_version_control.Rmd
094-git_outta_here-pull_push.Rmd		094-git_outta_here-pull_push.Rmd
095-git_outta_here-branches.Rmd		095-git_outta_here-branches.Rmd
096-git_outta_here-conflict.Rmd		096-git_outta_here-conflict.Rmd
097-git_outta_here-git_in_rstudio.Rmd		097-git_outta_here-git_in_rstudio.Rmd

RohanAlexander/doss_toolkit_book

Folders and files

Latest commit

History

Repository files navigation

DoSStoolkit

Style Guide

General

Code blocks

Section headings

Learning objectives

Captions

Equations

Figures

Tables

Note boxes

Bibliography

Naming conventions

Punctuation

Whitespace

PDF Output

HTML Output

Repository Organization / Important Files

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 12

Uh oh!

Languages

Packages