library(tidyverse)
library(gt)
26 Tables
But not a table. A table with features.
Sometimes, the best way to show your data is with a table – simple rows and columns. It allows a reader to compare whatever they want to compare a little easier than a graph where you’ve chosen what to highlight. The folks that made R Studio and the tidyverse have a neat package called gt
.
For this assignment, we’ll need gt
so go over to the console and run:
install.packages("gt")
So what does all of these libraries do? Let’s gather a few and use data of every men’s basketball game between 2015-2025.
For this walkthrough:
Load libraries.
And the data.
<- read_csv("data/logs2425.csv") logs
Rows: 23899 Columns: 60
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (10): Season, GameType, TeamFullName, Opponent, HomeAway, W_L, OT, URL,...
dbl (49): Game, TeamScore, OpponentScore, TeamFG, TeamFGA, TeamFGPCT, Team3...
date (1): Date
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Let’s ask this question: which college basketball team saw the greatest decrease in three point attempts per game between last season as a percentage of shots? The simplest way to calculate that is by percent change.
We’ve got a little work to do, putting together ideas we’ve used before. What we need to end up with is some data that looks like this:
Team | 2023-2024 season threes | 2024-2025 season threes | pct change
To get that, we’ll need to do some filtering to get the right seasons, some grouping and summarizing to get the right number, some pivoting to get it organized correctly so we can mutate the percent change.
<- logs |>
threechange filter(Season == "2023-2024" | Season == "2024-2025") |>
group_by(Team, Season) |>
summarise(Total3PA = sum(Team3PA)) |>
pivot_wider(names_from=Season, values_from = Total3PA) |>
filter(!is.na(`2024-2025`)) |>
mutate(PercentChange = (`2024-2025`-`2023-2024`)/`2023-2024`) |>
arrange(PercentChange) |>
ungroup() |>
slice_head(n=10) # just want a top 10 list, but can't use top_n!
`summarise()` has grouped output by 'Team'. You can override using the
`.groups` argument.
We’ve output tables to the screen a thousand times in this class with head
, but gt
makes them look decent with very little code.
|> gt() threechange
Team | 2023-2024 | 2024-2025 | PercentChange |
---|---|---|---|
Temple | 931 | 597 | -0.3587540 |
Tarleton State | 623 | 408 | -0.3451043 |
Morehead State | 912 | 614 | -0.3267544 |
Florida International | 840 | 585 | -0.3035714 |
Milwaukee | 861 | 613 | -0.2880372 |
Boston College | 826 | 597 | -0.2772397 |
College of Charleston | 1072 | 782 | -0.2705224 |
Fairfield | 943 | 729 | -0.2269353 |
Arkansas-Pine Bluff | 898 | 696 | -0.2249443 |
Georgia | 883 | 692 | -0.2163080 |
So there you have it. Temple changed its team so much they took 35 percent fewer threes in 2024-25 from the season before. Where did Maryland come out? We ranked pretty low in college basketball in terms of fewer threes from the season before, because the Terps actually took 12% more.
gt
has a mountain of customization options. The good news is that it works in a very familiar pattern. We’ll start with fixing headers. What we have isn’t bad, but PercentChange isn’t good either. Let’s fix that.
|>
threechange gt() |>
cols_label(
PercentChange = "Percent Change"
)
Team | 2023-2024 | 2024-2025 | Percent Change |
---|---|---|---|
Temple | 931 | 597 | -0.3587540 |
Tarleton State | 623 | 408 | -0.3451043 |
Morehead State | 912 | 614 | -0.3267544 |
Florida International | 840 | 585 | -0.3035714 |
Milwaukee | 861 | 613 | -0.2880372 |
Boston College | 826 | 597 | -0.2772397 |
College of Charleston | 1072 | 782 | -0.2705224 |
Fairfield | 943 | 729 | -0.2269353 |
Arkansas-Pine Bluff | 898 | 696 | -0.2249443 |
Georgia | 883 | 692 | -0.2163080 |
Better. Note the pattern: Actual header name = “What we want to see”. So if we wanted to change Team to School, we’d do this: Team = "School"
inside the cols_label
bits.
Now we can start working with styling. The truth is most of your code in tables is going to be dedicated to styling specific things. The first thing we need: A headline and some chatter. They’re required parts of a graphic, so they’re a good place to start. We do that with tab_header
|>
threechange gt() |>
cols_label(
PercentChange = "Percent Change"
|>
) tab_header(
title = "Did Maryland Shoot Fewer Threes in 2024-25?",
subtitle = "No, the Terps shot more. But these 10 teams completely changed their offenses."
)
Did Maryland Shoot Fewer Threes in 2024-25? | |||
---|---|---|---|
No, the Terps shot more. But these 10 teams completely changed their offenses. | |||
Team | 2023-2024 | 2024-2025 | Percent Change |
Temple | 931 | 597 | -0.3587540 |
Tarleton State | 623 | 408 | -0.3451043 |
Morehead State | 912 | 614 | -0.3267544 |
Florida International | 840 | 585 | -0.3035714 |
Milwaukee | 861 | 613 | -0.2880372 |
Boston College | 826 | 597 | -0.2772397 |
College of Charleston | 1072 | 782 | -0.2705224 |
Fairfield | 943 | 729 | -0.2269353 |
Arkansas-Pine Bluff | 898 | 696 | -0.2249443 |
Georgia | 883 | 692 | -0.2163080 |
We have a headline and some chatter, but … gross. Centered? The extra lines? No real difference in font weight? We can do better. We can style individual elements using tab_style
. First, let’s make the main headline – the title
– bold and left aligned.
|>
threechange gt() |>
cols_label(
PercentChange = "Percent Change"
|>
) tab_header(
title = "Did Maryland Shoot Fewer Threes in 2024-25?",
subtitle = "No, the Terps shot more. But these 10 teams completely changed their offenses."
|> tab_style(
) style = cell_text(color = "black", weight = "bold", align = "left"),
locations = cells_title("title")
)
Did Maryland Shoot Fewer Threes in 2024-25? | |||
---|---|---|---|
No, the Terps shot more. But these 10 teams completely changed their offenses. | |||
Team | 2023-2024 | 2024-2025 | Percent Change |
Temple | 931 | 597 | -0.3587540 |
Tarleton State | 623 | 408 | -0.3451043 |
Morehead State | 912 | 614 | -0.3267544 |
Florida International | 840 | 585 | -0.3035714 |
Milwaukee | 861 | 613 | -0.2880372 |
Boston College | 826 | 597 | -0.2772397 |
College of Charleston | 1072 | 782 | -0.2705224 |
Fairfield | 943 | 729 | -0.2269353 |
Arkansas-Pine Bluff | 898 | 696 | -0.2249443 |
Georgia | 883 | 692 | -0.2163080 |
It’s hard to see here, but the chatter below is also centered (it doesn’t look like it because it fills the space). We can left align that too, but leave it normal weight (i.e. not bold).
|>
threechange gt() |>
cols_label(
PercentChange = "Percent Change"
|>
) tab_header(
title = "Did Maryland Shoot Fewer Threes in 2024-25?",
subtitle = "No, the Terps shot more. But these 10 teams completely changed their offenses."
|> tab_style(
) style = cell_text(color = "black", weight = "bold", align = "left"),
locations = cells_title("title")
|> tab_style(
) style = cell_text(color = "black", align = "left"),
locations = cells_title("subtitle")
)
Did Maryland Shoot Fewer Threes in 2024-25? | |||
---|---|---|---|
No, the Terps shot more. But these 10 teams completely changed their offenses. | |||
Team | 2023-2024 | 2024-2025 | Percent Change |
Temple | 931 | 597 | -0.3587540 |
Tarleton State | 623 | 408 | -0.3451043 |
Morehead State | 912 | 614 | -0.3267544 |
Florida International | 840 | 585 | -0.3035714 |
Milwaukee | 861 | 613 | -0.2880372 |
Boston College | 826 | 597 | -0.2772397 |
College of Charleston | 1072 | 782 | -0.2705224 |
Fairfield | 943 | 729 | -0.2269353 |
Arkansas-Pine Bluff | 898 | 696 | -0.2249443 |
Georgia | 883 | 692 | -0.2163080 |
The next item on the required elements list: Source and credit lines. In gt
, those are called tab_source_notes
and we can add them like this:
|>
threechange gt() |>
cols_label(
PercentChange = "Percent Change"
|>
) tab_header(
title = "Did Maryland Shoot Fewer Threes in 2024-25?",
subtitle = "No, the Terps shot more. But these 10 teams completely changed their offenses."
|> tab_style(
) style = cell_text(color = "black", weight = "bold", align = "left"),
locations = cells_title("title")
|> tab_style(
) style = cell_text(color = "black", align = "left"),
locations = cells_title("subtitle")
|>
) tab_source_note(
source_note = md("**By:** Derek Willis | **Source:** [Sports Reference](https://www.sports-reference.com/cbb/seasons/)")
)
Did Maryland Shoot Fewer Threes in 2024-25? | |||
---|---|---|---|
No, the Terps shot more. But these 10 teams completely changed their offenses. | |||
Team | 2023-2024 | 2024-2025 | Percent Change |
Temple | 931 | 597 | -0.3587540 |
Tarleton State | 623 | 408 | -0.3451043 |
Morehead State | 912 | 614 | -0.3267544 |
Florida International | 840 | 585 | -0.3035714 |
Milwaukee | 861 | 613 | -0.2880372 |
Boston College | 826 | 597 | -0.2772397 |
College of Charleston | 1072 | 782 | -0.2705224 |
Fairfield | 943 | 729 | -0.2269353 |
Arkansas-Pine Bluff | 898 | 696 | -0.2249443 |
Georgia | 883 | 692 | -0.2163080 |
By: Derek Willis | Source: Sports Reference |
We can do a lot with tab_style
. For instance, we can make the headers bold and reduce the size a bit to reduce font congestion in the area.
|>
threechange gt() |>
cols_label(
PercentChange = "Percent Change"
|>
) tab_header(
title = "Did Maryland Shoot Fewer Threes in 2024-25?",
subtitle = "No, the Terps shot more. But these 10 teams completely changed their offenses."
|>
) tab_style(
style = cell_text(color = "black", weight = "bold", align = "left"),
locations = cells_title("title")
|>
) tab_style(
style = cell_text(color = "black", align = "left"),
locations = cells_title("subtitle")
|>
) tab_source_note(
source_note = md("**By:** Derek Willis | **Source:** [Sports Reference](https://www.sports-reference.com/cbb/seasons/)")
|>
) tab_style(
locations = cells_column_labels(columns = everything()),
style = list(
cell_borders(sides = "bottom", weight = px(3)),
cell_text(weight = "bold", size=12)
) )
Did Maryland Shoot Fewer Threes in 2024-25? | |||
---|---|---|---|
No, the Terps shot more. But these 10 teams completely changed their offenses. | |||
Team | 2023-2024 | 2024-2025 | Percent Change |
Temple | 931 | 597 | -0.3587540 |
Tarleton State | 623 | 408 | -0.3451043 |
Morehead State | 912 | 614 | -0.3267544 |
Florida International | 840 | 585 | -0.3035714 |
Milwaukee | 861 | 613 | -0.2880372 |
Boston College | 826 | 597 | -0.2772397 |
College of Charleston | 1072 | 782 | -0.2705224 |
Fairfield | 943 | 729 | -0.2269353 |
Arkansas-Pine Bluff | 898 | 696 | -0.2249443 |
Georgia | 883 | 692 | -0.2163080 |
By: Derek Willis | Source: Sports Reference |
Next up: There’s a lot of lines in this that don’t need to be there. gt
has some tools to get rid of them easily and add in some other readability improvements.
|>
threechange gt() |>
cols_label(
PercentChange = "Percent Change"
|>
) tab_header(
title = "Did Maryland Shoot Fewer Threes in 2024-25?",
subtitle = "No, the Terps shot more. But these 10 teams completely changed their offenses."
|>
) tab_source_note(
source_note = md("**By:** Derek Willis | **Source:** [Sports Reference](https://www.sports-reference.com/cbb/seasons/)")
|>
) tab_style(
style = cell_text(color = "black", weight = "bold", align = "left"),
locations = cells_title("title")
|>
) tab_style(
style = cell_text(color = "black", align = "left"),
locations = cells_title("subtitle")
|>
) tab_style(
locations = cells_column_labels(columns = everything()),
style = list(
cell_borders(sides = "bottom", weight = px(3)),
cell_text(weight = "bold", size=12)
)|>
) opt_row_striping() |>
opt_table_lines("none")
Did Maryland Shoot Fewer Threes in 2024-25? | |||
---|---|---|---|
No, the Terps shot more. But these 10 teams completely changed their offenses. | |||
Team | 2023-2024 | 2024-2025 | Percent Change |
Temple | 931 | 597 | -0.3587540 |
Tarleton State | 623 | 408 | -0.3451043 |
Morehead State | 912 | 614 | -0.3267544 |
Florida International | 840 | 585 | -0.3035714 |
Milwaukee | 861 | 613 | -0.2880372 |
Boston College | 826 | 597 | -0.2772397 |
College of Charleston | 1072 | 782 | -0.2705224 |
Fairfield | 943 | 729 | -0.2269353 |
Arkansas-Pine Bluff | 898 | 696 | -0.2249443 |
Georgia | 883 | 692 | -0.2163080 |
By: Derek Willis | Source: Sports Reference |
We’re in pretty good shape here, but look closer. What else makes this table sub-par? How about the formatting of the percent change? We can fix that with a formatter.
|>
threechange gt() |>
cols_label(
PercentChange = "Percent Change"
|>
) tab_header(
title = "Did Maryland Shoot Fewer Threes in 2024-25?",
subtitle = "No, the Terps shot more. But these 10 teams completely changed their offenses."
|>
) tab_source_note(
source_note = md("**By:** Derek Willis | **Source:** [Sports Reference](https://www.sports-reference.com/cbb/seasons/)")
|>
) tab_style(
style = cell_text(color = "black", weight = "bold", align = "left"),
locations = cells_title("title")
|>
) tab_style(
style = cell_text(color = "black", align = "left"),
locations = cells_title("subtitle")
|>
) tab_style(
locations = cells_column_labels(columns = everything()),
style = list(
cell_borders(sides = "bottom", weight = px(3)),
cell_text(weight = "bold", size=12)
)|>
) opt_row_striping() |>
opt_table_lines("none") |>
fmt_percent(
columns = c(PercentChange),
decimals = 1
)
Did Maryland Shoot Fewer Threes in 2024-25? | |||
---|---|---|---|
No, the Terps shot more. But these 10 teams completely changed their offenses. | |||
Team | 2023-2024 | 2024-2025 | Percent Change |
Temple | 931 | 597 | −35.9% |
Tarleton State | 623 | 408 | −34.5% |
Morehead State | 912 | 614 | −32.7% |
Florida International | 840 | 585 | −30.4% |
Milwaukee | 861 | 613 | −28.8% |
Boston College | 826 | 597 | −27.7% |
College of Charleston | 1072 | 782 | −27.1% |
Fairfield | 943 | 729 | −22.7% |
Arkansas-Pine Bluff | 898 | 696 | −22.5% |
Georgia | 883 | 692 | −21.6% |
By: Derek Willis | Source: Sports Reference |
Throughout the semester, we’ve been using color and other signals to highlight things. Let’s pretend we’re doing a project on Boston College. With a little tab_style
magic, we can change individual rows and add color. The last tab_style
block here will first pass off the styles we want to use – we’re going to make the rows maroon and the text gold – and then for locations we specify where with a simple filter. What that means is that any rows we can address with logic – all rows with a value greater than X, for example – we can change the styling.
|>
threechange gt() |>
cols_label(
PercentChange = "Percent Change"
|>
) tab_header(
title = "Did Maryland Shoot Fewer Threes in 2024-25?",
subtitle = "No, the Terps shot more. But these 10 teams completely changed their offenses."
|>
) tab_source_note(
source_note = md("**By:** Derek Willis | **Source:** [Sports Reference](https://www.sports-reference.com/cbb/seasons/)")
|>
) tab_style(
style = cell_text(color = "black", weight = "bold", align = "left"),
locations = cells_title("title")
|>
) tab_style(
style = cell_text(color = "black", align = "left"),
locations = cells_title("subtitle")
|>
) tab_style(
locations = cells_column_labels(columns = everything()),
style = list(
cell_borders(sides = "bottom", weight = px(3)),
cell_text(weight = "bold", size=12)
)|>
) opt_row_striping() |>
opt_table_lines("none") |>
fmt_percent(
columns = c(PercentChange),
decimals = 1
|>
) tab_style(
style = list(
cell_fill(color = "maroon"),
cell_text(color = "gold")
),locations = cells_body(
rows = Team == "Boston College")
)
Did Maryland Shoot Fewer Threes in 2024-25? | |||
---|---|---|---|
No, the Terps shot more. But these 10 teams completely changed their offenses. | |||
Team | 2023-2024 | 2024-2025 | Percent Change |
Temple | 931 | 597 | −35.9% |
Tarleton State | 623 | 408 | −34.5% |
Morehead State | 912 | 614 | −32.7% |
Florida International | 840 | 585 | −30.4% |
Milwaukee | 861 | 613 | −28.8% |
Boston College | 826 | 597 | −27.7% |
College of Charleston | 1072 | 782 | −27.1% |
Fairfield | 943 | 729 | −22.7% |
Arkansas-Pine Bluff | 898 | 696 | −22.5% |
Georgia | 883 | 692 | −21.6% |
By: Derek Willis | Source: Sports Reference |
Two things here:
- Dear God that color scheme is awful, which is fitting for a school that has a non-rhyming mascot.
- We’ve arrived where we want to be: We’ve created a clear table that allows a reader to compare schools at will while also using color to draw attention to the thing we want to draw attention to. We’ve kept it simple so the color has impact.