Is there a way to add number of observations to plot label using ggpmisc::stat_poly_eq

766 views Asked by At
x <- 1:100
y <- (x + x^2 + x^3) + rnorm(length(x), mean = 0, sd = mean(x^3) / 4)
my.data <- data.frame(x = x, y = y,
                      group = c("A", "B"),
                      y2 = y * c(0.5,2),
                      w = sqrt(x))

formula <- y ~ poly(x, 3, raw = TRUE)

ggplot(my.data, aes(x, y)) +
  geom_point() +
  geom_smooth(method = "lm", formula = formula) +
  stat_poly_eq(formula = formula, parse = TRUE)

like so:

enter image description here

3

There are 3 answers

3
kwes On BEST ANSWER
ggplot(my.data, aes(x, y)) +
  geom_point() +
  geom_smooth(method = "lm", formula = formula) + 
  ggpmisc::stat_poly_eq(aes(label = paste(stat(rr.label), paste("N ~`=`~", nrow(my.data)), sep = "*\", \"*")), formula = formula, parse=T)

You can include additional text with aes. Since the string is parsed you have to escape the equals sign with ~`=`~.

Edit: With faceting

You can create an additional unused mapping of group row counts to be used as a variable in the paste statement in lieu of nrow(my.data).

ggplot(my.data %>% group_by(group) %>% mutate(n = n()), aes(x, y, n = n)) +
  geom_point() +
  geom_smooth(method = "lm", formula = formula) +
  facet_grid(vars(group)) + 
  ggpmisc::stat_poly_eq(aes(label = paste(stat(rr.label), paste("N ~`=`~", n), sep = "*\", \"*")), 
                        formula = formula, parse=T)
0
glensbo On

I found this approach:

library(gginnards)
ggplot(my.data, aes(x, y)) +
+     geom_point() +
+     geom_smooth(method = "lm", formula = formula) +
+     stat_poly_eq(formula = formula, geom = "debug",
+                  summary.fun = colnames)

Which gave me this list of:

Input 'data' to 'draw_panel()':
 [1] "npcx"          "npcy"          "label"         "eq.label"     
 [5] "rr.label"      "adj.rr.label"  "AIC.label"     "BIC.label"    
 [9] "f.value.label" "p.value.label" "n.label"       "grp.label"    
[13] "r.squared"     "adj.r.squared" "p.value"       "n"            
[17] "x"             "y"             "PANEL"         "group" 

Combined with information from this site: https://docs.r4photobiology.info/ggpmisc/reference/stat_poly_eq.html

you should be able to solve most of your questions.

0
Pedro J. Aphalo On

Starting from 'ggpmisc' (0.5.0) the assembly of the labels is made easier with the new helper function use_label().

library(ggpmisc)
#> Loading required package: ggpp
#> Loading required package: ggplot2
#> 
#> Attaching package: 'ggpp'
#> The following object is masked from 'package:ggplot2':
#> 
#>     annotate

x <- 1:100
y <- (x + x^2 + x^3) + rnorm(length(x), mean = 0, sd = mean(x^3) / 4)
my.data <- data.frame(x = x, y = y,
                      group = c("A", "B"),
                      y2 = y * c(0.5,2),
                      w = sqrt(x))

formula <- y ~ poly(x, 3, raw = TRUE)

ggplot(my.data, aes(x, y)) +
  geom_point() +
  geom_smooth(method = "lm", formula = formula) +
  stat_poly_eq(formula = formula, parse = TRUE,
               mapping = use_label(c("R2", "n")))

Created on 2022-09-06 with reprex v2.0.2