merge or mutate a summary (dplyr)

Question

merge or mutate a summary (dplyr)

2.7k views Asked by giac At 08 June 2015 at 14:55

I am always unsure how to retrieve a summary with dplyr.

Let us suppose I have a summary of individuals and households.

dta = rbind(c(1, 1, 45), 
  c(1, 2, 47), 
  c(2, 1, 24),
  c(2, 2, 26), 
  c(3, 1, 67), 
  c(4, 1, 20),
  c(4, 2, 21),
  c(5, 3, 7)
 ) 
dta = as.data.frame(dta)
colnames(dta) = c('householdid', 'id', 'age')

 householdid id age
           1  1  45
           1  2  47
           2  1  24
           2  2  26
           3  1  67
           4  1  20
           4  2  21
           4  3   7

Imagine I want to calculate the number of person in the household and the mean age by households and then re-use this information in the original dataset.

dta %>% 
  group_by(householdid) %>% 
  summarise( nhouse = n(), meanAgeHouse = mean(age) ) %>% 
  merge(., dta, all = T)

I am often using merge, but it is slow sometimes when the dataset is huge.
Is it possible to

mutate

instead of

merge ?

Original Q&A

There are 1 answers

**3pitt** · Answer 1 · 2017-10-17T15:25:50+00:00

3pitt On 17 October 2017 at 15:25

dta %>% group_by(householdid) %>% mutate( nhouse = n(), meanAgeHouse = mean(age) )

TechQA.

merge or mutate a summary (dplyr)

There are 1 answers

Related Questions in R

Related Questions in MERGE

Related Questions in DPLYR

Related Questions in SUMMARY

Related Questions in MUTATED

Popular Questions

Popular Tags

Trending Questions