How to take the same random sample from a dataset every time

Question

Asked by Connor on 07 June 2015 at 22:10 (3.6k views)

I have a dataset that consists of nearly 7 million observations and I want to take a random sample of the data to analyze just a subset. I know how to take a random sample of the data:

index <- sample(7009728, 50000)
flights <- flight[index, ]

Is there a way to take a random sample such that, once it has been created, I always get the same sample back? I'm hoping to do this without having to rely on saving my R project.


**zero323** · Accepted Answer · 2015-06-07T22:15:57+00:00

Simply use set.seed just before you create index:

> set.seed(1)
> index <- sample(7009728, 50000)
> head(index)
[1] 1861144 2608487 4015546 6366287 1413735 6297463

It sets the random number generator's seed, so the generator produces the same sequence of numbers, and therefore the same sample, every time the code is run.
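As a minimal sketch of how this could be wrapped up for reuse: the small `flight` data frame and the `sample_rows` helper below are hypothetical stand-ins (not from the question), and the helper simply reseeds before every draw so that repeated calls return the same rows.

```r
# Hypothetical stand-in for the real `flight` data (10,000 rows instead of ~7 million).
flight <- data.frame(id = seq_len(10000), delay = rnorm(10000))

# Hypothetical helper: reset the seed, then sample, so each call picks the same rows.
sample_rows <- function(data, n, seed = 1) {
  set.seed(seed)                          # reset the RNG state before sampling
  index <- sample(nrow(data), n)          # same seed => same row indices
  data[index, , drop = FALSE]
}

flights_a <- sample_rows(flight, 500)
flights_b <- sample_rows(flight, 500)

# Both calls selected exactly the same rows.
identical(flights_a, flights_b)
```

Two caveats. Calling set.seed inside a function resets the global RNG state, so any random draws made afterwards are affected too. Also, R 3.6.0 changed the default algorithm used by sample, so the exact indices shown above may differ on newer versions; results stay reproducible within one R version, and `RNGkind(sample.kind = "Rounding")` should restore the older behaviour if the original numbers are needed.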
