I would like to calculate readability scores in R 3.3.2 (RStudio 3.4 for Windows) using the koRpus package for several .txt files and save the results to Excel, SQLite3, or a text file. At the moment I can only calculate the readability scores for a single file and print them to the console. I tried to improve the code by looping over a directory, but it does not work correctly.
library(koRpus)
library(tm)
# Loop through files
path <- "D://Reports"
out.file <- ""
file.names <- dir(path, pattern = ".txt")
for (i in 1:length(file.names)) {
  file <- read.table(file.names[i], header = TRUE, sep = ";", stringsAsFactors = FALSE)
  out.file <- rbind(out.file, file)
}
# Only one file
report <- tokenize(txt = file, format = "file", lang = "en")
# SMOG-Index
results_smog <- SMOG(report)
summary(results_smog)
# Flesch/Kincaid-Index
results_fleshkin <- flesch.kincaid(report)
summary(results_fleshkin)
# FOG-Index
results_fog <- FOG(report)
summary(results_fog)
I ran into the same problem. I was looking through Stack Overflow for a solution and saw your post. After some trial and error, I came up with the following code, which worked fine for me. I stripped out all the extra information. To find the index values of the scores I was looking for, I first ran it on one file and looked at the summary of the readability() wrapper. It gives you a table of many different values; match the column with the row to find the specific number you want. There are lots of different options.
In the path directory, each document should be its own separate plain-text file.
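For reference, here is a minimal sketch of the loop-over-a-directory approach described above. It is not the exact code I ran, just one way to do it under a few assumptions: the function names `score_file` and `score_dir` are made up for illustration, the three indices are the ones from the question, and `summary()` on the readability result is assumed to return a table that `data.frame()` can take (which is how it behaves in the koRpus versions I know of). Instead of picking numbers out by position, it keeps the whole summary table per file and stacks them.

```r
# Sketch only. Newer koRpus releases may also need the language package
# loaded first, e.g. library(koRpus.lang.en); that is an assumption about
# your installed version.

# Score one file: tokenize it, run the readability wrapper for the three
# indices from the question, and prepend the file name to the summary table.
score_file <- function(f) {
  tagged <- koRpus::tokenize(txt = f, format = "file", lang = "en")
  res <- koRpus::readability(tagged, index = c("SMOG", "Flesch.Kincaid", "FOG"))
  data.frame(file = basename(f), summary(res), stringsAsFactors = FALSE)
}

# Apply a scorer to every .txt file in a directory and stack the results
# into one long table (one row per file and index).
score_dir <- function(path, scorer = score_file) {
  # full.names = TRUE so the files are found regardless of the working directory
  files <- list.files(path, pattern = "\\.txt$", full.names = TRUE)
  do.call(rbind, lapply(files, scorer))
}

# Usage, with the path from the question and a hypothetical output file name:
# scores <- score_dir("D:/Reports")
# write.csv(scores, file.path("D:/Reports", "readability_scores.csv"), row.names = FALSE)
```

The CSV opens directly in Excel. If you would rather have SQLite, the same data frame could probably be handed to `DBI::dbWriteTable()` with an RSQLite connection, but I have not tried that here.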