| stri_stats_general {stringi} | R Documentation |
This function gives general statistics for a character vector,
e.g. obtained by loading a text file with the
readLines or stri_read_lines function,
where each text line' is represented by a separate string.
stri_stats_general(str)
str |
character vector to be aggregated |
Any of the strings must not contain \r or \n characters,
otherwise you will get at error.
Below by 'white space' we mean the Unicode binary property
WHITE_SPACE, see stringi-search-charclass.
Returns an integer vector with the following named elements:
Lines - number of lines (number of
non-missing strings in the vector);
LinesNEmpty - number of lines with at least
one non-WHITE_SPACE character;
Chars - total number of Unicode code points detected;
CharsNWhite - number of Unicode code points
that are not WHITE_SPACEs;
... (Other stuff that may appear in future releases of stringi).
Other stats: stri_stats_latex
s <- c("Lorem ipsum dolor sit amet, consectetur adipisicing elit.",
"nibh augue, suscipit a, scelerisque sed, lacinia in, mi.",
"Cras vel lorem. Etiam pellentesque aliquet tellus.",
"")
stri_stats_general(s)