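In the DATA below, how can I table() the variable lang_comb while ignoring order? For example, "english spanish french" and "french english spanish" should be treated as the same combination.
Is there a way to tabulate the variable lang_comb this way in R?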
library(tidyverse)
DATA <- read.table(header = TRUE, text = "
ID lang1 lang2 lang3
1 spanish english NA
2 english spanish french
3 russian english NA
4 french english spanish
5 english russian NA
6 english french NA")
combs <- DATA %>%
  mutate(lang_comb = paste(lang1, lang2, lang3, sep = ","))
with(combs, table(lang_comb))
     english,french,NA     english,russian,NA english,spanish,french french,english,spanish     russian,english,NA
                     1                      1                      1                      1                      1
    spanish,english,NA
                     1
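# Sort the languages within each row so that order no longer matters.
# sort() drops NA by default, so missing languages are left out of the key.
DATA |>
  rowwise() |>
  mutate(lang = paste(sort(c_across(starts_with("lang"))), collapse = ",")) |>
  pull(lang) |>
  table()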
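
For comparison, the same idea (sort within each row, then tabulate) can be sketched in base R without the tidyverse. This is only an illustrative alternative, not the only way to do it; the names lang_cols, lang_key and counts are made up for this example, and it assumes every language column name starts with "lang".

```r
# Rebuild the sample data from the question.
DATA <- read.table(header = TRUE, text = "
ID lang1 lang2 lang3
1 spanish english NA
2 english spanish french
3 russian english NA
4 french english spanish
5 english russian NA
6 english french NA")

# Assumption: the language columns are exactly those whose names start with "lang".
lang_cols <- grep("^lang", names(DATA), value = TRUE)

# Build an order-independent key per row: sort the languages alphabetically
# (sort() drops NA by default) and paste them together.
lang_key <- apply(DATA[lang_cols], 1, function(x) paste(sort(x), collapse = ","))

# Tabulate the keys; rows with the same set of languages now share one key.
counts <- table(lang_key)
print(counts)
```

On the sample DATA, rows 2 and 4 collapse into "english,french,spanish" and rows 3 and 5 into "english,russian", so each of those keys is counted twice, while "english,french" and "english,spanish" are each counted once.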