如何替换数据框中某列的多个值？

Question

我有一个包含几列的数据框，这是其中一列的示例：

df <- data.frame(x=1:3)

数字1代表“是”，2代表“否”，3代表“也许”。我想出的一种解决方案是更改变量的类，然后使用：

df$x <- replace(df$x, "1", "Yes")

并重复“否”和“也许”。然而其中一列有 27 个不同的值，代表 27 个不同的单词，这样代码会太大。

关于如何有效地用单词替换数字有什么想法吗？

Answer 1

您可以使用

mapvalues()

中的

plyr

:

library(plyr)
x <- c("a", "b", "c")
mapvalues(x, c("a", "c"), c("A", "C"))

[1] "A" "b" "C"

就你而言，

df <- data.frame(x=1:3)
mapvalues(df$x, c(1,3,2), c("Yes","Maybe","No"))

[1] "Yes"   "No"    "Maybe"

由于

plyr

已停用，您无需使用以下代码调用包即可完成此操作（直接从

body(mapvalues)

复制）。

my_mapvalues <- function(x, from, to, warn_missing = TRUE) {
    if (length(from) != length(to)) {
        stop("`from` and `to` vectors are not the same length.")
    }
    if (!is.atomic(x)) {
        stop("`x` must be an atomic vector.")
    }
    if (is.factor(x)) {
        levels(x) <- mapvalues(levels(x), from, to, warn_missing)
        return(x)
    }
    mapidx <- match(x, from)
    mapidxNA <- is.na(mapidx)
    from_found <- sort(unique(mapidx))
    if (warn_missing && length(from_found) != length(from)) {
        message("The following `from` values were not present in `x`: ", 
            paste(from[!(1:length(from) %in% from_found)], collapse = ", "))
    }
    x[!mapidxNA] <- to[mapidx[!mapidxNA]]
    x
}

Answer 2

也许还有另一种方式：

df <- sample(1:3, 100, replace = T)
a <- c(1,2,3)
b <- c('Yes', 'No', 'Maybe')
df[df %in% a] <- na.omit(b[match(df, a)])

或者像一个因素一样处理：

df <- sample(1:3, 100, replace = T)
df <- as.character(factor(df, labels = c('Yes', 'No', 'Maybe')))

如何替换数据框中某列的多个值？

问题描述投票：0回答：2

2个回答

最新问题

如何替换数据框中某列的多个值？

问题描述 投票：0回答：2

2个回答

最新问题

问题描述投票：0回答：2