我有数据:
+-----------+---------+----------+
| AGE_group | mean_y | count_y |
+-----------+---------+----------+
| 1 | 0.141 | 1115 |
| 2 | 0.196 | 1043 |
| 3 | 0.202 | 1093 |
| 4 | 0.114 | 1123 |
+-----------+---------+----------+
我用ggplot:
g_age <- ggplot(AGE_group_df, aes(AGE_group, mean_y, group = 1)) +
geom_line(size=1, color='blue') +
geom_point(aes(size=AGE_group_df$count_y), color='darkblue') +
labs(x = 'Age Group',
y='% Subscribe',
title='Age Group and Subscribe Rate',
size='# Customer')
g_age
问题:第二点的大小(计数为1043)与其他点相比非常小。
问题:如何更改积分的大小? (我希望保持所有4个点的大小几乎相等),同时保持原始计数单位。
非常感谢你。
您可以使用(至少)三种方法
scale_size_area
:scale_size_area()
scale_size
的范围参数:scale_size(range = 4:5)
count_y
:scale_size(limits = c(0, max(AGE_group_df$count_y))
这里更详细:一个选项是使用scale_size_area
:
AGE_group_df <- data.frame(AGE_group = 1:4,
mean_y = c(0.141, 0.196, 0.202, 0.114),
count_y = c(1115, 1043, 1093, 1123))
ggplot(AGE_group_df, aes(x = AGE_group, y = mean_y, size = count_y)) +
geom_line(size=1, color='blue') +
scale_size_area(breaks = round(seq(min(AGE_group_df$count_y),
max(AGE_group_df$count_y), length.out = 4), 0)) +
geom_point(color='darkblue') +
labs(x = 'Age Group',
y='% Subscribe',
title='Age Group and Subscribe Rate',
size='# Customer')
另一个在scale_size
中手动定义范围参数:
ggplot(AGE_group_df, aes(x = AGE_group, y = mean_y, size = count_y)) +
geom_line(size=1, color='blue') +
scale_size(breaks = round(seq(min(AGE_group_df$count_y),
max(AGE_group_df$count_y), length.out = 4), 0), range = 4:5) +
geom_point(color='darkblue') +
labs(x = 'Age Group',
y='% Subscribe',
title='Age Group and Subscribe Rate',
size='# Customer')
ggplot(AGE_group_df, aes(x = AGE_group, y = mean_y, size = count_y)) +
geom_line(size=1, color='blue') +
scale_size(breaks = round(seq(min(AGE_group_df$count_y),
max(AGE_group_df$count_y), length.out = 4), 0),
limits = c(0, max(AGE_group_df$count_y))) +
geom_point(color='darkblue') +
labs(x = 'Age Group',
y='% Subscribe',
title='Age Group and Subscribe Rate',
size='# Customer')
请注意,您可以在aes
函数中定义大小。我添加了断点来显示最小值和最大值,但这不是必需的。这只是一个额外的壮举。