r - 表中的错误(`202004.citibike.tripdata`):尝试使用 >= 2^31 个元素创建表
问题描述
我正在尝试绘制一个表格,但我不断收到错误尝试使用 2^31 个元素制作表格
性别频率
barplot(table(`202004.citibike.tripdata`), xlab="Gender",ylab="Frequency", main="Frequency Gender")
这是我的代码行 https://s3.amazonaws.com/tripdata/index.html 我的数据集可以从任何这些 csv 文件中使用。我应该在代码行中添加什么?
解决方案
如果您愿意使用该软件包ggplot2
制作条形图,这里有一个使用“2014 - 01 Citi 自行车旅行数据”文件中的前 10 个条目的解决方案。
设置测试数据:
df <-
structure(list(tripduration = c(471, 1494, 464, 373, 660, 330,
261, 337, 429, 1025), starttime = structure(c(1388534406, 1388534438,
1388534639, 1388534715, 1388534718, 1388534755, 1388534764, 1388534801,
1388534853, 1388534907), tzone = "UTC", class = c("POSIXct",
"POSIXt")), stoptime = structure(c(1388534877, 1388535932, 1388535103,
1388535088, 1388535378, 1388535085, 1388535025, 1388535138, 1388535282,
1388535932), tzone = "UTC", class = c("POSIXct", "POSIXt")),
`start station id` = c(2009, 536, 228, 519, 83, 422, 516,
380, 296, 540), `start station name` = c("Catherine St & Monroe St",
"1 Ave & E 30 St", "E 48 St & 3 Ave", "Pershing Square N",
"Atlantic Ave & Fort Greene Pl", "W 59 St & 10 Ave", "E 47 St & 1 Ave",
"W 4 St & 7 Ave S", "Division St & Bowery", "Lexington Ave & E 26 St"
), `start station latitude` = c(40.71117444, 40.74144387,
40.7546011026, 40.75188406, 40.68382604, 40.770513, 40.75206862,
40.73401143, 40.71413089, 40.74147286), `start station longitude` = c(-73.99682619,
-73.97536082, -73.971878855, -73.97770164, -73.97632328,
-73.988038, -73.96784384, -74.00293877, -73.9970468, -73.98320928
), `end station id` = c(263, 259, 2022, 526, 436, 526, 167,
435, 306, 447), `end station name` = c("Elizabeth St & Hester St",
"South St & Whitehall St", "E 59 St & Sutton Pl", "E 33 St & 5 Ave",
"Hancock St & Bedford Ave", "E 33 St & 5 Ave", "E 39 St & 3 Ave",
"W 21 St & 6 Ave", "Cliff St & Fulton St", "8 Ave & W 52 St"
), `end station latitude` = c(40.71729, 40.70122128, 40.75849116,
40.74765947, 40.68216564, 40.74765947, 40.7489006, 40.74173969,
40.70823502, 40.76370739), `end station longitude` = c(-73.996375,
-74.01234218, -73.95920622, -73.98490707, -73.95399026, -73.98490707,
-73.97604882, -73.99415556, -74.00530063, -73.9851615), bikeid = c(16379,
15611, 16613, 15938, 19830, 17343, 17880, 16275, 17318, 15525
), usertype = c("Subscriber", "Subscriber", "Subscriber",
"Subscriber", "Subscriber", "Subscriber", "Subscriber", "Subscriber",
"Subscriber", "Subscriber"), `birth year` = c("1986", "1963",
"1991", "1989", "1990", "1987", "1983", "1963", "1972", "1981"
), gender = c(1, 1, 1, 1, 1, 1, 1, 1, 2, 1)), row.names = c(NA,
-10L), class = c("tbl_df", "tbl", "data.frame"))
加载我们的包并制作按性别计算的行的条形图:
library(tidyverse)
df %>%
ggplot() +
geom_bar(aes(x=gender))
推荐阅读
- c# - SQL 如何替换一个值
- python - 使用 python 和请求,如何进行正确的 POST 调用以及在哪里找到标头?
- android - FirebaseAppDistribution:appDistributionUpload gradle 命令中缺少应用程序 ID
- mysql - 跨列和行计算值的实例
- javascript - 使用 redux 钩子时使用 redux 操作的最佳选择是什么?
- sql - Impala 2.11:AnalysisException:选择列表中不支持子查询
- swift - 在 SwiftUI 中获取当前的经纬度
- c++ - 是否可以禁止一个类被动态转换为?
- git - Git - 将发布分支合并到主分支
- tensorflow - 任何想法如何解决激活函数的问题?