python - Finding the most common values with Pandas GroupBy and value_counts
Problem description
I am working with two columns of a table.
+-------------+--------------------------------------------------------------+
| Area Name | Code Description |
+-------------+--------------------------------------------------------------+
| N Hollywood | VIOLATION OF RESTRAINING ORDER |
| N Hollywood | CRIMINAL THREATS - NO WEAPON DISPLAYED |
| N Hollywood | CRIMINAL THREATS - NO WEAPON DISPLAYED |
| N Hollywood | ASSAULT WITH DEADLY WEAPON, AGGRAVATED ASSAULT |
| Southeast | ASSAULT WITH DEADLY WEAPON, AGGRAVATED ASSAULT |
| West Valley | CRIMINAL THREATS - NO WEAPON DISPLAYED |
| West Valley | CRIMINAL THREATS - NO WEAPON DISPLAYED |
| 77th Street | RAPE, FORCIBLE |
| Foothill | CRM AGNST CHLD (13 OR UNDER) (14-15 & SUSP 10 YRS OLDER)0060 |
| N Hollywood | VANDALISM - FELONY ($400 & OVER, ALL CHURCH VANDALISMS) 0114 |
+-------------+--------------------------------------------------------------+
I am using groupby and value_counts to count the code descriptions within each area name.
df.groupby(['Area Name'])['Code Description'].value_counts()
Is there a way to see only the top "n" values for each area name? If I append .nlargest(3) to the code above, it returns results for only one area name.
+---------------------------------------------------------------------------------+
| Wilshire SHOPLIFTING-GRAND THEFT ($950.01 & OVER) 7 |
+---------------------------------------------------------------------------------+
Solution
Call head on the result of value_counts within each group:
df.groupby('Area Name')['Code Description'].apply(lambda x: x.value_counts().head(3))
Output:
Area Name
77th Street RAPE, FORCIBLE 1
Foothill CRM AGNST CHLD (13 OR UNDER) (14-15 & SUSP 10 YRS OLDER)0060 1
N Hollywood CRIMINAL THREATS - NO WEAPON DISPLAYED 2
VIOLATION OF RESTRAINING ORDER 1
ASSAULT WITH DEADLY WEAPON, AGGRAVATED ASSAULT 1
Southeast ASSAULT WITH DEADLY WEAPON, AGGRAVATED ASSAULT 1
West Valley CRIMINAL THREATS - NO WEAPON DISPLAYED 2
Name: Code Description, dtype: int64
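A possible alternative that avoids apply, shown here as a minimal sketch rather than the definitive approach: by default, value_counts on a grouped column sorts the counts in descending order within each group, so grouping the result again by the first index level and taking head(n) keeps the top n rows per area. The DataFrame below just rebuilds the sample rows from the question, and the helper name top_n_per_area is hypothetical. The last line illustrates why .nlargest(3) did not help: it ranks the whole Series of counts at once, not each area separately.

```python
import pandas as pd

# Sample rows copied from the table in the question.
rows = [
    ("N Hollywood", "VIOLATION OF RESTRAINING ORDER"),
    ("N Hollywood", "CRIMINAL THREATS - NO WEAPON DISPLAYED"),
    ("N Hollywood", "CRIMINAL THREATS - NO WEAPON DISPLAYED"),
    ("N Hollywood", "ASSAULT WITH DEADLY WEAPON, AGGRAVATED ASSAULT"),
    ("Southeast", "ASSAULT WITH DEADLY WEAPON, AGGRAVATED ASSAULT"),
    ("West Valley", "CRIMINAL THREATS - NO WEAPON DISPLAYED"),
    ("West Valley", "CRIMINAL THREATS - NO WEAPON DISPLAYED"),
    ("77th Street", "RAPE, FORCIBLE"),
    ("Foothill", "CRM AGNST CHLD (13 OR UNDER) (14-15 & SUSP 10 YRS OLDER)0060"),
    ("N Hollywood", "VANDALISM - FELONY ($400 & OVER, ALL CHURCH VANDALISMS) 0114"),
]
df = pd.DataFrame(rows, columns=["Area Name", "Code Description"])


def top_n_per_area(frame, n=3):
    """Hypothetical helper: the n most frequent descriptions per area."""
    # Counts are sorted in descending order inside each area by default.
    counts = frame.groupby("Area Name")["Code Description"].value_counts()
    # Regroup by the first index level (the area) and keep the first n rows.
    return counts.groupby(level=0).head(n)


top3 = top_n_per_area(df, n=3)
print(top3)

# For comparison: nlargest ranks the whole Series, so it returns only
# 3 rows in total instead of 3 rows per area.
counts = df.groupby("Area Name")["Code Description"].value_counts()
global_top3 = counts.nlargest(3)
```

When several descriptions tie for a place in an area's top n, which of them is kept depends on how pandas orders the ties, so the exact rows may vary for tied counts.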