python - python在直方图中标记新数据点
问题描述
我目前正在使用此代码绘制直方图。
import matplotlib.pyplot as plt
import numpy as np
from matplotlib.ticker import FormatStrFormatter
data = np.random.randn(82)
fig, ax = plt.subplots()
counts, bins, patches = ax.hist(data, facecolor='yellow', edgecolor='gray')
# Set the ticks to be at the edges of the bins.
ax.set_xticks(bins)
# Set the xaxis's tick labels to be formatted with 1 decimal place...
ax.xaxis.set_major_formatter(FormatStrFormatter('%0.1f'))
# Change the colors of bars at the edges...
twentyfifth, seventyfifth = np.percentile(data, [25, 75])
for patch, rightside, leftside in zip(patches, bins[1:], bins[:-1]):
if rightside < twentyfifth:
patch.set_facecolor('green')
elif leftside > seventyfifth:
patch.set_facecolor('red')
# Label the raw counts and the percentages below the x-axis...
bin_centers = 0.5 * np.diff(bins) + bins[:-1]
for count, x in zip(counts, bin_centers):
# Label the raw counts
ax.annotate(str(count), xy=(x, 0), xycoords=('data', 'axes fraction'),
xytext=(0, -18), textcoords='offset points', va='top', ha='center')
# Label the percentages
percent = '%0.0f%%' % (100 * float(count) / counts.sum())
ax.annotate(percent, xy=(x, 0), xycoords=('data', 'axes fraction'),
xytext=(0, -32), textcoords='offset points', va='top', ha='center')
# Give ourselves some more room at the bottom of the plot
plt.subplots_adjust(bottom=0.15)
plt.show()
我想在直方图上添加给定直方图 x 轴值的 x 标记(标有“橙色”、“苹果”、“菠萝”),如下所示:
我该怎么做?
x 标记没有y 值。
解决方案
除了这些行之外,所有内容都保持不变:
...
# Change the colors of bars at the edges...
left = []
right = []
twentyfifth, seventyfifth = np.percentile(data, [25, 75])
for patch, rightside, leftside in zip(patches, bins[1:], bins[:-1]):
if rightside < twentyfifth:
patch.set_facecolor('green')
left.append(leftside)
elif leftside > seventyfifth:
patch.set_facecolor('red')
right.append(rightside)
ax.text(left[int(len(left)/2)], 1, 'orange\n x')
ax.text(right[0], 1, 'pineapple\n x')
ax.text((left[int(len(left)/2)] + right[0]) / 2, 1, 'apple\n x')
# Label the raw counts and the percentages below the x-axis...
bin_centers = 0.5 * np.diff(bins) + bins[:-1]
...
输出:
- -编辑 - -
OP 添加了数据并要求进行编辑。
问题中来自 OP 的代码保持原样,之后将添加以下行。
data = {'product_name': ['laptop', 'printer', 'tablet', 'desk', 'chair'],'price': [2, 0.1, 2.4, 2.2, 1]}
ax.scatter(data['price'], [1]*len(data['price']), zorder=2, marker='x', c='k')
for i in range(len(data['price'])):
ax.text(data['price'][i]-0.2, 1.5, f"{data['product_name'][i]}")
输出:
注释是重叠的,但这是预期的,因为 OP 给出了 x 值(彼此非常接近)。
推荐阅读
- r - 使用 R 获取 GeoJSON 中多边形的邻接矩阵
- python - 用 mpf.plot mplfinance 打印两个点
- django - 如何允许 http django heroku
- c# - 如何使用 c# 获取或设置自定义 appsetting.json
- javascript - 在 pdf-lib 中设置 Pdf 图像注释的链接
- r - R Shiny:从动态创建的存储桶列表中打印输出
- java - 从 XSD 生成类删除命名空间引用
- ios - 将数据从 ViewContoller 快速传递到 popOver ViewController
- php - PHP Laravel - 控制器不存在
- python - 长初始化方法的 Python 最佳实践