python - 从目录中删除没有特定单词的文本文件
问题描述
我有一个目录,里面有 ~2200 个文本文件。我需要删除任何不包含我定义的特定单词的文本文件。有人可以看看这段代码并就如何让它工作提出建议吗?现在,当我运行它时,它说它找不到目录“C”。
另外,我想确保该目录中的每个文件都运行它。我需要包含下一个功能吗?
import os
path = r'C:\Users\user\Desktop\AFL codes to test'
words = ['buy', 'sell']
for root, dirs, files in os.walk(path):
for file in path:
if not any(words in file for words in words):
os.remove(file)
此外,这是完整的回溯:
runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
Traceback (most recent call last):
File "<ipython-input-23-dbc80e182b2b>", line 1, in <module>
runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 705, in runfile
execfile(filename, namespace)
File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 102, in execfile
exec(compile(f.read(), filename, 'exec'), namespace)
File "C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py", line 9, in <module>
os.remove(file)
FileNotFoundError: [WinError 2] The system cannot find the file specified: 'C'
This is the error after trying shutil.rmtree
runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
Traceback (most recent call last):
File "<ipython-input-16-dbc80e182b2b>", line 1, in <module>
runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 705, in runfile
execfile(filename, namespace)
File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 102, in execfile
exec(compile(f.read(), filename, 'exec'), namespace)
File "C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py", line 12, in <module>
shutil.rmtree(full_path)
File "C:\Users\user\Anaconda31\lib\shutil.py", line 494, in rmtree
return _rmtree_unsafe(path, onerror)
File "C:\Users\user\Anaconda31\lib\shutil.py", line 376, in _rmtree_unsafe
onerror(os.listdir, path, sys.exc_info())
File "C:\Users\user\Anaconda31\lib\shutil.py", line 374, in _rmtree_unsafe
names = os.listdir(path)
NotADirectoryError: [WinError 267] The directory name is invalid: 'C:/Users/user/Desktop/AFL codes to test/newfile1.txt'
解决方案
您应该用常规斜杠替换反斜杠。
path = r'C:\Users\user\Desktop\AFL codes to test'
应该
path = 'C:/Users/user/Desktop/AFL codes to test'
编辑:这是应该让你去的完整代码:
import os
path = 'C:/Users/user/Desktop/AFL codes to test'
words = ['buy', 'sell']
files = os.listdir(path)
for each_file in files:
full_path = "%s/%s" % (path, each_file)
each_file_content = open(full_path, 'r', encoding="utf-8").read()
if not any(word in each_file_content for word in words):
os.unlink(full_path)
推荐阅读
- android - Android Oreo 上的广播接收器
- javascript - 找不到图像/无法获取 /uploads/image nodejs REST API
- linux - 安装审计时出错
- c# - Payfast 通知 url 测试 c#
- javascript - 有什么方法可以读取 openlayers 4 中的 geojson 属性吗?
- java - 从游戏返回菜单
- arduino - 如果 rx 和 tx 线断开连接,如何制作一个执行某些操作的程序?
- google-compute-engine - 关于如何将承诺折扣应用于不同类型的实例
- c# - 如何使用 C# 客户端在 VoltDB 中创建触发器?
- node.js - 如何使用 Firebase-admin 为 NodeJs 设置 timestampsInSnapshots 设置?