我有两个文本文件,file1 和 file2.
I have two text files, file1 and file2.
File1 包含一堆随机单词,而 file2 包含我想在出现时从 file1 中删除的单词.有没有办法做到这一点?
File1 contains a bunch of random words, and file2 contains words that I want to remove from file1 when they occur.
Is there a way of doing this?
我知道我可能应该在脚本中加入我自己的尝试,至少是为了显示努力,但老实说,这很可笑,不会有任何帮助.
I know I probably should include my own attempt at a script, to at least show effort, but to be honest it's laughable and wouldn't be of any help.
如果有人至少可以提供关于从哪里开始的提示,将不胜感激.
If someone could at least give a tip about where to start, it would be greatly appreciated.
获取每个单词:
f1 = open("/path/to/file1", "r")
f2 = open("/path/to/file2", "r")
file1_raw = f1.read()
file2_raw = f2.read()
file1_words = file1_raw.split()
file2_words = file2_raw.split()
如果您想要 file1 中不在 file2 中的唯一单词:
if you want unique words from file1 that aren't in file2:
result = set(file1_words).difference(set(file2_words))
如果您关心从 file1 的文本中删除单词
if you care about removing the words from the text of file1
for w in file2_words:
file1_raw = file1_raw.replace(w, "")
这篇关于比较两个文本文件并删除python中的重复项的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持html5模板网!
如何在python中的感兴趣区域周围绘制一个矩形How to draw a rectangle around a region of interest in python(如何在python中的感兴趣区域周围绘制一个矩形)
如何使用 OpenCV 检测和跟踪人员?How can I detect and track people using OpenCV?(如何使用 OpenCV 检测和跟踪人员?)
如何在图像的多个矩形边界框中应用阈值?How to apply threshold within multiple rectangular bounding boxes in an image?(如何在图像的多个矩形边界框中应用阈值?)
如何下载 Coco Dataset 的特定部分?How can I download a specific part of Coco Dataset?(如何下载 Coco Dataset 的特定部分?)
根据文本方向检测图像方向角度Detect image orientation angle based on text direction(根据文本方向检测图像方向角度)
使用 Opencv 检测图像中矩形的中心和角度Detect centre and angle of rectangles in an image using Opencv(使用 Opencv 检测图像中矩形的中心和角度)