我想使用python脚本对文件夹中的一堆文件执行一些统计计算。
到现在为止,我能够选择整个文件夹,对文件进行排序并执行计算。
def numericalSort(value):
parts = numbers.split(value)
parts[1::2] = map(int, parts[1::2])
return parts
for infile in sorted(glob.glob('*.txt'), key=numericalSort):
if infile == "log.txt":
continue
print "Current File Being Processed is: " + infile
log.write("~~File " + infile + " has been Opened Succesfully \n")
with open(infile, "r") as f:
next(f)
#print "file has been opened" + infile
for line in f:
现在我想要做的是让用户输入初始和结束日期,并在选定的时间间隔内执行相同的操作。
文件命名如下
countlog-2018-06-09.txt
有任何想法吗?
使用input
标志写入stdin。您可能还想执行输入验证。以下是一些可以适应您要求的代码。
import re
from datetime import datetime
date_format = '%Y-%m-%d'
date_regex = r'([12]\d{3}-(0[1-9]|1[0-2])-(0[1-9]|[12]\d|3[01]))'
def validate_date_input(date_type='start'):
while True:
start_date = input('Enter a valid {} date in format YYYY-MM-DD\n'.format(date_type))
try:
date = datetime.strptime(start_date, date_format)
except ValueError:
print('"{}" is not in format YYYY-MM-DD. try again\n'.format(start_date))
else:
return date
if __name__ == '__main__':
start_date = validate_date_input(date_type='start')
end_date = validate_date_input(date_type='end')
file_name = 'countlog-2018-06-09.txt'
file_re = re.search(date_regex, file_name)
file_date = datetime.strptime(file_re.group(0), date_format)
if start_date <= file_date <= end_date:
print('Working on {}'.format(file_name))
首先,您可以生成该日期范围内的日期列表,如下所示:
#generate all dates from date1 to date2
import datetime
date1 = '2011-05-03'
date2 = '2011-05-10'
start = datetime.datetime.strptime(date1, '%Y-%m-%d')
end = datetime.datetime.strptime(date2, '%Y-%m-%d')
date_ranges=[]
step = datetime.timedelta(days=1)
while start <= end:
date_ranges.append(start.date())
start += step
然后您可以使用这些日期范围生成文本文件名
text_file_names = ["countlog-"+str(the_date)+".txt" for the_date in date_ranges]
你现在可以阅读文件,你知道这是你需要的:)