为了账号安全,请及时绑定邮箱和手机立即绑定

根据第一个值列出索引超出范围python2.7循环

根据第一个值列出索引超出范围python2.7循环

桃花长相依 2022-11-18 16:49:30
美好的一天可能有人请帮助我尝试基于第一个值“tocken”循环遍历 txt 文件,它当前仅在我指定 tockens 时循环,例如 {600001130260} 文件中的多个 tockens 都在第一行 [0 ] id 喜欢它迭代每个标记/行并提取指定的信息。数据文件看起来像这样600001130260|005|||IN|2197|01||20160905210028600001130260|100|005|00|VAT|VAT|VAT @ 14%|2,150.14600001130260|100|013|00|TOT|CTOT|Total Due|86,578.93600001130260|100|014|00|DD|DD|Due Date|2015/09/22|2015/10/15600001130260|200|019|01||YDACON|Daily average consumption 79.325 kWh||28002385859|000|||||LT|||T0IQ04960000000016400000000000000||28002385859|100|005|00|CUR|CUR|Current Charges (Excl. VAT)|304.4828002385859|100|006|00|VAT|VAT|VAT @ 14%|10.6228002385859|100|013|00|TOT|CTOT|Total Due|26,451.7528002385859|100|014|00|DD|DD|Due Date|2015/09/2328002385859|150||23,149.02|1,686.37|1,233.57|382.79|0.00|26,451.75这是我的代码file1 = open(r"C:\Users\isaac.gumbi\Documents\jhb\Full test file.txt", 'r')file2 = ""with file1 as f:    for line in f:        tokens = line.split("|")        keys = {'600001130260','118002191517','CTOT', 'CUR', 'Due Date',                'VAT', '020', '030', '010', '040', 'STOT', '000', '005',                '050', '0100', 'BBF', 'INT','CIN', 'CTOT', 'DD', 'YVLEVY',                 'YRREM'}        if len(tokens) and tokens[0] in keys and tokens[5] == 'CTOT':            Total_due = ' '.join(tokens[7:8])            if Total_due == '' : Total_Due = "null"            print ("Total_due", Total_due)这是我目前的输出('Total_due', '86,578.93\n')('Total_due', '79,191.18\n')我希望它在不指定 [0] 中的标记的情况下给我输出 total_due
查看完整描述

1 回答

?
蝴蝶不菲

TA贡献1810条经验 获得超4个赞

你想要的所有值Total Due吗?如果是这样,您只需执行以下操作:


sep = "|"

value_name = "Total Due"

result = []


with open("thefile.txt", 'r') as f:

    for line in f:

        tokens = line.split(sep)

        try:

            ind_total_due = tokens.index(value_name) + 1

            result.append((value_name, tokens[ind_total_due]))

        except ValueError:

            continue


结果将是:


[('Total Due', '86,578.93'),

 ('Total Due', '26,451.75'),

 ('Total Due', '3,483.28'),

 ('Total Due', '983.04'),

 ('Total Due', '- 197,358.33')]

第一个“令牌”似乎是一个唯一标识符。如果您想要 csv 导出和多列支持,您可以这样做:


token_sep = "|"

csv_sep = ";"


# Lambda function that whill format total due

float_formater = lambda string : float(

    string.replace(" ", "").replace(",", "")

)


# Attributes you want to parse

col_names = (

    ("Total Due", float_formater, 1),

    ("Due Date", None, 1),

)


# Python dictionary which associate to each identifier, a total due

# and a due date

records = {}


with open("thefile.txt", 'r') as f:

    for line in f:


        tokens = line.strip().split(token_sep)


        # We assume the first token is an identifier

        unique_id = tokens[0]


        # For each new identifier we create a new record and initialize 

        # total due and due date to an empty string

        if unique_id and unique_id not in records:

            records[unique_id] = {col_name: "" for col_name, _ in col_names}


        # then we look for values we are interesting in. If we find one, we can

        # update one value of the record

        for col_name, formatter, index_val in col_names:

            try:

                ind_col = tokens.index(col_name) + index_val 

                value = tokens[ind_col]


                if formatter:

                    value = formatter(value)


                records[unique_id][col_name] = value



            except ValueError:

                continue


# For an easier csv export we reformat the record dict to a list of values

list_values = [

    (unique_id,) + tuple((values[col] for col, _ in col_names))

    for unique_id, values in records.items()

]


# We can then easily write all the records one by one

with open("mycsv.csv", "w") as f:

    f.write(csv_sep.join(["id"] + [c for c, _ in col_names]))

    for values in list_values:

        print(values)

        f.write("\n")

        f.write(csv_sep.join(map(str, values)))


mycsv.csv:


id;Total Due;Due Date

112002209769;3483.28;2015/09/23

142002121343;-197358.33;

600001130260;86578.93;2015/09/22

28002385859;26451.75;2015/09/23

100002232416;983.04;2015/09/23


查看完整回答
反对 回复 2022-11-18
  • 1 回答
  • 0 关注
  • 68 浏览
慕课专栏
更多

添加回答

举报

0/150
提交
取消
意见反馈 帮助中心 APP下载
官方微信