将文本文件转换为 JSON 格式的 Python 函数（兼容多行题目，多个选项卡以及无序号选项卡）

在日常工作中，我们经常需要处理各种格式的数据。有时候，我们可能需要将一个文本文件中的内容转换为 JSON 格式的数据。为了方便起见，我写了一个 Python 函数来实现这个功能。

函数功能

这个函数的主要功能是读取一个文本文件，解析其中的题目、选项、答案和解析等内容，并将其转换为 JSON 格式的数据。

实现思路

这个函数的实现思路如下：

首先，打开指定的文本文件，并逐行读取文件内容。
使用一些标志变量来跟踪当前正在处理的题目、选项、答案和解析等内容。
遍历文件的每一行，根据不同的情况进行处理：
- 如果是题目的开始，则提取题目的编号和内容。
- 如果是选项，则将选项内容添加到选项列表中。
- 如果是答案，则提取答案内容。
- 如果是解析，则提取解析内容。
- 如果是空行，则表示当前题目处理完毕，将其转换为 JSON 格式的数据，并添加到结果列表中。
最后，将结果列表转换为 JSON 格式的字符串，并将其写入到输出文件中。

示例代码

 import json
 
def txt_to_json(txt_file_path):
    json_data = []
    with open(txt_file_path, 'r', encoding='utf-8') as file:
        if_analysis = 0
        option_num = 0
        if_ques = 1
        number_ques = 1
        question = ''
        options = []
        analysis = ''
        data = {}
        answer = ''
        for line in file:
            if line.strip().split('、')[0] == str(number_ques):
                if_analysis = 0
                option_num = 0
                if_ques = 1
                if_ans = 0
                options = []
                analysis = ''
                data = {}
                answer = ''
                data['id'] = number_ques
                question = '题目' + str(number_ques) + ':' + line.strip().split('、')[1]
            if len(line) >= 4 and (line[len(line)-5] + line[len(line)-4] + line[len(line)-3] + line[len(line)-2] == '(  )' or line[len(line)-5] + line[len(line)-4] + line[len(line)-3] + line[len(line)-2] == '  )。' or (line[len(line)-2] == '：' and line[0] != 'A') or (line[len(line)-2] == '。' and if_ques == 1) or ('(  )' in line)):
                if_ques = 0
                if line.strip().split('、')[0] != str(number_ques):
                    question = question + line.strip().split('\n')[0]
            if if_ques == 1 and line.strip().split('、')[0] != str(number_ques):
                question = question + line.strip().split('\n')[0]
            if line.strip().split('：')[0] == '答案':
                if_ans = 1
                answer = line.strip().split('：')[1]
                if_analysis = 1
            if if_ques == 0 and if_ans == 0 and len(line) >= 4 and (line[len(line)-5] + line[len(line)-4] + line[len(line)-3] + line[len(line)-2] != '(  )' and line[len(line)-5] + line[len(line)-4] + line[len(line)-3] + line[len(line)-2] != '  )。' and (line[len(line)-2] != '：' or line[0] == 'A') and (line[len(line)-2] != '。' or if_ques == 0) and ('(  )' not in line)):
                uppercase_alphabet = ['A', 'B', 'C', 'D', 'E', 'F', 'G', 'H', 'I', 'J', 'K', 'L', 'M', 'N', 'O', 'P',
                                      'Q', 'R', 'S', 'T', 'U', 'V', 'W', 'X', 'Y', 'Z']
                if line[0] in uppercase_alphabet:
                    if line[0] == 'A':
                        str_join = ' '.join(options)
                        str_earn = ''
                        print(str_join)
                        for i in range(0,len(options)):
                            print(type(options[i]))
                            str_earn = str_earn + options[i].strip().split('、')[1]
                            # if char_join[i] not in uppercase_alphabet:
                            #     str_earn = str_earn + char_join[i]
                        question = question + str_earn
                        print(question)
                        options = []
                    print('hjx')
                    line1 = line.strip().split('\n')[0]
                    options.append(line1)
                    option_num = option_num + 1
                else:
                    line1 = line.strip().split('\n')[0]
                    options.append(uppercase_alphabet[option_num]+'、'+line1)
                    option_num = option_num + 1
            if if_analysis == 1 and line.strip().split('：')[0] != '答案':
                analysis = analysis + line.strip().split('\n')[0]
            if line == '\n':
                number_ques = number_ques + 1
                data['question'] = question
                data['type'] = "选择题"
                data['options'] = options
                data['answer'] = answer
                if analysis:
                    data['analysis'] = analysis
                json_data.append(data)
                print(json_data)
 
    json_data = json.dumps(json_data, indent=4, ensure_ascii=False)
 
    with open('output.json', 'w', encoding='utf-8') as output_file:
        # print(json_data)
        # output_file.write("questions = " + json_data)
        output_file.write('"questions":' + json_data)
 
txt_to_json('python基础题库.txt')复制

结论

通过这个函数，我们可以轻松地将一个文本文件中的内容转换为 JSON 格式的数据，这样就可以更方便地进行后续的处理和分析了。

将文本文件转换为 JSON 格式的 Python 函数（兼容多行题目，多个选项卡以及无序号选项卡）

函数功能

实现思路

示例代码

结论

C#解析JSON的常用库--Newtonsoft.Json

【SpringMVC】_SpringMVC项目返回HTML与JSON

python 解读JSON文件，一文搞懂！

由于不同电脑语言具有不同的特性和用途，我会为你提供一个简化版的游戏商城的概念代码，分别使用 Python（用于后端逻辑）和 HTML/JavaScript（用于前端展示）。

Python数据可视化案例——折线图

（开题）flask框架基于HTML5的酒店预订管理系统（程序论文 python）

python爬虫入门（三）之HTML网页结构

cesium 加载本地json、GeoJson数据

JSONPath，一个事半功倍的查找取数工具

Python毕业设计选题：基于django vue的荣誉证书管理系统

前端哥

C#解析JSON的常用库--Newtonsoft.Json

jsonfield 项目常见问题解决方案

【SpringMVC】_SpringMVC项目返回HTML与JSON

BugJson因为json格式问题OOM怎么办

python 解读JSON文件，一文搞懂！

Redisson同时使用jackson、fastjson、kryo、protostuff序列化（含效率对比）

开源项目“Pretty JSON”安装与配置完全指南

2024年前端最新Nodejs基础之包管理工具npm(二)(2)，微软面试题及答案

解决全局安装pnpm后无法使用的问题

安装Nodejs后，npm无法使用

1
【Echarts系列】—— 实现电池图、3D立体圆形柱状图

2024-03-03 11:03:011001

2
CSS常用属性（文本属性）

2024-11-04 09:11:111000

3
TypeScript 中的 Number 类型，Number 类型的特性、常见操作和注意事项

2024-09-30 23:09:061000

4
CSS写代码使页面划分为左右两个区域

2024-09-09 00:09:071000

5
vue使用datav echarts

2024-09-06 00:09:381000

6
使用TweenMax.js和CSS3创建冰球运动员动画效果教程

2024-09-04 23:09:411000

7
使用CDN提高jQuery加载速度

2024-08-24 23:08:211000

8
小兔鲜儿网页首页制作黑马程序员前端基础项目自学笔记

2024-08-19 22:08:161000

9
《Vue》你的弹窗能拖动吗？Vue自定义指令实现可拖动弹窗

2024-08-19 22:08:121000

10
npm的使用

2024-08-18 00:08:131000

	import json

	def txt_to_json(txt_file_path):
	json_data = []
	with open(txt_file_path, 'r', encoding='utf-8') as file:
	if_analysis = 0
	option_num = 0
	if_ques = 1
	number_ques = 1
	question = ''
	options = []
	analysis = ''
	data = {}
	answer = ''
	for line in file:
	if line.strip().split('、')[0] == str(number_ques):
	if_analysis = 0
	option_num = 0
	if_ques = 1
	if_ans = 0
	options = []
	analysis = ''
	data = {}
	answer = ''
	data['id'] = number_ques
	question = '题目' + str(number_ques) + ':' + line.strip().split('、')[1]
	if len(line) >= 4 and (line[len(line)-5] + line[len(line)-4] + line[len(line)-3] + line[len(line)-2] == '( )' or line[len(line)-5] + line[len(line)-4] + line[len(line)-3] + line[len(line)-2] == ' )。' or (line[len(line)-2] == '：' and line[0] != 'A') or (line[len(line)-2] == '。' and if_ques == 1) or ('( )' in line)):
	if_ques = 0
	if line.strip().split('、')[0] != str(number_ques):
	question = question + line.strip().split('\n')[0]
	if if_ques == 1 and line.strip().split('、')[0] != str(number_ques):
	question = question + line.strip().split('\n')[0]
	if line.strip().split('：')[0] == '答案':
	if_ans = 1
	answer = line.strip().split('：')[1]
	if_analysis = 1
	if if_ques == 0 and if_ans == 0 and len(line) >= 4 and (line[len(line)-5] + line[len(line)-4] + line[len(line)-3] + line[len(line)-2] != '( )' and line[len(line)-5] + line[len(line)-4] + line[len(line)-3] + line[len(line)-2] != ' )。' and (line[len(line)-2] != '：' or line[0] == 'A') and (line[len(line)-2] != '。' or if_ques == 0) and ('( )' not in line)):
	uppercase_alphabet = ['A', 'B', 'C', 'D', 'E', 'F', 'G', 'H', 'I', 'J', 'K', 'L', 'M', 'N', 'O', 'P',
	'Q', 'R', 'S', 'T', 'U', 'V', 'W', 'X', 'Y', 'Z']
	if line[0] in uppercase_alphabet:
	if line[0] == 'A':
	str_join = ' '.join(options)
	str_earn = ''
	print(str_join)
	for i in range(0,len(options)):
	print(type(options[i]))
	str_earn = str_earn + options[i].strip().split('、')[1]
	# if char_join[i] not in uppercase_alphabet:
	# str_earn = str_earn + char_join[i]
	question = question + str_earn
	print(question)
	options = []
	print('hjx')
	line1 = line.strip().split('\n')[0]
	options.append(line1)
	option_num = option_num + 1
	else:
	line1 = line.strip().split('\n')[0]
	options.append(uppercase_alphabet[option_num]+'、'+line1)
	option_num = option_num + 1
	if if_analysis == 1 and line.strip().split('：')[0] != '答案':
	analysis = analysis + line.strip().split('\n')[0]
	if line == '\n':
	number_ques = number_ques + 1
	data['question'] = question
	data['type'] = "选择题"
	data['options'] = options
	data['answer'] = answer
	if analysis:
	data['analysis'] = analysis
	json_data.append(data)
	print(json_data)

	json_data = json.dumps(json_data, indent=4, ensure_ascii=False)

	with open('output.json', 'w', encoding='utf-8') as output_file:
	# print(json_data)
	# output_file.write("questions = " + json_data)
	output_file.write('"questions":' + json_data)

	txt_to_json('python基础题库.txt')

将文本文件转换为 JSON 格式的 Python 函数（兼容多行题目，多个选项卡以及无序号选项卡）

函数功能

实现思路

示例代码

结论

微信扫一扫：分享