GitHub

这个代码（参考链接）进行了简单优化升级:

采用的是tfidf模型，主要思路是：首先计算用户的问题与问题库中的问题的相似度并选出top15的相似问题，然后去问题库对应的答案库中找出这15个问题对应的答案，以此作为回答用户问题的候选答案。
新增BM25模型。

preprocess.py:

file_reader.py:

cut_words.py:

setence_similarity.py：

sentence.py:

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
README.md		README.md
bm25.py		bm25.py
config.py		config.py
cut_words.py		cut_words.py
file_reader.py		file_reader.py
preprocess.py		preprocess.py
run_bm25.py		run_bm25.py
run_tfidf.py		run_tfidf.py
sentence.py		sentence.py
setence_similarity.py		setence_similarity.py
utils.py		utils.py

Provide feedback