
[Bug]: delete index in graphrag #1637

Open
ajain85 opened this issue Jan 20, 2025 · 2 comments
Labels: bug, triage

Comments

ajain85 commented Jan 20, 2025

Do you need to file an issue?

  • I have searched the existing issues and this bug is not already filed.
  • My model is hosted on OpenAI or Azure. If not, please look at the "model providers" issue and don't file a new one here.
  • I believe this is a legitimate bug, not just a question. If this is a question, please use the Discussions area.

Describe the bug

I am facing an index deletion issue in Azure AI Search and locally. I deleted a file in the blob input folder, but after running the update command the corresponding indexes are not deleted in Azure AI Search. I am running the command below.

CLI: python -m graphrag update --config .\cli_graphrag\settings.yaml

Steps to reproduce

No response

Expected Behavior

No response

GraphRAG Config Used

# Paste your config here

Logs and screenshots

No response

Additional Information

  • GraphRAG Version:
  • Operating System:
  • Python Version:
  • Related Issues:
natoverse (Collaborator) commented:

This is correct - the update command appends new data, but does not remove data. That's a much more complicated task because the summarization process is lossy.

If you retain your cache, subsequent runs can avoid re-invoking the LLM for things like graph extraction. This makes it possible to re-run indexing with different mixes of document content. So you could remove the documents in question and then re-run the regular indexing. Graph extraction should be "free" in that it just uses the cache for all the existing text units. Depending on how the removed documents affect the community structure, community report generation could be the same cost as a usual run, or cheaper if some do not change and therefore also use the cache.
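The workaround described above can be sketched as a shell sequence. This is a minimal illustration, not an official procedure: the file name is a hypothetical placeholder, and it assumes the standard `index` subcommand and that the `cache` directory from the previous run is left in place so unchanged text units reuse cached LLM results.

```shell
# 1. Remove the unwanted document from the input folder
#    (or delete the corresponding blob in your storage container).
rm input/obsolete-doc.txt   # hypothetical file name

# 2. Re-run a full index instead of `update`. With the cache retained,
#    graph extraction for the remaining text units should hit the cache,
#    and the regenerated embeddings overwrite the vector store index,
#    dropping entries for the removed document.
python -m graphrag index --config ./cli_graphrag/settings.yaml
```

Community report generation may still cost as much as a normal run if the removed document changes the community structure, as noted above.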

natoverse (Collaborator) commented:

I should mention: when the embeddings are generated during normal indexing, they overwrite the existing vector store index, so your old entries should go away. Because the update command only adds new content, the old entries would still be there.
