
[Question]: parsing file failed #2222

Open
lilu6301 opened this issue Sep 3, 2024 · 7 comments
Labels
question Further information is requested

Comments

lilu6301 commented Sep 3, 2024

Describe your problem

[ERROR]Insert chunk error, detail info please check ragflow-logs/api/cron_logger.log. Please also check ES

mjtechguy commented

Same here. I am trying to ingest a .csv file using the latest docker-compose and Ollama (nomic-embed-text).

Any ideas?

lilu6301 (Author) commented Sep 5, 2024

My OS was Ubuntu 20.04; fixed by upgrading to 22.04.

PuppyMeng commented

> My OS was Ubuntu 20.04; fixed by upgrading to 22.04.

I'm having the same problem too, on Ubuntu 24.04.1.

cfenglv commented Oct 10, 2024

Same here, running a locally built Docker image on a Mac.
Looking into cron_logger.log gives:
{'type': 'document_parsing_exception', 'reason': '[1:90157] failed to parse: The [cosine] similarity does not support vectors with zero magnitude. Preview of invalid vector: [0.0, 0.0, 0.0, 0.0, 0.0, ...]', 'caused_by': {'type': 'illegal_argument_exception', 'reason': 'The [cosine] similarity does not support vectors with zero magnitude. Preview of invalid vector: [0.0, 0.0, 0.0, 0.0, 0.0, ...]'}}

By the way, looking into database.log gives:
ERROR (21) Can't update token usage for bc2bb8c3862b11ef8466f966ff2a9065/CHAT

I am using local Ollama: chat model llama3.2:1b and embedding model "snowflake-arctic-embed".
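Not advice from the thread, but since the ES error points at a zero-magnitude vector, one way to check whether the embedding model itself is returning all-zero embeddings is to query Ollama directly. A minimal sketch, assuming Ollama's default port 11434 and jq installed (swap in your model name):

# Request an embedding and print the sum of its squared components;
# 0 means a zero vector, which ES's cosine similarity rejects.
curl -s http://localhost:11434/api/embeddings \
  -H 'Content-Type: application/json' \
  -d '{"model": "snowflake-arctic-embed", "prompt": "hello world"}' \
  | jq '[.embedding[] | . * .] | add'

If this prints 0 (or null), the problem is on the embedding side, e.g. the model is not pulled or the name is wrong, rather than in Elasticsearch.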

aalboori commented
I had this error due to low storage space. Check the status of ES under RAGFlow's user settings --> System. If it is red, follow https://www.elastic.co/guide/en/elasticsearch/reference/8.15/red-yellow-cluster-status.html#fix-red-yellow-cluster-status to diagnose the cause.
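For reference, the same status can be checked from the command line. A minimal sketch, assuming ES is reachable on its default port 9200; RAGFlow's compose setup maps it to a different port and requires credentials, as a later comment shows:

# "status" : "red" means at least one primary shard is unassigned
curl -s "localhost:9200/_cluster/health?pretty"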

EzeLLM commented Dec 4, 2024

> I'm having the same problem too, on Ubuntu 24.04.1.

The guide linked above solved my problem; the following command in particular, as it fits with my logs:

curl -X PUT "localhost:9200/_cluster/settings?pretty" -H 'Content-Type: application/json' -d'
{
  "persistent" : {
    "cluster.routing.allocation.enable" : null
  }
}
'
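For context: setting a persistent cluster setting to null removes the override, so cluster.routing.allocation.enable falls back to its default ("all") and shard allocation is re-enabled. To confirm the override is gone (same host and port as above):

# an empty "persistent" section means no overrides remain
curl -X GET "localhost:9200/_cluster/settings?pretty"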

WaterDrop-EarthDivision commented

I'm happy to share my experience here.

The cause in my case was a lack of disk space: Elasticsearch monitors storage usage and stops allocating shards once usage goes above 85 percent.

I will describe below how I discovered and solved this problem.

1. Find the username and password for Elasticsearch using docker logs -f ragflow-server:

'username': 'elastic', 'password': 'infini_rag_flow'
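A small sketch, not from the original comment: you can filter the startup log for that line instead of tailing the whole output, assuming the credentials are printed on a line containing "password" as in the screenshot:

docker logs ragflow-server 2>&1 | grep -i -m1 password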

2. Diagnose via the link shared by @aalboori.

2.1 Use curl -X GET "localhost:1200/_cluster/health?pretty" -u elastic:infini_rag_flow to view the cluster status. Note that the original command used "localhost:9200"; RAGFlow's docker-compose maps Elasticsearch to port 1200. The -u flag supplies the username and password.

2.2 Use curl -X GET "localhost:1200/_cat/allocation?v=true&h=node,shards,disk.*&pretty" -u elastic:infini_rag_flow to view disk usage. That is when I realized a whopping 94% of the disk was in use, even though I have almost 2 TB of storage.

3. Replace the percentage-based watermarks with absolute sizes.

curl -X PUT "localhost:1200/_cluster/settings?pretty" -H 'Content-Type: application/json' -u elastic:infini_rag_flow -d'
{
  "persistent": {
    "cluster.routing.allocation.disk.watermark.low": "50gb",
    "cluster.routing.allocation.disk.watermark.high": "30gb",
    "cluster.routing.allocation.disk.watermark.flood_stage": "20gb"
  }
}'

You can't change just one of them: all three must be either percentages or absolute sizes!
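A related sketch, not from the thread: if you later want to return to the percentage defaults (85% / 90% / 95%), the same all-or-nothing rule applies, so reset all three watermarks together by setting them to null:

curl -X PUT "localhost:1200/_cluster/settings?pretty" -H 'Content-Type: application/json' -u elastic:infini_rag_flow -d'
{
  "persistent": {
    "cluster.routing.allocation.disk.watermark.low": null,
    "cluster.routing.allocation.disk.watermark.high": null,
    "cluster.routing.allocation.disk.watermark.flood_stage": null
  }
}'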

I hope my experience helps you.
