Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

suggest lower cardinality variables for the pod dashboard[bug] #106

Closed
brokenjacobs opened this issue Apr 22, 2024 · 3 comments
Closed

suggest lower cardinality variables for the pod dashboard[bug] #106

brokenjacobs opened this issue Apr 22, 2024 · 3 comments
Assignees
Labels
bug Something isn't working released

Comments

@brokenjacobs
Copy link

brokenjacobs commented Apr 22, 2024

Describe the bug

When in a cluster with a lot of churn on pods, the high cardinality pod metrics cause queries to fail due to the large number of series returns. For instance I doubled the max returned label sets in victoriametrics to 60k and I still fail when trying to use the pod dashboard:

2024-04-22T18:17:33.527Z	warn	VictoriaMetrics/app/vmselect/main.go:231	error in "/api/v1/series?start=1713806220&end=1713809880&match%5B%5D=%7B__name__%3D%22kube_pod_info%22%7D": cannot fetch time series for "filters=[{__name__=\"kube_pod_info\"}], timeRange=[2024-04-22T17:17:00Z..2024-04-22T18:18:00Z]": cannot find metric names: error when searching for metricIDs in the current indexdb: the number of matching timeseries exceeds 60000; either narrow down the search or increase -search.max* command-line flag values at vmselect; see https://docs.victoriametrics.com/#resource-usage-limits

How to reproduce?

Have a cluster with a lot of pods being created...

Expected behavior

No response

Additional context

I have a fix suggestion that seems to work fine for me. It involves changing the namespace and job queries to not query "all pods" for labels. Like this:

namespace: label_values(kube_namespace_created{cluster="$cluster"},namespace)
job: label_values(kube_pod_info{namespace="$namespace", cluster="$cluster"},job)
@brokenjacobs brokenjacobs added the bug Something isn't working label Apr 22, 2024
@dotdc
Copy link
Owner

dotdc commented Apr 22, 2024

Thank you for the bug report @brokenjacobs, will have a look at it by the end of the week!

@dotdc dotdc closed this as completed in 75dd5a1 Apr 25, 2024
@dotdc
Copy link
Owner

dotdc commented Apr 25, 2024

🎉 This issue has been resolved in version 1.1.0 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

@dotdc dotdc added the released label Apr 25, 2024
@dotdc
Copy link
Owner

dotdc commented Apr 25, 2024

@brokenjacobs
Should be fixed 75dd5a1, let me know if it's not the case.

sdlarsen pushed a commit to digital-science/grafana-dashboards-kubernetes that referenced this issue Apr 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working released
Projects
None yet
Development

No branches or pull requests

2 participants