Run ListMetrics calls with goroutines. #53

cristiangreco · 2022-05-30T12:58:25Z

Execute ListMetrics calls in separate goroutines (one for each metric),
in a similar way to how GetMetricData requests are handled.

This change removes semaphore usage (it isn't used in GetMetricData
either) to make the code easier to reason about. It can easily be added
back if we hit any issues around e.g. rate limits.

In local tests, the speedup is most noticeable when requesting 4 or more
metrics in parallel (especially with EC2/EBS).
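
For readers who have not opened the diff, the pattern described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `listFunc`, `listAllMetrics`, `fakeList` and `demo` are hypothetical names, and the real exporter calls CloudWatch instead of a fake.

```go
package main

import (
	"context"
	"fmt"
	"sync"
)

// listFunc is a hypothetical stand-in for the exporter's per-metric
// ListMetrics wrapper; it is not the real function signature.
type listFunc func(ctx context.Context, namespace, metric string) ([]string, error)

// listAllMetrics starts one goroutine per metric and waits for all of them.
// Each goroutine writes only to its own slice index, so the results need no lock.
func listAllMetrics(ctx context.Context, namespace string, metrics []string, list listFunc) [][]string {
	results := make([][]string, len(metrics))
	var wg sync.WaitGroup
	for i, m := range metrics {
		wg.Add(1)
		go func(i int, m string) {
			defer wg.Done()
			found, err := list(ctx, namespace, m)
			if err != nil {
				// Log and move on; the error does not leave the goroutine.
				fmt.Printf("failed to list %s/%s: %v\n", namespace, m, err)
				return
			}
			results[i] = found
		}(i, m)
	}
	wg.Wait()
	return results
}

// fakeList pretends every metric has two dimension sets (illustration only).
func fakeList(_ context.Context, namespace, metric string) ([]string, error) {
	return []string{namespace + "/" + metric + "#a", namespace + "/" + metric + "#b"}, nil
}

// demo runs the fan-out against the fake lister.
func demo() [][]string {
	return listAllMetrics(context.Background(), "AWS/EC2", []string{"CPUUtilization", "NetworkIn"}, fakeList)
}

func main() {
	for _, r := range demo() {
		fmt.Println(r)
	}
}
```

Note that there is no upper bound on the number of goroutines here, which matches the removal of the semaphore described above.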

kgeckhart

> This change removes semaphore usage (it isn't used in GetMetricData either) to make the code easier to reason about. It can easily be added back if we hit any issues around e.g. rate limits.

Slight concern: errors aren't propagated out of the parallel sections of the code at the moment, so we might want to monitor errors a little more closely after this.
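
One possible way to surface those errors, should it become necessary, is to collect them behind a mutex and return them once all goroutines finish. The sketch below assumes hypothetical names (`runAll`, `failOnNetworkIn`, `demoErrors`) and is not what this PR implements.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

// runAll runs work once per metric, each in its own goroutine, and returns
// every error that occurred instead of only logging it.
func runAll(metrics []string, work func(metric string) error) []error {
	var (
		wg   sync.WaitGroup
		mu   sync.Mutex // guards errs
		errs []error
	)
	for _, m := range metrics {
		wg.Add(1)
		go func(m string) {
			defer wg.Done()
			if err := work(m); err != nil {
				mu.Lock()
				errs = append(errs, fmt.Errorf("metric %s: %w", m, err))
				mu.Unlock()
			}
		}(m)
	}
	wg.Wait()
	return errs
}

var errThrottled = errors.New("throttled")

// failOnNetworkIn is a fake worker that fails for exactly one metric.
func failOnNetworkIn(metric string) error {
	if metric == "NetworkIn" {
		return errThrottled
	}
	return nil
}

// demoErrors runs three fake calls, one of which fails.
func demoErrors() []error {
	return runAll([]string{"CPUUtilization", "NetworkIn", "DiskReadOps"}, failOnNetworkIn)
}

func main() {
	for _, err := range demoErrors() {
		fmt.Println(err)
	}
}
```

The caller can then decide whether a partial failure should fail the whole scrape or only be counted.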

IfSentient

Do we need to worry about rate limiting from AWS in the case of large numbers of metrics?
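
If rate limits do turn out to be a problem, a semaphore could be reintroduced with a buffered channel. The following is a sketch under that assumption; `runBounded` and `demoPeak` are hypothetical names, and the limit value would need tuning against the actual CloudWatch quotas.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// runBounded runs work once per metric but allows at most `limit` calls to be
// in flight at any time. limit must be at least 1.
func runBounded(metrics []string, limit int, work func(metric string)) {
	sem := make(chan struct{}, limit)
	var wg sync.WaitGroup
	for _, m := range metrics {
		wg.Add(1)
		sem <- struct{}{} // blocks while `limit` calls are already running
		go func(m string) {
			defer wg.Done()
			defer func() { <-sem }() // free the slot when this call ends
			work(m)
		}(m)
	}
	wg.Wait()
}

// demoPeak runs n fake calls with the given limit and reports the highest
// number of calls that were running at the same time.
func demoPeak(n, limit int) int {
	var mu sync.Mutex
	inFlight, peak := 0, 0
	runBounded(make([]string, n), limit, func(string) {
		mu.Lock()
		inFlight++
		if inFlight > peak {
			peak = inFlight
		}
		mu.Unlock()
		time.Sleep(time.Millisecond) // simulate a slow API call
		mu.Lock()
		inFlight--
		mu.Unlock()
	})
	return peak
}

func main() {
	fmt.Println("peak concurrency:", demoPeak(20, 3))
}
```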

thepalbi · 2022-05-31T12:42:35Z

pkg/abstract.go

+			metricsList, err := getFullMetricsList(ctx, svc.Namespace, m, clientCloudwatch)
+
+			if err != nil {
+				level.Error(logger).Log("msg", "Failed to get full metric list", "err", err, "metric_name", m.Name, "namespace", svc.Namespace)


Maybe we could add a Prometheus metric to track both these errors and the scenario below with zero resources?
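
A rough sketch of what tracking both cases could look like is below. To keep it dependency-free, plain atomic counters stand in for Prometheus counters; `outcomeCounters`, `record` and `demoCounters` are hypothetical names and nothing here is part of the exporter.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
	"sync/atomic"
)

// outcomeCounters is a hypothetical stand-in for two Prometheus counters.
type outcomeCounters struct {
	listErrors  int64 // ListMetrics calls that returned an error
	noResources int64 // calls that succeeded but returned nothing
}

// record classifies the result of one ListMetrics call. It only uses atomic
// increments, so it is safe to call from many goroutines.
func (c *outcomeCounters) record(found []string, err error) {
	switch {
	case err != nil:
		atomic.AddInt64(&c.listErrors, 1)
	case len(found) == 0:
		atomic.AddInt64(&c.noResources, 1)
	}
}

// outcome is one simulated ListMetrics result.
type outcome struct {
	found []string
	err   error
}

// demoCounters records four simulated results concurrently: one success,
// one error and two empty results.
func demoCounters() (listErrors, noResources int64) {
	outcomes := []outcome{
		{found: []string{"i-123"}},
		{err: errors.New("throttled")},
		{found: nil},
		{found: []string{}},
	}
	c := &outcomeCounters{}
	var wg sync.WaitGroup
	for _, o := range outcomes {
		wg.Add(1)
		go func(o outcome) {
			defer wg.Done()
			c.record(o.found, o.err)
		}(o)
	}
	wg.Wait()
	return atomic.LoadInt64(&c.listErrors), atomic.LoadInt64(&c.noResources)
}

func main() {
	e, z := demoCounters()
	fmt.Println("errors:", e, "no resources:", z)
}
```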

ferruvich

Can we roll this out to a single cluster first to observe any impact?

LGTM as we're going to do so

CLAassistant · 2022-06-15T18:01:55Z

All committers have signed the CLA.

kgeckhart and others added 13 commits October 7, 2021 08:24

Return error on getFullMetricsList instead of log.Fatal (#1)

86b0764

Move replacer and splitRegexp to package level (#2)

cac7ca4

Eliminate global labelMap and build a labelSet for metrics in ensureLabelConsistencyForMetrics

98f5a60

Merge pull request #3 from grafana/keckhart/label-map-cannot-be-global

7cfecdb

Eliminate global labelMap and build a labelSet for metrics in ensureLabelConsistencyForMetrics

Merge master in to live to upgrade to v0.32.0

70baf31

Allow injection of a go-kit logger in to update.UpdateMetrics and switch to structured logging (#7)

e8c0ee0

Decouple yace_* metric registration from UpdateMetrics (#8)

445cae5

Support a provided set of observed labels for each metric (#9)

a01fea5

Ensure logger is provided (#10)

1bc09af

* Pass logger to the different structs
* Use a builder for the AWS services
* Use job specific loggers to propagate properties

Add the plus sign back for the loop increment (#11)

edb42e2

Merge 0.34.0 and 0.33.0 (#30)

6ca630f

* Merge latest release 0.34.0-alpha
* Add missing metric

Expose AWS Metrics as an array in update.go (#31)

69f6286

* Move AWS metrics back to prometheus.go and expose them as array in update.go
* The go workflow should run on PR's for live
* Remove unused field

cristiangreco requested review from kgeckhart, matthewnolf and ferruvich May 30, 2022 13:00

matthewnolf approved these changes May 31, 2022

kgeckhart approved these changes May 31, 2022

IfSentient approved these changes May 31, 2022

thepalbi approved these changes May 31, 2022

thepalbi reviewed May 31, 2022

ferruvich approved these changes May 31, 2022

kgeckhart force-pushed the live branch from 5e80616 to b299d6d on July 18, 2022 18:25

cristiangreco force-pushed the live branch 2 times, most recently from f35cc9e to b15fdd9 on September 16, 2022 08:15

cristiangreco force-pushed the live branch from b15fdd9 to 10b4148 on November 9, 2022 10:58

cristiangreco force-pushed the live branch from 10b4148 to a4325b0 on January 2, 2023 17:11

cristiangreco force-pushed the live branch from a4325b0 to b4a7914 on January 24, 2023 08:57

cristiangreco force-pushed the live branch from b4a7914 to 32040c4 on January 31, 2023 10:23

cristiangreco force-pushed the live branch from 32040c4 to 6d294f0 on March 9, 2023 11:17

cristiangreco force-pushed the live branch from 6d294f0 to 59b0f79 on April 3, 2023 10:57
