Rough sketch (draft 3) limiter extension and middleware #12700
base: main
Conversation
…missed a form of gRPC interceptor. For both client and server gRPC cases in the middleware extension API, introduce a method to obtain a stats handler. ClientStatsHandler() and ServerStatsHandler() methods will be added, returning (grpc.StatsHandler, error). In limitermiddleware, add support for the two new methods. Implement the StatsHandler interface with empty methods for now. The type is named stats.Handler, package documented https://pkg.go.dev/google.golang.org/[email protected]/stats#Handler
I reviewed extensionmiddleware and extensionlimiter:
- I have very few comments that we can discuss;
- We should merge these ASAP; this matches exactly what I expected to see.
- Next on my list will be the configs, then the converters.
I like the way things have shaped up here. I like the interfaces you've defined and I think they set us going in the right direction.
Thanks @bogdandrutu, @mattsains. I've improved this PR (still a draft) based on your feedback.
I'm happy with how the gRPC rate limits are implemented, and I added HTTP network-bytes limits to flesh out the skeleton of this approach. The http.RoundTripper and http.Handler will be created when either of the two weight keys it supports is used: network_bytes and request_count.
Note that request_items and resident_bytes will be implemented at a different level in the receiver(s).
However, there are still likely some challenges. There are existing middlewares with a pre-defined order, including auth, headers, compression, and opentelemetry instrumentation. To add a limiter, we want it to go before compression. Turning compression or opentelemetry instrumentation into extensions, which sounds nice, also implies a transition plan. I could imagine making the middleware order configurable with a default of [compression, opentelemetry]; if you want to add a rate limiter, you'll have to include compression and opentelemetry in the proper order.
P.S. The core Provider interface and contract with middleware and other callers is really the only thing bothering me; otherwise this is looking great. Thank you for working on it! ❤️
@axw I think it's a good question, whether the Weight key should be a fixed enum or an open set. I can imagine a user who decides they want to rate-limit auth requests, having added special support in their auth extension. Then the auth extension would list a limiter extension, and potentially it could use a Weight key like "authorization_count". 🤷 So, I made the value an enum, but I don't really understand the implications for adding values in the future.
@axw @bogdandrutu Please review the changes in configgrpc, confighttp, and otlpreceiver. If I don't hear more input, I'll start to send out single-package changes after next week, starting with extensionlimiter, extensionmiddleware, configlimiter, configmiddleware, then configgrpc, confighttp, then limitermiddleware, memorylimiter, and work starts on two new limiters (admission and rate). Note: the OTLP receiver now implements network_bytes and request_count limits using middleware, but it implements request_items and memory_size limits directly, after it knows this information. The functionality could potentially be implemented in a shim layer between Receiver and pipeline, but there isn't a precedent for this and I would be glad for items/size limits to be opt-in. Is it OK for the limitermiddleware to present itself as an extensionlimiter.Provider so that OTLP receiver can call it directly? I like this, and this is how OTel-Arrow receiver would like it as well -- we can't calculate items/size until the data is parsed and processed a bit.
LGTM (can start PRs):
- configgrpc
- configmiddleware
- confighttp
- extensionmiddleware
For the rest I need more time to think about how it will be used. For example, does the limitermiddlewareextension need to be public (or how does a dev use it)? Or can we have it as part of the extensionmiddleware, to support conversion when the middleware ID is a limiter extension?
I agree with @bogdandrutu, the middleware bits look straightforward and ready to move ahead with.
For the limiter API, I would still prefer to completely remove the weight key.
AFAICS the OTLP receiver doesn't need to call the limiter itself, since the limit is acquired after decoding has happened; so I think this could just as well be done by a processor, and then it's immediately usable with other receivers too. Maybe I'm missing some subtle detail, though.
Is it OK for the limitermiddleware to present itself as an extensionlimiter.Provider so that OTLP receiver can call it directly? I like this, and this is how OTel-Arrow receiver would like it as well -- we can't calculate items/size until the data is parsed and processed a bit.
Not sure I understand the rationale here. Would it still be relevant if we do the item/size limiting in a processor?
@bogdandrutu I need us to reach agreement on the limiter APIs before I proceed. @axw Let's focus on "can this be a processor?"
My position is that processors are too late to effectively govern memory use. This is why memorylimiterprocessor basically doesn't work, and it was the genesis of memorylimiterextension. I understand the intuition behind this question: in some receivers, very little happens between the point where a receiver knows the memory_size and request_items and the call to Consume(), so why not perform these functions in the first processor instead of complicating receiver logic? But in some receivers, a lot happens between the point where a receiver knows memory_size/request_items and starting to process the request. gRPC-unary and HTTP servers typically hide the creation of a goroutine per request, so it looks like there's no difference between "the last thing a receiver does" and "the first thing a processor does", but for many protocols this is not the case. I'll give two examples:
Note: I'm not actually interested in item-count limits. Item-count limits can be implemented in a processor, but I don't think we should. I think it would be appropriate to give receivers an option, a sort of […]. @bogdandrutu wrote:
I showed inside this PR how I would use it, for example where @axw raised the question above. (Note the OTLP receiver is a special case, because (a) it receives OTLP data, therefore middleware can easily limit request_items/memory_size, and (b) as a gRPC-unary server, a goroutine has been allocated before the limiter is called.) @axw, considering your suggested alternative, the point I want to make is that not all uses of limiters will be middleware. In my proposal, the limitermiddleware is nothing more than a reference to a limiter. In yours, a lot of configuration (i.e., "weight keys") goes into the limitermiddleware, which means I'll have to re-create it in receivers that do not support middleware. Narrowing in on @axw's statement:
I read "need to be documented" to mean "sounds complex", but the two look equally complex to me. All solutions need to be documented. :)
This is not how my proposal goes. Every limiter will support all keys, because limiters are expected to operate on weight information alone. There would be no reason for a limiter to support a subset of keys. In my counter-example, the rate-limiter has one section per weight key (and unconfigured keys are unlimited).
Summarizing: we have four weight-specific entries in every limiter, and there will be at least three limiters: memorylimiter, admissionlimiter, ratelimiter. Limiters are expected to support all weight keys and behave identically. Adding new weight keys will be simple for limiters, complex for receivers.
Disagree. I see it as every receiver's responsibility to ensure that all four standard limiter weights are implemented, through middleware or otherwise. A gRPC or HTTP receiver can document that middleware includes limiters; however, we know that a middleware-limiter adapter can generally only limit network_bytes and request_count, leaving the receiver itself responsible for request_items and memory_size (which can be provided as helpers). Therefore, documentation is not what is needed: what's needed is to update the receivers to support middleware and/or call receiver helpers and/or directly call the limiters. Here is what I think we have to document to mitigate complexity:
I hope this helps convince you both that limiter extensions should configure weight keys and that receivers need direct access to limiters. IMO this draft is the best path forward.
@jmacd re "why not a processor?": thanks for the additional context, I get your point now with the OTel-Arrow & syslog examples. My overall takeaway is: we should perform the limiting at the earliest opportunity, and in non-OTLP receivers we will have the information before we have converted to OTLP/pdata, in which case a processor will not be the earliest opportunity.
OK. I had in mind that the limiter implementations would care about the keys, and the receivers were expected to use specific values. Please bear with me, there's a lot to get my head around here... In https://github.com/open-telemetry/opentelemetry-collector/pull/12700/files#r2020428860 you mentioned having some interstitial; do you have an idea of what that would look like already? I'm not sure I understand how that would help address my concern, which is that a user who cares about (only) one kind of rate limiting shouldn't pay for any other kinds.
@axw Thank you! I refactored some of the code from the prior state into a helper library. Note that network_bytes and request_count limits are applied in the limitermiddleware (protocol-specific). There are two interfaces in the helper library, and now the changes in otlpreceiver are quite small. For now, I placed this helper library into extension/extensionlimiter/limiterhelper. Please see:
// Provider is an interface that provides access to different limiter types
// for specific weight keys.
type Provider interface {
	// RateLimiter returns a RateLimiter for the specified weight key.
	RateLimiter(key WeightKey) RateLimiter

	// ResourceLimiter returns a ResourceLimiter for the specified weight key.
	//
	// In cases where a component supports a rate limiter and does not use
	// a release function, the component may return a ResourceLimiterFunc
	// which calls the underlying rate limiter and returns a nil ReleaseFunc.
	ResourceLimiter(key WeightKey) ResourceLimiter
}
I'm having a hard time understanding when you would use a RateLimiter vs. a ResourceLimiter.
In limiterMiddleware we have:
- ResourceLimiter for request_count
- RateLimiter for network_bytes
Using RateLimiter makes sense to me for network_bytes: there's an unlimited stream of bytes coming in, and we want to limit the rate at which they will be consumed.
Why would you not do the same for request_count? IIUC, if you use a ResourceLimiter then the rate at which requests are admitted would depend on both the defined limit and how quickly each resource is released. That's because as soon as you release, the resource becomes available for another consumer.
I would expect a ResourceLimiter to be used only for limited resources, like memory_size. Another case where I think it would make sense is for limiting concurrency. In those cases we don't care about rates; we're just enforcing a limit on that limited resource.
Having said all that, I can think of cases where it would be useful to have an upper limit on concurrent requests in the pipeline in order to protect the backend from a rate it cannot possibly handle. I guess that's what you're going for here?
If that's the case, I'm wondering how the ResourceLimiter would work in a pipeline where the receiver and exporter are separated by a buffer (e.g. Kafka). If the receiver releases the resource immediately after producing to Kafka, then at sustained high receive rate the buffer growth may exceed the backend's capacity.
I've added details about the Rate- and Resource-limiter APIs to the package documentation. I also mention a connection with the OTel metrics data model: Rates apply to Counters, Resources apply to UpDownCounters. I had already documented that Rate limiters can be applied as Resource limiters, simply by using a no-op ReleaseFunc; the same analogy holds in the OTel metrics data model: you can count the rate of increments to an UpDownCounter while ignoring decrements.
Other forms of limiting that I've seen or heard discussed recently, in our context:
- goroutines (a resource)
- auth requests (a rate)
- retries (a rate)
We would be able to add new weight keys for these.
Thank you, that helps. I suppose for keys like request_count there might be configuration to choose between rate or resource limiting then?
	"go.opentelemetry.io/collector/pdata/ptrace"
)

// Consumer is a builder for creating wrapped consumers with resource limiters
Thanks for adding this. Starting to become clearer. Would you also expect to add some common config struct that can be embedded in receiver config, similar to how exporters have exporterhelper.TimeoutConfig and exporterhelper.QueueBatchConfig? Then the receiver can indicate which limits it supports with the With*Limit options, and users can configure the receiver to enable/disable specific limits.
Yes, this is a natural conclusion. I won't go as far as to implement this, but it would be an easy next step.
Those other structures (e.g., TimeoutConfig) are applied at the factory level, whereas the behavior I've illustrated uses the HTTP- or gRPC-level configuration of middleware to infer the limiters. Now that we're here, I have mixed feelings about this aspect of the design. To go in the direction you suggested, here's what I'll do:
Middlewares will continue to apply network_bytes and request_count limits, no changes in interceptor logic.
A config named receiverhelper.LimiterConfig will list []configlimiter.Limiter entries, thus factory-level logic can initialize limiters for request_items and memory_size. However, note that only a list of limiters is configured, with no mention of weight keys. The receiver logic will specify which weight keys it expects to handle where, in code, while configuring the factory.
For example, the OTLP receiver covered in this draft would specify in its factory that receiver-level limiters should be configured for request_items and memory_size, while middleware-level limiters should be configured for request_count and network_bytes.
This is not the only design for weight keys; I'd like your opinion, @axw. Would you extend configlimiter.Limiter to refer to the weight key or keys to use for the binding? We might end up with configuration like:
receivers:
otlp:
protocols:
grpc:
middlewares:
- middleware: limitermiddleware/rate12
http:
middlewares:
- middleware: limitermiddleware/rate12
limiters:
- limiter: admissionlimiter/memory1
key: memory_size
extensions:
limitermiddleware/rate12:
- limiter: ratelimiter/rate1
key: network_bytes
- limiter: ratelimiter/rate2
key: request_count
This is maybe a little confusing and verbose, but it addresses the problem that request_count could be provided by the middleware or by the helper, and now we're configuring this detail instead of providing it in code. The consequence is that misconfiguration is now possible, leading to start failures (e.g., requesting a key that is not provided); the benefit is that it's explicit.
One downside of this configuration, an orthogonal one, is that we cannot use separate limiters for gRPC and HTTP traffic on the same port because, at the factory level, the HTTP and gRPC traffic are identical. Potentially this could be addressed by adding context metadata indicating which protocol is in use, so that Acquire() and Limit() have access to this information, moving this configuration into the limiters.
Nit:
grpc:
middlewares:
- middleware: limitermiddleware/rate12
http:
middlewares:
- middleware: limitermiddleware/rate12
Would this look better as:
grpc:
middlewares: [limitermiddleware/rate12]
http:
middlewares: [limitermiddleware/rate12]
This is maybe a little confusing and verbose, but it addresses the problem that request_count could be provided by the middleware or by the helper, and now we're configuring this detail instead of providing it in code. The consequence is that misconfiguration is now possible, leading to start failures (e.g., requesting a key that is not provided); the benefit is that it's explicit.
My preference is to be explicit over implicit, and bring all the complexity to the surface. That does mean more verbose config, but I think it should be clearer.
Re "requesting a key that is not provided": (IIUC) this is why I was proposing in #12700 (comment) to be even more explicit, and instead of having keys, make each type of limit its own configuration setting. Rebasing on your example above, what I have in mind is this:
receivers:
otlp:
protocols:
grpc:
middleware: [limitermiddleware/rate12]
http:
middleware: [limitermiddleware/rate12]
memory_size_limiter: admissionlimiter/memory1
extensions:
limitermiddleware/rate12:
network_bytes_limiter: ratelimiter/rate1
request_count_limiter: ratelimiter/rate2
One downside of this configuration, an orthogonal one, is that we cannot use separate limiters for gRPC and HTTP traffic on the same port because, at the factory level, the HTTP and gRPC traffic are identical. Potentially this could be addressed by adding context metadata indicating which protocol is in use, so that Acquire() and Limit() have access to this information, moving this configuration into the limiters.
Is this a hypothetical issue, or are there receivers that support both gRPC & HTTP on the same port? Anyway, if there are then they will need to figure out which requests are gRPC and which are not, so I agree that conveying through context metadata should be viable.
// Provider is an interface that provides access to different limiter types
// for specific weight keys.
//
// Extensions implementing this interface can be referenced by their
// names from component rate limiting configurations (e.g., limitermiddleware).
type Provider interface {
Not sure I understand the need for the Provider. Does an extension need to implement both rate and resource? Why not have a RateLimiterProvider (or RateLimiterExtension)?
Can one extension implement only one of the limiters?
}

// Option represents the consumer options
type Option func(*Config)
We prefer the pattern of an interface with a private func for options.
}

// NewConsumer creates a new limiterhelper Consumer
func NewConsumer(provider extensionlimiter.Provider, options ...Option) *Consumer {
Not sure you need the Consumer; you can make:

func WrapTraces(provider extensionlimiter.Provider, nextConsumer consumer.Traces, options ...Option)
type Config struct {
	// Limiter configures the underlying extension used for limiting.
	Limiter configlimiter.Limiter `mapstructure:",squash"`
}
Where would this be used?
)

// NewFactory returns a new factory for the Limiter Middleware extension.
func NewFactory() extension.Factory {
Not yet clear to me where this will be used.
// Limit attempts to apply rate limiting based on the provided weight value.
// Limit is expected to block the caller until the weight can be admitted.
This is very confusing: I see that https://github.com/open-telemetry/opentelemetry-collector/pull/12700/files#diff-50228098c7b1cb3e86d349cf309bfc70dae29f56dc0374f9a5e432f9659cf926R38 returns immediately if memory is over the limit, which in my opinion is the right thing. Here, "Limit is expected to block the caller until the weight can be admitted": I don't agree with this.
// It may block until resources are available or return an error if the limit | ||
// cannot be satisfied. |
Who controls the blocking vs not blocking behavior?
…tor into jmacd/limiter_v3
Description
After prior drafts, summarized in #12603, with feedback from @bogdandrutu and @axw, I explored adding limiters via middleware, structured as two separate configurations and two separate extensions.
This draft includes only the outline of 6 (six!) new modules, which piece together to support a variety of limiter and interceptor behaviors. While I am concerned about the scope of this (#9591, #7441), this appears to be a good direction:
Two new configuration modules, updates to configgrpc and confighttp:
Two extension interfaces:
Three extensions added/modified:
One helper library:
One receiver demonstrating item-count and memory-size limits:
Next steps:
Link to tracking issue
Part of #9591 #7441 #12603
Testing
NONE: for discussion
Documentation
NONE: TODO