Rate limits that allow for bursts and protect sensitive endpoints by duckduckgrayduck · Pull Request #42 · MuckRock/python-documentcloud

duckduckgrayduck · 2026-06-12T20:14:38Z

This adds rate limits for endpoints that are especially sensitive to scripts which use our official libraries without any rate limiting of their own. We should set a good example by rate limiting for them in the client. The one exception is fetching assets of public documents: those requests go to s3.documentcloud.org rather than through the client, so they need a separate rate limiter configured in documents.py. I feel that this is an acceptable tradeoff. I have tried these limits locally and believe they strike a good balance: someone can still run a small workflow quickly, but sustained use won't strain our services. This uses the token-bucket library: https://pypi.org/project/token-bucket/
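For reviewers unfamiliar with the approach, here is a minimal standard-library sketch of how a token bucket allows a short burst and then throttles to a steady rate. It is an illustration only, not the code in this PR and not the token-bucket library's API: the `TokenBucket` class, its method names, and the `rate`/`capacity` values are all hypothetical.

```python
import time


class TokenBucket:
    """Illustrative token bucket (hypothetical, not the token-bucket library)."""

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate          # tokens added per second (steady-state rate)
        self.capacity = capacity  # maximum stored tokens (burst size)
        self._clock = clock
        self._tokens = float(capacity)  # start full so an initial burst is allowed
        self._last = clock()

    def _refill(self):
        # Add tokens for the time elapsed since the last check, capped at capacity.
        now = self._clock()
        elapsed = now - self._last
        self._tokens = min(self.capacity, self._tokens + elapsed * self.rate)
        self._last = now

    def try_consume(self, tokens=1):
        # Return True and spend tokens if enough are available, else False.
        self._refill()
        if self._tokens >= tokens:
            self._tokens -= tokens
            return True
        return False

    def wait_time(self, tokens=1):
        # Seconds until `tokens` would be available (0.0 if available now).
        self._refill()
        return max(0.0, (tokens - self._tokens) / self.rate)


if __name__ == "__main__":
    # Assumed example values: bursts of up to 3 requests, 2 requests/second sustained.
    bucket = TokenBucket(rate=2, capacity=3)
    for i in range(5):
        if not bucket.try_consume():
            time.sleep(bucket.wait_time())
            bucket.try_consume()
        print("request", i)
```

A caller would check the bucket before each request and sleep for `wait_time()` when it is empty, so a short script finishes its first few calls immediately while a long-running one settles to the sustained rate.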

python-muckrock will receive the same update.

I noticed that a custom user agent was set for calls to s3.documentcloud.org. This change also sets it to the user agent we should be using, the one from the client.

duckduckgrayduck added 2 commits June 12, 2026 12:44

Add sane rate limits

22076c4

Add burst-based rate limiting

4b0865d

duckduckgrayduck requested review from eyeseast and mitchelljkotler June 12, 2026 20:14

Add token-bucket to main.yml

f6ab09b

duckduckgrayduck wants to merge 3 commits into master from rate_limits
