Skip to content

[SDK] LogRecord attribute limits enforcement#4157

Open
thc1006 wants to merge 15 commits into
open-telemetry:mainfrom
thc1006:feat/log-record-limits-4126
Open

[SDK] LogRecord attribute limits enforcement#4157
thc1006 wants to merge 15 commits into
open-telemetry:mainfrom
thc1006:feat/log-record-limits-4126

Conversation

@thc1006

@thc1006 thc1006 commented Jun 14, 2026

Copy link
Copy Markdown
Contributor

Fixes #4126.

Implements the LogRecord attribute count and value length limits described
by the logs SDK spec
(https://opentelemetry.io/docs/specs/otel/logs/sdk/#logrecord-limits).
Limits flow from LoggerProvider through LoggerContext to each LogRecord
created by Logger::CreateLogRecord. The infrastructure existed in the
declarative configuration tier (LogRecordLimitsConfiguration) but was
never wired into the runtime pipeline.

What changed

New header sdk/include/opentelemetry/sdk/logs/log_record_limits.h:

  • LogRecordLimits struct with spec defaults
    attribute_count_limit = 128 and
    attribute_value_length_limit = SIZE_MAX (unlimited).

SDK Recordable hierarchy:

  • Recordable gains a non-pure virtual SetLogRecordLimits with a no-op
    default body. Existing implementations that do not enforce limits
    inherit the no-op and compile unchanged. The virtual is appended at
    the end of the vtable to keep the change additive.
  • ReadableLogRecord gains a non-pure virtual GetDroppedAttributesCount
    returning zero by default.
  • ReadWriteLogRecord overrides both. SetAttribute checks the count
    limit before inserting and truncates string / array-of-string values
    whose length exceeds the configured limit. Truncation is byte level,
    mirroring the existing Span attribute behavior.
  • MultiRecordable propagates SetLogRecordLimits to every wrapped
    recordable.

OTLP exporter:

  • OtlpLogRecordable overrides SetLogRecordLimits. SetAttribute
    drops attributes beyond the count limit and increments the proto
    LogRecord.dropped_attributes_count field. Strings and string-array
    values are truncated after OtlpPopulateAttributeUtils::PopulateAttribute
    populates the proto AnyValue.

Wiring:

  • LoggerContext owns a LogRecordLimits value and exposes
    GetLogRecordLimits(). A new fourth parameter is appended to the
    existing constructor with a default-constructed default, so existing
    call sites compile unchanged.
  • Logger::CreateLogRecord (both ABI v1 and ABI v2 variants) calls
    recordable->SetLogRecordLimits(context_->GetLogRecordLimits())
    immediately after MakeRecordable, before any user attribute write.
  • A new LoggerProviderFactory::Create overload accepts
    LogRecordLimits and builds a LoggerContext with those limits
    internally.

Declarative configuration:

  • SdkBuilder::CreateLoggerProvider now maps
    LogRecordLimitsConfiguration from the parsed
    LoggerProviderConfiguration to the runtime LogRecordLimits and
    passes them to the new factory overload. The existing FIXME-SDK
    comment about wiring limits is removed.

Tests

sdk/test/logs/log_record_limits_test.cc (new):

  • Default behavior: 200 attributes accepted, zero dropped when no
    limits object is supplied.
  • Count enforcement: attributes beyond the limit are dropped and
    counted. Replacing an existing key while at the limit must not drop.
  • Length truncation for strings and string arrays.
  • Type selectivity: only string and array-of-string are truncated;
    int / double / bool pass through.
  • Combined count + length case.
  • Logger-level wiring test: a TrackingRecordable produced by a
    TrackingProcessor confirms that the limits configured on
    LoggerProvider reach the recordable returned by
    Logger::CreateLogRecord.

exporters/otlp/test/otlp_log_recordable_test.cc augmented with four
cases that verify the proto dropped_attributes_count field and the
string / array truncation logic.

Verification

Built and tested in the otelcpp-dev container:

  • Release (gcc, abiv1): log_record_limits_test 9/9,
    otlp_log_recordable_test 23/23, logger_sdk_test 9/9,
    logger_provider_sdk_test 9/9, simple_log_record_processor_test
    10/10, batch_log_record_processor_test 13/13. No regression.
  • ABI v2 (clang): log_record_limits_test 9/9.
  • -fno-rtti (Bazel nortti CI mirror): log_record_limits_test 9/9,
    otlp_log_recordable_test 23/23.
  • clang-format fixed point verified for all touched files.
  • IWYU (abiv1-preview): include-list reconciled with diagnostics
    from a local clang-22 run.
  • git diff --check whitespace: clean.

Known limitations

  • ElasticSearchRecordable is not modified in this PR. It inherits
    the no-op default and does not enforce limits. A follow-up PR can
    add enforcement if desired.
  • Truncation is byte level. Strings whose configured limit falls inside
    a multibyte UTF-8 sequence will be truncated mid-sequence, matching
    the existing Span attribute behavior in OtlpRecordable.
  • The default branch in the YAML parser
    (ParseLogRecordLimitsConfiguration) keeps the existing
    attribute_value_length_limit = 4096 fallback for callers that
    opt into the limits: block without specifying a length. Aligning
    that fallback with the spec default (unlimited) is left to a follow-up.

@thc1006 thc1006 requested a review from a team as a code owner June 14, 2026 14:25
@thc1006 thc1006 force-pushed the feat/log-record-limits-4126 branch from 7bdc3c5 to 1769757 Compare June 14, 2026 14:26
Apply attribute count and value length limits described by the logs
SDK spec (https://opentelemetry.io/docs/specs/otel/logs/sdk/#logrecord-limits)
during attribute writes. Limits flow from LoggerProvider through
LoggerContext to each LogRecord created by Logger::CreateLogRecord.

* Add LogRecordLimits struct with spec defaults: attribute_count_limit
  = 128 and attribute_value_length_limit = SIZE_MAX (unlimited).
* Recordable gains a virtual SetLogRecordLimits with a no-op default
  so existing implementations need not change. ReadableLogRecord gains
  a virtual GetDroppedAttributesCount returning zero by default.
* ReadWriteLogRecord and OtlpLogRecordable enforce the limits in
  SetAttribute. An attribute beyond attribute_count_limit is dropped
  and counted as dropped; string and string-array values whose byte
  length exceeds attribute_value_length_limit are truncated.
  Truncation is byte-level, mirroring the existing Span attribute
  behavior. The OTLP path also populates dropped_attributes_count on
  the proto LogRecord.
* MultiRecordable propagates the limits to every wrapped recordable.
* LoggerContext owns a LogRecordLimits value; Logger calls
  SetLogRecordLimits on the recordable returned by MakeRecordable
  before any user attribute writes. A new
  LoggerProviderFactory::Create overload accepts LogRecordLimits.
* The declarative configuration path (SdkBuilder) wires
  LogRecordLimitsConfiguration to the runtime LogRecordLimits.

Tests cover ReadWriteLogRecord and OtlpLogRecordable: defaults, count
enforcement (including the "replace existing key while at limit must
not drop" case), length truncation of strings and string arrays, type
selectivity (only string and array-of-string are truncated), the
combined count plus length case, and a Logger-level wiring test that
verifies the limits configured on LoggerProvider reach the recordable
returned by Logger::CreateLogRecord.

Fixes open-telemetry#4126

Signed-off-by: thc1006 <84045975+thc1006@users.noreply.github.com>
@thc1006 thc1006 force-pushed the feat/log-record-limits-4126 branch from 1769757 to e244122 Compare June 14, 2026 14:29
@codecov

codecov Bot commented Jun 14, 2026

Copy link
Copy Markdown

Codecov Report

❌ Patch coverage is 93.28358% with 9 lines in your changes missing coverage. Please review.
✅ Project coverage is 82.93%. Comparing base (82a2ada) to head (980ef8e).

Files with missing lines Patch % Lines
...xporters/otlp/src/otlp_populate_attribute_utils.cc 80.77% 5 Missing ⚠️
...include/opentelemetry/sdk/common/attribute_utils.h 97.50% 1 Missing ⚠️
sdk/include/opentelemetry/sdk/logs/exporter.h 0.00% 1 Missing ⚠️
...clude/opentelemetry/sdk/logs/readable_log_record.h 0.00% 1 Missing ⚠️
sdk/include/opentelemetry/sdk/logs/recordable.h 0.00% 1 Missing ⚠️
Additional details and impacted files

Impacted file tree graph

@@            Coverage Diff             @@
##             main    #4157      +/-   ##
==========================================
- Coverage   82.98%   82.93%   -0.05%     
==========================================
  Files         406      416      +10     
  Lines       17264    17400     +136     
==========================================
+ Hits        14325    14429     +104     
- Misses       2939     2971      +32     
Files with missing lines Coverage Δ
...ntelemetry/exporters/ostream/log_record_exporter.h 100.00% <100.00%> (ø)
...try/exporters/otlp/otlp_file_log_record_exporter.h 100.00% <100.00%> (ø)
...try/exporters/otlp/otlp_grpc_log_record_exporter.h 100.00% <100.00%> (ø)
...try/exporters/otlp/otlp_http_log_record_exporter.h 100.00% <100.00%> (ø)
...opentelemetry/exporters/otlp/otlp_log_recordable.h 100.00% <ø> (ø)
...try/exporters/otlp/otlp_populate_attribute_utils.h 100.00% <100.00%> (ø)
exporters/otlp/src/otlp_log_recordable.cc 38.16% <100.00%> (+2.11%) ⬆️
...pentelemetry/sdk/logs/batch_log_record_processor.h 100.00% <100.00%> (ø)
...include/opentelemetry/sdk/logs/log_record_limits.h 100.00% <100.00%> (ø)
sdk/include/opentelemetry/sdk/logs/processor.h 100.00% <100.00%> (ø)
... and 12 more
🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@proost

proost commented Jun 14, 2026

Copy link
Copy Markdown
Contributor

@thc1006
I'm working on the feature in here(#4132).
Do you want to implement this feature?

@thc1006

thc1006 commented Jun 15, 2026

Copy link
Copy Markdown
Contributor Author

@proost Hey, thanks for the ping. I went through #4132 and the review thread before pushing #4157. There's a fair bit of substantive work in yours that mine doesn't have: the OTEL_LOGRECORD_* env var integration, the benchmark file, ElasticSearchRecordable enforcement, and the yaml count_limit plumbing.

The reason I went ahead with #4157 even after seeing yours is lalitb's comment from yesterday (#4132 (comment)), which was asking to move enforcement out of the base Recordable hot path and let each recordable handle it the way that fits. That's the shape #4157 takes. Recordable just gains a SetLogRecordLimits virtual with a no-op default body, and the actual enforcement lives in ReadWriteLogRecord and OtlpLogRecordable. Non-enforcing exporters inherit the no-op so they pay nothing.

If the maintainers think #4157 is the better base, I'd like to layer the env vars, benchmark, and ES enforcement on top in follow-ups with attribution to your work. If they prefer the #4132 redesign route, happy to close #4157 and review the new version there.

@marcalff @lalitb would appreciate your call on which to drive forward.

@proost

proost commented Jun 15, 2026

Copy link
Copy Markdown
Contributor

@thc1006

Thanks for the explanation.

I'm always open to feedback, and if there were concerns about the direction of #4132, I would have been happy to discuss and adjust it. I do wish those concerns had been raised directly on the PR earlier, as it would have helped avoid duplicated work.

That said, I don't have a strong preference on whose implementation lands. If you would like to continue driving this work, I'm happy to step aside and close #4132 so we don't spend effort maintaining two competing PRs.

So would you like to continue driving this work?

@thc1006

thc1006 commented Jun 15, 2026

Copy link
Copy Markdown
Contributor Author

@proost You're right and that one's on me. The honest reason I missed #4132: I was searching open issues by help wanted label and recent merged PRs for landscape, not by fixes #4126 against open PRs. #4132 has no labels and the title in the open-PR list truncates around "attribute count l...", so I read it as a narrower length-only fix and moved on. That was a diligence gap on my part, not a deliberate decision to bypass your work. Sorry about that.

If you're comfortable stepping aside, yes I'd like to continue with #4157.

After #4157 lands I'd open these as follow-ups:

  1. OTEL_LOGRECORD_ATTRIBUTE_VALUE_LENGTH_LIMIT and OTEL_LOGRECORD_ATTRIBUTE_COUNT_LIMIT env-var integration through LogRecordLimits and the SdkBuilder path, modeled on your feat: log record attribute value length limit & attribute count limit #4132 work and adding you as a Co-authored-by on the commit.
  2. The benchmark file from feat: log record attribute value length limit & attribute count limit #4132, ported onto the per-recordable enforcement so the numbers reflect the new architecture, attribution in the commit body.
  3. ElasticSearchRecordable enforcement following the same SetLogRecordLimits pattern.
  4. YAML attribute_count_limit parsing along the lines feat: log record attribute value length limit & attribute count limit #4132 did.

If you'd rather author any of those follow-ups yourself and have me review, that works equally well. Whatever keeps the work landing.

@proost

proost commented Jun 15, 2026

Copy link
Copy Markdown
Contributor

Thanks for the clarification and the apology.

I’m happy for you to continue driving this work, so I’ll close #4132 to avoid duplicate effort.

@thc1006

thc1006 commented Jun 15, 2026

Copy link
Copy Markdown
Contributor Author

@proost Thanks for the clean handoff. The reviewer feedback from owent, lalitb, and ThomsonTan on #4132 still applies here, and I'll reference the specific discussions when each follow-up goes up. I'll ping you on each so you can shape the attribution however works for you.

const opentelemetry::sdk::resource::Resource *resource_ = nullptr;
const opentelemetry::sdk::instrumentationscope::InstrumentationScope *instrumentation_scope_ =
nullptr;
const opentelemetry::sdk::logs::LogRecordLimits *limits_ = nullptr;

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we should avoid storing a pointer to the limits object here. In the normal logger path, CreateLogRecord() sets this from LoggerContext, but the returned LogRecord is an independent unique_ptr. User code can keep that record and call SetAttribute() after the logger/provider/context is gone, which would leave limits_ dangling.

Can we instead copy LogRecordLimits into the recordable instead of storing &limits? The struct is tiny, and it avoids adding a lifetime requirement to LogRecord.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Agreed. Switched to by-value storage in ba8f9047. The default-constructed value carries the spec defaults (count=128, length=unlimited), so a fresh recordable now enforces the spec count cap from construction — happy to revisit that semantic if you'd prefer a no-op-when-unset sentinel instead.

Comment thread sdk/include/opentelemetry/sdk/logs/read_write_log_record.h Outdated
Comment thread exporters/otlp/src/otlp_log_recordable.cc Outdated
thc1006 added a commit to thc1006/opentelemetry-cpp that referenced this pull request Jun 18, 2026
…runcate)

Address three review comments from @lalitb on PR open-telemetry#4157:

1. Store LogRecordLimits by value inside ReadWriteLogRecord and
   OtlpLogRecordable instead of a raw pointer to a LoggerContext-owned
   object. A LogRecord is handed out as a unique_ptr, so a record
   outliving the context that produced it would otherwise dereference a
   dangling pointer when the user calls SetAttribute() later. The
   default-constructed value already carries the spec defaults
   (count=128, length=unlimited), so a fresh recordable enforces the
   spec count cap from construction, matching the PR's "enforcement"
   contract.

2. Drop the now-redundant `limits_ != nullptr` short-circuit at every
   enforcement site (4 in total). This also closes the Codecov-reported
   uncovered branch in otlp_log_recordable.cc.

3. Truncate UTF-8 string attributes at a code-point boundary instead of
   a raw byte boundary, so an OTLP protobuf string_value produced by
   truncation stays valid UTF-8 when the input was. Malformed UTF-8 and
   trailing lead bytes degrade to plain byte truncation. Logic adapted
   from open-telemetry#4132 with attribution.

While in the same truncation paths, also apply the byte-length cap to
raw bytes attributes (`vector<uint8_t>` on the SDK side, AnyValue
`bytes_value` on the OTLP side). Both were previously passing through
any size, even though the spec applies `attribute_value_length_limit`
to bytes attributes as well.

Test changes:
- Rename DefaultsPassThroughWithoutLimitsObject to
  DefaultRecordEnforcesSpecCountCap (200 attrs in, 128 stored, 72
  dropped) to reflect the new spec-correct default behavior.
- Add 3 UTF-8 regression tests on the SDK side (split prevention,
  exact fit at sequence boundary, malformed input falls back) plus a
  bytes-truncation test.
- Add 3 mirror tests on the OTLP side, a default-cap test, and a
  bytes-truncation test.

Refs: open-telemetry#4126

Co-authored-by: Hyeonho Kim <proost@apache.org>
Signed-off-by: thc1006 <84045975+thc1006@users.noreply.github.com>
Comment thread sdk/src/logs/read_write_log_record.cc Outdated
…are truncate)

Address three review comments from @lalitb on PR open-telemetry#4157, plus a
follow-up from @owent (r3434178971).

1. Store LogRecordLimits by value inside ReadWriteLogRecord and
   OtlpLogRecordable instead of a raw pointer to a LoggerContext-owned
   object. A LogRecord is handed out as a unique_ptr, so a record
   outliving the context that produced it would otherwise dereference a
   dangling pointer when the user calls SetAttribute() later. The
   default-constructed value already carries the spec defaults
   (count=128, length=unlimited), so a fresh recordable enforces the
   spec count cap from construction, matching the PR's "enforcement"
   contract.

2. Drop the now-redundant `limits_ != nullptr` short-circuit at every
   enforcement site (4 in total). This also closes the Codecov-reported
   uncovered branch in otlp_log_recordable.cc.

3. Truncate OTLP string attributes at a UTF-8 code-point boundary
   instead of a raw byte boundary, so the protobuf string_value produced
   by truncation stays valid UTF-8 when the input was. Malformed UTF-8
   and trailing lead bytes degrade to plain byte truncation. Logic
   adapted from open-telemetry#4132 with attribution.

   The SDK-side ReadWriteLogRecord truncation stays as plain byte cut.
   The in-memory `OwnedAttributeValue::std::string` variant may
   legitimately carry raw bytes when constructed from a non-UTF-8
   source, so forcing UTF-8 boundary semantics there would over-truncate
   that case (per @owent's r3434178971, echoing the same point on open-telemetry#4132
   r3409677314). Each recordable's truncation strategy now matches its
   own consumer's wire-format requirement: SDK in-memory has no wire
   requirement, OTLP protobuf requires valid UTF-8.

While in the same truncation paths, also apply the byte-length cap to
raw bytes attributes (`vector<uint8_t>` on the SDK side, AnyValue
`bytes_value` on the OTLP side). Both were previously passing through
any size, even though the spec applies `attribute_value_length_limit`
to bytes attributes as well.

Test changes:
- Rename DefaultsPassThroughWithoutLimitsObject to
  DefaultRecordEnforcesSpecCountCap (200 attrs in, 128 stored, 72
  dropped) to reflect the new spec-correct default behavior.
- Add bytes-truncation tests on both SDK and OTLP sides.
- Add 3 UTF-8 regression tests on the OTLP side only (split prevention,
  exact fit at sequence boundary; malformed-fallback omitted since the
  algorithm's seq=1 fallback for invalid continuations is implementation
  detail rather than wire contract).
- Add a default-cap test on the OTLP side.

Refs: open-telemetry#4126

Co-authored-by: Hyeonho Kim <proost@apache.org>
Signed-off-by: thc1006 <84045975+thc1006@users.noreply.github.com>
@thc1006 thc1006 force-pushed the feat/log-record-limits-4126 branch from ba8f904 to 1612edc Compare June 18, 2026 09:08

@dbarker dbarker left a comment

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the PR. Please see comments below.

Comment thread exporters/otlp/src/otlp_log_recordable.cc Outdated
Comment thread sdk/src/logs/read_write_log_record.cc Outdated
Comment thread sdk/src/logs/read_write_log_record.cc Outdated
…dupe SetAttribute lookup)

Address @dbarker's three review comments on PR open-telemetry#4157.

* r3442854800: Extract Utf8SafePrefixLength and TruncateProtoAttributeValue
  from the anonymous namespace in otlp_log_recordable.cc into
  OtlpPopulateAttributeUtils as static methods, with new direct unit tests
  in otlp_populate_attribute_utils_test.cc. The upcoming SpanLimits PR
  will reuse these from the OTLP trace recordable. The OTLP helper that
  was previously TruncateProtoStringValue is renamed
  TruncateProtoAttributeValue to reflect that it covers string_value,
  bytes_value, and array_value branches.

* r3443005793: Extract the SDK byte-length truncation helper into
  sdk::common::TruncateAttributeValueByteLength (inline, declared in the
  existing sdk/common/attribute_utils.h), with new direct unit tests in
  attribute_utils_test.cc. The upcoming SpanLimits PR will reuse this
  from ReadWriteSpanData. The new name reflects that the helper covers
  string, string-array, AND bytes variants rather than only strings.

* r3443034336: Rewrite ReadWriteLogRecord::SetAttribute to use a single
  unordered_map lookup. The previous code did .find() to gate the count
  cap, then operator[] to fetch-or-insert; the new code does .find()
  followed by conditional .emplace(), so existing-key replacement and
  new-key insertion each cost one hash lookup.

Refs: open-telemetry#4126
Signed-off-by: thc1006 <84045975+thc1006@users.noreply.github.com>
Comment thread sdk/include/opentelemetry/sdk/common/attribute_utils.h Outdated
Two small fixes on top of c4532cd:

* Address @dbarker's r3444034234: rewrite
  sdk::common::TruncateAttributeValueByteLength to dispatch on the
  variant via nostd::get_if (returns nullptr if the alternative does
  not hold) instead of nostd::holds_alternative + nostd::get. The
  helper is declared noexcept; nostd::get throws when the alternative
  does not match, which would invoke std::terminate even though the
  preceding holds_alternative check makes that path unreachable in
  practice. The get_if rewrite removes the throwing call entirely so
  the noexcept contract is statically honored.

* Restore the Bazel link of the new otlp_populate_attribute_utils_test
  target by adding //sdk/src/metrics to its deps, matching the
  existing otlp_log_recordable_test target. The new test target links
  against :otlp_recordable, which transitively references
  sdk::metrics::AdaptingCircularBufferCounter symbols; Bazel's strict
  layering requires the dep to be declared at the cc_test level. CMake
  did not catch this because its default link aggregates the whole
  library.

Refs: open-telemetry#4126
Signed-off-by: thc1006 <84045975+thc1006@users.noreply.github.com>
@thc1006 thc1006 requested review from dbarker and lalitb June 19, 2026 19:41
dbarker and others added 2 commits June 20, 2026 10:55
… + clang-tidy NOLINT

Three CI failure classes surfaced after the branch-update merge of main
into the PR branch (ccdd5e9). Address each:

* MSVC C2220 (warning-as-error) on attribute_utils.h:93 — the
  `for (auto &s : *vec)` loop inside TruncateAttributeValueByteLength
  shadowed the `if (auto *s = get_if<std::string>(...))` on line 84.
  GCC/Clang accept if-init scoping for the two `s`, MSVC /W4 /WX
  treats C4456 (declaration hides previous local) as an error. Rename
  the loop variable to `element` to remove the shadow.

* IWYU drop four includes that the v3+v4 utility-extraction refactor
  made redundant:
  - sdk/src/logs/read_write_log_record.cc: <vector>
  - exporters/otlp/src/otlp_log_recordable.cc: <string>
  - exporters/otlp/test/otlp_populate_attribute_utils_test.cc:
    <cstddef> and <limits>

* clang-tidy abiv2-preview misc-no-recursion on
  OtlpPopulateAttributeUtils::TruncateProtoAttributeValue — the recursive
  descent into AnyValue::kArrayValue children is intentional and bounded
  by the SDK-side AttributeValue variant depth. Suppress with a
  NOLINTBEGIN/END(misc-no-recursion) block, matching the codebase
  precedent in sdk/src/configuration/configuration_parser.cc.

The remaining clang-tidy bugprone-exception-escape warnings in the same
config (read_write_log_record.cc:68 SetBody, :158 SetAttribute and
otlp_populate_attribute_utils.cc:62/220 PopulateAnyValue) trace through
nostd::visit / nostd::get into bad_alloc paths that pre-date this PR;
they are out of scope for the LogRecord limits change.

Refs: open-telemetry#4126
Signed-off-by: thc1006 <84045975+thc1006@users.noreply.github.com>
@thc1006

thc1006 commented Jun 21, 2026

Copy link
Copy Markdown
Contributor Author

@dbarker the only failing CI on 8c965e7e is DocFX check and it's a transient infrastructure failure unrelated to this PR — the GitHub Releases download for docfx.zip returned HTTP 500:

Attempt to get headers for https://github.com/dotnet/docfx/releases/download/v2.58.5/docfx.zip failed.
The remote server returned an error: (500) Internal Server Error.
Chocolatey installed 0/1 packages. 1 packages failed.
docfx (exited 404)

The job failed in 40 seconds during the docfx download step before any of the PR's content was even read. Would you mind re-running just this job (I don't have admin rights to do it myself)? All other 31 jobs on this head are SUCCESS / 2 SKIPPED, no other failures.

@dbarker dbarker left a comment

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the cleanup. Please see comments below. The main question is: Can the truncation be applied on creation of the owned type instead of after in order to prevent allocating the overflowing value?

Comment thread sdk/src/logs/read_write_log_record.cc Outdated
Comment thread sdk/src/logs/read_write_log_record.cc Outdated
Comment thread exporters/otlp/src/otlp_populate_attribute_utils.cc Outdated
…nversion)

Address @dbarker's three review comments from 2026-06-23 (r3461438236,
r3461500683, r3461573442) by moving attribute-value-length truncation
from a separate post-conversion step into the conversion paths
themselves.

Architecture change:

  Before: convert, default-construct map slot, assign converted,
          post-truncate (helper)

  After:  convert-with-limit, single emplace

sdk/include/opentelemetry/sdk/common/attribute_utils.h
* AttributeConverter gains an `explicit AttributeConverter(std::size_t
  max_length)` constructor. String, bytes, and string-array overloads
  apply the byte-length cap during the OwnedAttributeValue
  construction. The default-constructed converter is unchanged
  (`max_length_ = numeric_limits<size_t>::max()`), so the existing
  callers in `instrumentation_scope.h` and `read_write_log_record.cc
  SetBody` keep their current no-truncation behavior.
* Delete `TruncateAttributeValueByteLength` (its three branches now
  live in the converter; the lone production caller was the SDK
  ReadWriteLogRecord SetAttribute).

sdk/src/logs/read_write_log_record.cc
* `SetAttribute` rewritten to a single `find` + conditional `emplace`,
  with the limit-aware converter producing the truncated value in one
  step. Removes the previous default-construct-then-assign sequence
  and the now-redundant post-truncate branch.

exporters/otlp/include/.../otlp_populate_attribute_utils.h
exporters/otlp/src/otlp_populate_attribute_utils.cc
* All four `PopulateAttribute` / `PopulateAnyValue` overloads gain a
  trailing `std::size_t max_length = numeric_limits<size_t>::max()`
  parameter (default preserves the existing call-site behavior for the
  other callers in metrics/recordable/resource paths).
* String and bytes branches apply truncation in place during the proto
  set: `Utf8SafePrefixLength` for `string_value` (preserves protobuf
  wire-format valid UTF-8), plain byte cut for `bytes_value`.
* Delete `TruncateProtoAttributeValue`. The flat-AttributeValue input
  means the recursive array descent it carried was dead code, as
  dbarker noted. The `NOLINT(misc-no-recursion)` block goes with it.
* `Utf8SafePrefixLength` primary overload now takes `(const char*,
  std::size_t, std::size_t)` so it can be called without constructing
  a temporary `std::string` from a `string_view`. A thin inline
  `(const std::string&, std::size_t)` overload preserves backward
  compatibility for any existing direct user.

exporters/otlp/src/otlp_log_recordable.cc
* `SetAttribute` now calls the limit-aware `PopulateAttribute` directly
  in one step, removing the separate post-truncate branch.

Test changes:
* Delete the five `TruncateAttributeValueByteLength` unit tests in
  `sdk/test/common/attribute_utils_test.cc` (helper is gone; end-to-end
  coverage is preserved by `log_record_limits_test.cc`).
* Delete the six `TruncateProtoAttributeValue` unit tests in
  `exporters/otlp/test/otlp_populate_attribute_utils_test.cc` (helper
  is gone; end-to-end coverage is preserved by
  `otlp_log_recordable_test.cc`).
* The seven `Utf8SafePrefixLength` unit tests still cover the helper
  that the new in-place truncation path calls into.

Refs: open-telemetry#4126
Signed-off-by: thc1006 <84045975+thc1006@users.noreply.github.com>
@thc1006 thc1006 force-pushed the feat/log-record-limits-4126 branch from ba057d4 to cd413f8 Compare June 23, 2026 21:12
@thc1006 thc1006 requested a review from dbarker June 24, 2026 06:45
Comment thread sdk/src/logs/logger.cc Outdated

auto recordable = context_->GetProcessor().MakeRecordable();

recordable->SetLogRecordLimits(context_->GetLogRecordLimits());

@lalitb lalitb Jun 24, 2026

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we avoid adding this unconditional virtual call on every CreateLogRecord()? This is now paid by all SDK log paths, including custom recordables that do not enforce limits.

One cleaner option would be passing LogRecordLimits through MakeRecordable(...), but that looks API-breaking for existing exporters. A safer incremental option may be to compute/store a config flag once and only call SetLogRecordLimits() for exporters/recordables that actually enforce limits, like OTLP and debug/ostream.

That would keep the extra limit work out of unrelated SDK hot paths.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for catching this. v7 moved truncation into the recordable (stateful AttributeConverter), so the SetLogRecordLimits setter is what carries that state, and you are right that paying for the call on the common path is not justified when the recordable ignores it.

The "compute/store a config flag once" path lines up well: a RecordableEnforcesLogRecordLimits() virtual on LogRecordProcessor (and LogRecordExporter, defaulting false), cached by Logger at construction, and used to gate the existing call:

if (recordable_enforces_limits_) {
  recordable->SetLogRecordLimits(context_->GetLogRecordLimits());
}

OTLP and ostream exporters override to return true. MultiLogRecordProcessor returns true if any child does. Custom processors keep the default and pay only a bool load plus a predicted branch. The new virtuals append to the existing vtable, so subclass layout is unchanged.

If that lines up with what you had in mind, I will push v8 with this gate.

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, that direction makes sense to me. One small refinement: since LogRecordLimits and the processor/exporter chain are provider/context-level configuration, can we cache this capability on LoggerContext rather than on each Logger?

That would still avoid the unconditional SetLogRecordLimits() virtual call in CreateLogRecord(), but it also avoids adding per-logger state for a provider-level setting. The hot path would just read the context-level flag and call SetLogRecordLimits() only when some processor/exporter actually enforces limits.

For example:

  LoggerContext {
    LogRecordLimits limits_;
    bool recordable_enforces_limits_;
  }

Then each logger can gate the setup with:

  if (context_->RecordableEnforcesLogRecordLimits())
  {
    recordable->SetLogRecordLimits(context_->GetLogRecordLimits());
  }

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Agreed, that's a cleaner placement. LoggerContext is where the processor chain and LogRecordLimits already live, so the capability flag belongs there. I'll add the cached recordable_enforces_limits_ member and the RecordableEnforcesLogRecordLimits() getter on LoggerContext, populated at construction from the processor chain. Both gating call sites in Logger::CreateLogRecord() will route through the context getter as you showed.

std::string s(v);
if (s.size() > max_length_)
{
s.resize(max_length_);

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suggest keeping UTF-8-aware truncation here. The reasoning:

  1. For UTF-8 strings: If a user stores a UTF-8 string in std::string, they would expect the truncated result to remain valid UTF-8. A naive byte-length truncation risks splitting a multi-byte code point, which could cause the OTLP exporter to fall back to bytes_value instead of the intended string_value.
  2. For raw bytes: If a user stores raw binary data in std::string, they would expect simple byte-length truncation without any encoding semantics.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, agreed. The v7 docstring defends byte cut on the grounds that std::string may carry raw bytes from a non-UTF-8 source, but the OTel convention that std::string means UTF-8 (with nostd::span<const uint8_t> for raw bytes) is the stronger signal here.

The v8 plan:

  1. Move Utf8SafePrefixLength into sdk/include/opentelemetry/sdk/common/attribute_utils.h alongside AttributeConverter (inline, no new file; the OTLP-side wrapper forwards into it).
  2. Route the AttributeConverter std::string / nostd::string_view / const char* overloads through it, plus the per-element cut in the nostd::span<const nostd::string_view> array overload; nostd::span<const uint8_t> keeps byte cut as you describe.
  3. The existing 8 OTLP-side Utf8SafePrefixLength tests stay; sdk/test/common/attribute_utils_test.cc gets equivalent SDK-side coverage.
  4. Docstring updates to match.

This composes with the gating ask in r3468037047; both land in the same v8 push as separate commits.

Comment thread sdk/include/opentelemetry/sdk/common/attribute_utils.h Outdated
Comment thread exporters/otlp/src/otlp_populate_attribute_utils.cc Outdated
Comment thread sdk/include/opentelemetry/sdk/common/attribute_utils.h Outdated
@ThomsonTan

Copy link
Copy Markdown
Contributor

The introduced limit changes the current default behavior, a default-constructed ReadWriteLogRecord / OtlpLogRecordable now enforces the 128 attribute cap even when no limits were configured and the logger wiring path was never used. Suggest iither make "no explicit SetLogRecordLimits = unlimited" the recordable default and let the wiring layer inject 128, or keep the spec default but call this out in CHANGELOG as a breaking change.

ThomsonTan and others added 3 commits June 25, 2026 10:06
CreateLogRecord() called SetLogRecordLimits() on every record
unconditionally, paying a virtual dispatch even for recordables that do
not enforce limits. Add a RecordableEnforcesLogRecordLimits() capability
query on LogRecordExporter and LogRecordProcessor (default false; OTLP
and ostream exporters return true; Simple and Batch processors delegate
to their exporter; Multi returns true if any child does). LoggerContext
caches the result at construction and refreshes it in AddProcessor, so
the Logger hot path reads a plain bool and only calls SetLogRecordLimits()
when a processor actually enforces limits.

The new virtuals are appended at the end of their vtables to keep the
change additive.

Signed-off-by: thc1006 <84045975+thc1006@users.noreply.github.com>
The in-memory AttributeConverter cut string values at a raw byte
boundary, which can split a multi-byte UTF-8 sequence and leave an
invalid std::string. Move Utf8SafePrefixLength from the OTLP recordable
into sdk::common so the converter can share it, and truncate the
std::string, string_view, const char*, and string-array alternatives at a
UTF-8 code-point boundary. Raw bytes (span<const uint8_t>) keep the exact
byte cut since they carry no encoding. The OTLP helper now forwards to the
shared implementation, leaving its public surface and tests unchanged.

Also fold in review nits: delegate the const char* overload to the
string_view overload to avoid copying an oversized C string before
truncating, drop a redundant pointer cast in the bytes overload, and use
std::strlen.

Signed-off-by: thc1006 <84045975+thc1006@users.noreply.github.com>
@thc1006

thc1006 commented Jun 25, 2026

Copy link
Copy Markdown
Contributor Author

BTW the v+num is Number of fixed commit (revisions)

A recent merge of main lowered the abiv1-preview and abiv2-preview
clang-tidy ceilings to 331 and 341. Two warnings need addressing to stay
within them:

- Initialize LoggerContext::recordable_enforces_limits_ in the member
  initializer list instead of the constructor body
  (cppcoreguidelines-prefer-member-initializer).
- Move the rvalue reference parameter in the test TrackingProcessor
  OnEmit stub (cppcoreguidelines-rvalue-reference-param-not-moved).

Signed-off-by: thc1006 <84045975+thc1006@users.noreply.github.com>
@thc1006

thc1006 commented Jun 26, 2026

Copy link
Copy Markdown
Contributor Author

Good catch, you're right. A default-constructed ReadWriteLogRecord / OtlpLogRecordable starts with limits_{} = {128, unlimited}, so it caps at 128 even when no provider wired it up. That goes against the intent that these limits are provider-level.

I'll go with option 1: the recordable defaults to no limits, and the wiring injects the configured ones. This lines up with the RecordableEnforcesLogRecordLimits gating in the latest revision: LoggerContext keeps the spec default ({128, unlimited}) and the Logger injects it through SetLogRecordLimits only when a processor enforces. The result:

  • a bare recordable, or one behind a non-enforcing processor: no limits, matching the pre-PR behavior.
  • a recordable created through the standard provider path: the spec default of 128 (or whatever the provider was configured with) applies.

Concretely I'll default limits_ to unbounded in both recordables, keep the LoggerContext default at the spec values, and update the DefaultRecordEnforcesSpecCountCap test to match. The CHANGELOG will describe the provider-path enforcement as the new behavior.

Happy to do option 2 instead (keep the spec default on the recordable and just document it) if you'd rather, but option 1 looks closer to the provider-level intent.

thc1006 and others added 2 commits June 26, 2026 12:29
…rovider

A default-constructed ReadWriteLogRecord / OtlpLogRecordable previously
carried the spec-default LogRecordLimits (count 128), so a recordable used
without any LoggerProvider wiring enforced the 128 cap. That changed
behavior for code that constructs a recordable directly. Default the
recordable to no limits and let the LoggerProvider wiring inject the
configured limits through SetLogRecordLimits, which it already does only
when a processor enforces them. LoggerContext keeps the spec default, so a
record created through the standard provider path still gets the 128 cap
(or the configured value), while a bare recordable no longer caps on its
own.

Signed-off-by: thc1006 <84045975+thc1006@users.noreply.github.com>

@dbarker dbarker left a comment

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the revision. Please see feedback below. Given the changes please review the code comments and update if needed.

ASSERT_EQ(nostd::get<int64_t>(record.GetAttributes().at("k1")), 99);
}

TEST(LogRecordLimits, LengthLimitTruncatesString)

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please also test truncation for the const char* and span paths.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added a const char* truncation test (LengthLimitTruncatesConstCharPtr). The span paths are already covered: LengthLimitTruncatesEachStringInArray for the string array and LengthLimitTruncatesBytesAttribute for the bytes span.

* Apply attribute count and value length limits to this log. The default
* implementation is a no-op; concrete recordables that wish to enforce
* limits override this and store the supplied limits before any
* SetAttribute call is observed. The referenced object must outlive the

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please update the comment now that recordable should copy the limits

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated: the comment now says implementations copy the supplied limits, so the caller does not need to keep the object alive.


/**
* Apply attribute count and value length limits. Must be called before any
* SetAttribute call to take effect. The referenced limits object must

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please update the comment to account for limits being copied to recordables.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated to say the limits are copied into the recordable.

* recordables that ignore limits. Exporters whose recordable applies the
* limits (OTLP, ostream) override this to return true.
*
* This virtual is appended at the end of the LogRecordExporter vtable to keep

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this second paragraph in the comment can be removed as an implementation detail.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed. I also removed the same paragraph from the LogRecordProcessor version in processor.h for consistency.

{
const char *str_value = nostd::get<const char *>(value);
const char *str_value = nostd::get<const char *>(value);
const std::size_t str_len = std::strlen(str_value);

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

are there worthwhile short cuts that can be taken if the max_length limit is not set (or is the default max size_t) ?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good point. Added an early return in Utf8SafePrefixLength: when max_bytes >= size (which includes the default unbounded SIZE_MAX), the whole value fits so there is nothing to truncate and the per-byte scan is skipped. It lives in the shared helper, so both this OTLP path and the in-memory AttributeConverter benefit.

…shortcut, test)

- Correct the SetLogRecordLimits comments on the base Recordable,
  ReadWriteLogRecord, and OtlpLogRecordable: the limits are copied, so the
  caller does not need to keep the supplied object alive (the previous
  "must outlive" wording was stale).
- Drop the vtable-append implementation-detail paragraph from the
  RecordableEnforcesLogRecordLimits comments on LogRecordExporter and
  LogRecordProcessor.
- Short-circuit Utf8SafePrefixLength when the whole value fits within the
  budget (max_bytes >= size, including the unbounded default), so the
  common path skips the per-byte scan.
- Add a const char* attribute value truncation test.

Signed-off-by: thc1006 <84045975+thc1006@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Implement LogRecord limits

6 participants