Consistently use int64 type for string refs in pprofextended. #560

aalexand · 2024-05-17T23:45:13Z

This fixes two fields that used a different type. I don't think there is a good reason to be inconsistent.

linux-foundation-easycla · 2024-05-17T23:45:16Z

The committers listed above are authorized under a signed CLA.

✅ login: aalexand / name: Alexey Alexandrov (5b55fb7)

felixge

LGTM but @petethepig should confirm.

tigrannajaryan

Blocking temporarily until #559 is resolved.

javierhonduco · 2024-07-26T10:37:42Z

opentelemetry/proto/profiles/v1experimental/pprofextended.proto

@@ -235,7 +235,7 @@ message Sample {
  // Supersedes location_index.
  uint64 locations_length = 8;
  // A 128bit id that uniquely identifies this stacktrace, globally. Index into string table. [optional]
-  uint32 stacktrace_id_index = 9;
+  int64 stacktrace_id_index = 9;


Semi off-topic, but I was wondering whether it would make sense to change this field. I feel that the space savings from referencing the string table might not be that big, and it's not clear what encoding the ID would have in the string table (base64? ASCII? would the string have to be Unicode? etc.). Some alternatives:

two uint64, storing the high and low bits for the stack ID;

a bytes field;

if we want to keep the indexed approach, maybe we could move this to a new repeated field in the Profile message;

curious on your thoughts here! cc @florianl
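To make the alternatives above concrete, here is a minimal sketch of how a 128-bit stack ID could be represented in each form. The helper names (`split_id`, `join_id`, `id_to_bytes`, `text_encoding_sizes`) are hypothetical and not part of any proto or library; the sketch only illustrates the size and ambiguity trade-offs raised in the comment.

```python
import base64

# Hypothetical helpers for illustration only; not part of pprofextended.proto.
MASK64 = (1 << 64) - 1


def split_id(stack_id: int) -> tuple[int, int]:
    """Split a 128-bit ID into (high, low) unsigned 64-bit halves."""
    if not 0 <= stack_id < (1 << 128):
        raise ValueError("stack ID must fit in 128 bits")
    return stack_id >> 64, stack_id & MASK64


def join_id(high: int, low: int) -> int:
    """Rebuild the 128-bit ID from its two 64-bit halves."""
    return (high << 64) | low


def id_to_bytes(stack_id: int) -> bytes:
    """Fixed 16-byte big-endian form, as a `bytes` field could carry it."""
    return stack_id.to_bytes(16, "big")


def text_encoding_sizes(stack_id: int) -> dict[str, int]:
    """Length in characters of two possible string-table encodings."""
    raw = id_to_bytes(stack_id)
    return {"hex": len(raw.hex()), "base64": len(base64.b64encode(raw))}
```

The string-table variant needs an agreed text encoding (hex and base64 already differ in length), whereas the two-integer and `bytes` variants carry the raw 128 bits directly.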

Using bytes or string would mean another indirection / dynamic allocation for the in-memory representation so I would be careful with that.

That's a good point. I personally prefer having this as two uint64s, but I'm not sure everyone would agree with the slight increase in memory. I am not a protobuf expert; I am assuming that adding the extra uint64 field would increase the size of the message by 10 bytes, if I am understanding the docs right.
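As a rough check of the size estimate above, the following sketch computes how many bytes a varint-encoded `uint64` field occupies on the wire. It assumes proto3 implicit presence (a zero value is not serialized) and the standard varint wire type; the function names are hypothetical.

```python
def varint_len(value: int) -> int:
    """Bytes needed to varint-encode an unsigned 64-bit value."""
    if not 0 <= value < (1 << 64):
        raise ValueError("value must fit in an unsigned 64-bit integer")
    n = 1
    while value >= 0x80:  # each varint byte carries 7 payload bits
        value >>= 7
        n += 1
    return n


def uint64_field_len(value: int, field_number: int) -> int:
    """Wire size of a singular uint64 field: tag plus varint payload.

    Assumes proto3 implicit presence, where a zero value is omitted.
    """
    if value == 0:
        return 0
    tag = (field_number << 3) | 0  # wire type 0 is VARINT
    return varint_len(tag) + varint_len(value)
```

Under these assumptions, 10 bytes is the worst-case payload for a `uint64`, and the tag adds at least one more byte. Half of a hash-like 128-bit ID will usually have its high bits set, so it tends to sit near that worst case; a `fixed64` field would instead always use 8 bytes of payload.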

The concept of a stacktrace_id is new, and google/pprof doesn't know this element. It originates from the original optimyze stateful protocol and helped to communicate how often a trace was seen, while not sending the full stack trace every time. Back then, we went with 128 bits as we wanted the stacktrace IDs to be globally unique and to reduce collisions. I'm not sure how this field is used by other implementations of the protocol. So having stacktrace_id_index as an index into the string table is the most flexible way for the moment, I think.

It is just the type of the field that needs better alignment. For every index, int64 is used (also in google/pprof), so the current type of uint32 should change to align with that.
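For readers unfamiliar with the indexing scheme being discussed, here is a minimal sketch of a pprof-style string table, where fields such as `type_index` store an integer position instead of the string itself. The `StringTable` class is hypothetical; the only convention taken from google/pprof is that entry 0 is the empty string.

```python
class StringTable:
    """Hypothetical interning table; index 0 is reserved for ""."""

    def __init__(self) -> None:
        self.strings: list[str] = [""]
        self._positions: dict[str, int] = {"": 0}

    def intern(self, s: str) -> int:
        """Return the index of s, appending it on first use."""
        idx = self._positions.get(s)
        if idx is None:
            idx = len(self.strings)
            self.strings.append(s)
            self._positions[s] = idx
        return idx

    def lookup(self, index: int) -> str:
        """Resolve an index field back to its string."""
        if not 0 <= index < len(self.strings):
            raise IndexError(f"string index {index} out of range")
        return self.strings[index]
```

Because `uint32` and `int64` both use the varint wire type, a small non-negative index is encoded identically under either declaration; the change discussed here is about consistency of the declared type across index fields.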

Makes sense. In that case, should we update the comment and leave the number of bits for the stack ID unspecified?

Not sure how relevant this PR is anymore, after the discussion around the google/pprof donation in the last meeting. There are some cases where the profiling protocol can and should evolve and maybe this is one of them.

florianl · 2024-07-29T07:01:04Z

opentelemetry/proto/profiles/v1experimental/pprofextended.proto

@@ -355,7 +355,7 @@ message Location {
  bool is_folded = 5;

  // Type of frame (e.g. kernel, native, python, hotspot, php). Index into string table.
-  uint32 type_index = 6;
+  int64 type_index = 6;


Duplicate of #557

As #557 got merged, this should no longer be relevant.

Unblocking, see #559 (comment)

tigrannajaryan · 2024-08-13T15:35:33Z

I removed my block, the PR can progress.

florianl · 2024-09-04T14:43:38Z

opentelemetry/proto/profiles/v1experimental/pprofextended.proto

@@ -235,7 +235,7 @@ message Sample {
  // Supersedes location_index.
  uint64 locations_length = 8;
  // A 128bit id that uniquely identifies this stacktrace, globally. Index into string table. [optional]
-  uint32 stacktrace_id_index = 9;
+  int64 stacktrace_id_index = 9;


This field got removed with #575. Maybe we can close the PR now?

aalexand requested review from a team May 17, 2024 23:45

github-actions bot assigned jack-berg May 17, 2024

felixge approved these changes May 18, 2024

tigrannajaryan added the spec:profiles label May 21, 2024

tigrannajaryan previously requested changes May 28, 2024

javierhonduco reviewed Jul 26, 2024

florianl reviewed Jul 29, 2024

florianl reviewed Sep 4, 2024

aalexand closed this Sep 4, 2024

aalexand deleted the fix-string-refs branch September 4, 2024 22:39
