protobuf-net / protobuf-net Public

hook a visitor into protogen to implement schema-aware payload parsing #919

Draft

mgravell wants to merge 29 commits into main from protogen-decode

Member

mgravell commented Jun 9, 2022 (edited)

Scenarios:

- enable usage similar to `protoc --decode` to walk a payload+schema and understand the contents
- enable weak object-model usage, i.e. payload+schema to `dynamic` or similar, ideally in a way that is usable from other common serializers, for example as a transcoder
- act as a backbone for any other schema+payload scenarios without the implementor needing to do all the thinking

Currently this is implemented via:

- `DecodeVisitor` - has the logic to iterate over a payload while tracking a schema
- `TextDecodeVisitor : DecodeVisitor` - provides a `protoc --decode` style raw text dump of the interpreted payload contents
- `ObjectDecodeVisitor : DecodeVisitor` - provides an `ExpandoObject`-based object-model from the interpreted payload
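
To make the split between the base visitor and its two subclasses concrete, here is a minimal, language-neutral sketch in Python. It is not the protobuf-net API: every class and function name, and the dictionary-based schema, is hypothetical. It handles only varint and length-delimited fields. The sample payload is the classic encoding of field 1 = 150 and field 2 = "testing".

```python
from typing import Any, Dict, List, Tuple

# Hypothetical schema shape: field number -> (field name, scalar kind).
Schema = Dict[int, Tuple[str, str]]


def read_varint(buf: bytes, pos: int) -> Tuple[int, int]:
    """Decode a base-128 varint at pos; return (value, next position)."""
    result = shift = 0
    while True:
        byte = buf[pos]
        pos += 1
        result |= (byte & 0x7F) << shift
        if not byte & 0x80:
            return result, pos
        shift += 7


class DecodeVisitorSketch:
    """Iterates over a payload while tracking the schema (base visitor role)."""

    def visit(self, payload: bytes, schema: Schema) -> None:
        pos = 0
        while pos < len(payload):
            tag, pos = read_varint(payload, pos)
            field_number, wire_type = tag >> 3, tag & 7
            name, kind = schema.get(field_number, (str(field_number), "unknown"))
            if wire_type == 0:  # varint
                value, pos = read_varint(payload, pos)
            elif wire_type == 2:  # length-delimited
                length, pos = read_varint(payload, pos)
                raw = payload[pos:pos + length]
                pos += length
                value = raw.decode("utf-8") if kind == "string" else raw
            else:
                raise NotImplementedError(f"wire type {wire_type} is outside this sketch")
            self.on_field(name, value)

    def on_field(self, name: str, value: Any) -> None:
        raise NotImplementedError


class TextVisitorSketch(DecodeVisitorSketch):
    """Collects a text dump, one 'name: value' line per field."""

    def __init__(self) -> None:
        self.lines: List[str] = []

    def on_field(self, name: str, value: Any) -> None:
        shown = f'"{value}"' if isinstance(value, str) else str(value)
        self.lines.append(f"{name}: {shown}")


class ObjectVisitorSketch(DecodeVisitorSketch):
    """Builds a loosely typed object model (here, a plain dict)."""

    def __init__(self) -> None:
        self.obj: Dict[str, Any] = {}

    def on_field(self, name: str, value: Any) -> None:
        self.obj[name] = value


SCHEMA: Schema = {1: ("a", "int32"), 2: ("b", "string")}
PAYLOAD = bytes.fromhex("089601" "120774657374696e67")

text = TextVisitorSketch()
text.visit(PAYLOAD, SCHEMA)
model = ObjectVisitorSketch()
model.visit(PAYLOAD, SCHEMA)
```

The design point is that the payload walk lives in one place, and each output style only overrides the per-field callback.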

Work items:

- basic command-line hooks into protogen (global tool)
- implement base visitor that processes fields via a parser
- implement text visitor (i.e. stdout) for output
- implement object-model visitor
- documentation (at a minimum, for `-h` and `///`; a new page under `/docs` would be good too, though)
- basic unit test
- comprehensive unit tests
- validation of command-line usage (rather than library usage)
- comparison to protoc output
- well known types (`.google.protobuf.Duration`, `.google.protobuf.Timestamp`, for example)
- types with special expected formatting (`.google.protobuf.Timestamp`, for example) (note: this might not apply if we get the well-known-types support to map to regular .NET types that already have the correct handling)
  - in particular, `struct.proto` and `wrappers.proto`
- how should enums be represented in `ObjectDecodeVisitor`? currently uses `int`, but: is that right? should it be an option on the object, i.e. `new ObjectDecodeVisitor { Enums = EnumMode.Name }`? (which would presumably use the name when possible, and the integer as fallback) - Google lib writes names, reads either names or integers
- support for packed primitive values (note this is based on `reader.WireType` being `String` for a primitive value (i.e. not string, bytes, group or message) and not on anything related to field)
- consideration of extension fields; i.e. should we try to discover known extended field definitions, and present them sensibly?
- what output (if any) for unknown fields?
- need to detect and implement map scenarios
- `ObjectDecodeVisitor`: default values and presence tracking; i.e. in proto3 a non-presence-tracked field with value zero/false/etc is not transmitted, but that is still the expected value (if it is presence-tracked, then it is transmitted if assigned)
- `ObjectDecodeVisitor`: option to use `json_name` rather than `name` (when `json_name` specified)
- naming; are the names correct? `ObjectDecodeVisitor` seems... weird
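
Two of the work items above, enum representation and default values with presence tracking, can be illustrated with a small Python sketch. This is only an illustration of the behaviour described, not the protobuf-net implementation. `FieldSketch`, `apply_defaults`, and the `use_enum_names` flag are hypothetical names.

```python
from typing import Any, Dict, List, NamedTuple, Optional


class FieldSketch(NamedTuple):
    """Hypothetical field descriptor, just enough for this example."""
    name: str
    kind: str                                  # "int32", "bool", "string" or "enum"
    has_presence: bool = False                 # True when presence is tracked
    enum_names: Optional[Dict[int, str]] = None


# Implicit values for fields that are not presence-tracked.
ZERO_DEFAULTS: Dict[str, Any] = {"int32": 0, "bool": False, "string": "", "enum": 0}


def enum_value(field: FieldSketch, number: int, use_names: bool) -> Any:
    """Use the enum name when it is known, otherwise fall back to the integer."""
    if use_names and field.enum_names and number in field.enum_names:
        return field.enum_names[number]
    return number


def apply_defaults(decoded: Dict[str, Any], fields: List[FieldSketch],
                   use_enum_names: bool = True) -> Dict[str, Any]:
    """Fill in values that were never transmitted but are still expected."""
    result: Dict[str, Any] = {}
    for field in fields:
        if field.name in decoded:
            value = decoded[field.name]
        elif field.has_presence:
            continue  # absent and presence-tracked: leave it unset
        else:
            value = ZERO_DEFAULTS[field.kind]
        if field.kind == "enum":
            value = enum_value(field, value, use_enum_names)
        result[field.name] = value
    return result


FIELDS = [
    FieldSketch("id", "int32"),
    FieldSketch("nickname", "string", has_presence=True),
    FieldSketch("status", "enum", enum_names={0: "UNKNOWN", 1: "ACTIVE"}),
]

empty_message = apply_defaults({}, FIELDS)
unnamed_enum = apply_defaults({"status": 7}, FIELDS)
```

With an empty payload, `id` and `status` still appear (as `0` and `"UNKNOWN"`), while the presence-tracked `nickname` stays absent. An enum number with no matching name is kept as an integer.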


mgravell added 11 commits on June 9, 2022 09:52:

- hook a visitor into protogen to implement schema-aware payload parsing (bffb223)
- add basic unit test (63b4ca9)
- support enums (e36d2c2)
- better handling of "repeated" (afc1627)
- support messages+repeated (52e13d7)
- split TextDecodeVisitor to a separate file (a5c83a9)
- whitespace (666d041)
- groups/enums (e5c29eb)
- use format-provider when noting unknown fiels (325026d)
- split code-gen vs generate (92009ea)
- object conversion visitor/test (a46f0ae)

mgravell added the enhancement label

SamiSadfa reviewed

src/protogen/Program.cs (outdated)


SamiSadfa commented Jun 15, 2022

I don't quite understand what "validation of command-line usage (rather than library usage)" means. Is it about input validation when a user runs the protogen.exe --decode command? Why "rather than library usage"?

Member Author

mgravell commented Jun 15, 2022

@SamiSadfa I simply mean: I haven't tested that the command-line scenario works at all, let alone that it does the correct thing. The main logic is in the library and is validated via unit tests, but the console exe needs validation too. In particular, that protogen --decode {something} gives similar behaviour and output to protoc --decode {something}

- Let both scenarios of parsing ROOT_TYPE for protogen --decode command (dccbd9f)

SamiSadfa commented Jun 15, 2022 (edited)

> @SamiSadfa I simply mean: I haven't tested that the command-line scenario works at all, let alone that it does the correct thing. The main logic is in the library and is validated via unit tests, but the console exe needs validation too. In particular, that protogen --decode {something} gives similar behaviour and output to protoc --decode {something}

protogen.exe output:

protoc.exe output:

It seems like it's working

SamiSadfa commented Jun 15, 2022

"support for packed primitive values (note this is based on reader.WireType being String for a primitive value (i.e. not string, bytes, group or message) and not on anything related to field)"

I am not sure where support for packed primitives should be implemented. Also, is there a relationship between packed primitives support and repeated fields? I am not well versed in Protobuf packed primitives. Could you point me to some documentation or code I should read in order to implement this?
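
For readers with the same question: in the protobuf wire format, packing applies only to repeated scalar numeric fields. A packed field arrives as one length-delimited record (wire type 2, which protobuf-net calls `WireType.String`) holding the values back to back. An unpacked field arrives as one tag/value pair per element. Parsers are expected to accept both forms. The Python sketch below illustrates this by choosing the branch from the wire type alone, as the PR note describes. It is not the protobuf-net code, and the function names are hypothetical. The sample bytes encode field 6 with the values 3, 270 and 86942.

```python
from typing import List, Tuple


def read_varint(buf: bytes, pos: int) -> Tuple[int, int]:
    """Decode a base-128 varint at pos; return (value, next position)."""
    result = shift = 0
    while True:
        byte = buf[pos]
        pos += 1
        result |= (byte & 0x7F) << shift
        if not byte & 0x80:
            return result, pos
        shift += 7


def decode_repeated_varints(payload: bytes, field_number: int) -> List[int]:
    """Collect a repeated varint field, whether it was sent packed or unpacked."""
    values: List[int] = []
    pos = 0
    while pos < len(payload):
        tag, pos = read_varint(payload, pos)
        number, wire_type = tag >> 3, tag & 7
        if wire_type == 0:
            # Unpacked: one tag per element.
            value, pos = read_varint(payload, pos)
            if number == field_number:
                values.append(value)
        elif wire_type == 2:
            # Length-delimited: for a primitive field this means "packed".
            length, pos = read_varint(payload, pos)
            end = pos + length
            if number == field_number:
                while pos < end:
                    value, pos = read_varint(payload, pos)
                    values.append(value)
            pos = end
        else:
            raise NotImplementedError(f"wire type {wire_type} is outside this sketch")
    return values


PACKED = bytes.fromhex("32" "06" "03" "8e02" "9ea705")      # one record, three values
UNPACKED = bytes.fromhex("3003" "308e02" "309ea705")        # three records

packed_values = decode_repeated_varints(PACKED, 6)
unpacked_values = decode_repeated_varints(UNPACKED, 6)
```

Both payloads decode to the same list, which is why the decision can be made from the wire type rather than from the field's declared `packed` option.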

SamiSadfa commented Jun 15, 2022

"what output (if any) for unknown fields?"
In a perfect scenario, this should not happen. With corrupted data (the bytes), or when a user decodes a payload against the wrong schema using protogen.exe, we can either skip the field or raise an exception. What do you think?
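
The two options raised here, skipping or raising, can be compared in a short Python sketch. It relies on the fact that the wire type in each tag is enough to step over a value even when the field number is not in the schema. This is an illustration only, not what the PR implements. `find_unknown_fields`, `UnknownFieldError`, and the `strict` flag are hypothetical names, and group wire types are left out.

```python
from typing import List, Set, Tuple


class UnknownFieldError(ValueError):
    """Raised in strict mode when a field number is not in the schema."""


def read_varint(buf: bytes, pos: int) -> Tuple[int, int]:
    """Decode a base-128 varint at pos; return (value, next position)."""
    result = shift = 0
    while True:
        byte = buf[pos]
        pos += 1
        result |= (byte & 0x7F) << shift
        if not byte & 0x80:
            return result, pos
        shift += 7


def skip_value(payload: bytes, pos: int, wire_type: int) -> int:
    """Step over one value using only its wire type."""
    if wire_type == 0:                      # varint
        _, pos = read_varint(payload, pos)
        return pos
    if wire_type == 1:                      # fixed 64-bit
        return pos + 8
    if wire_type == 2:                      # length-delimited
        length, pos = read_varint(payload, pos)
        return pos + length
    if wire_type == 5:                      # fixed 32-bit
        return pos + 4
    raise ValueError(f"unsupported wire type {wire_type}")


def find_unknown_fields(payload: bytes, known_numbers: Set[int],
                        strict: bool = False) -> List[Tuple[int, int]]:
    """Return (field number, wire type) for each unknown field, or raise if strict."""
    unknown: List[Tuple[int, int]] = []
    pos = 0
    while pos < len(payload):
        tag, pos = read_varint(payload, pos)
        number, wire_type = tag >> 3, tag & 7
        if number not in known_numbers:
            if strict:
                raise UnknownFieldError(f"field {number} is not in the schema")
            unknown.append((number, wire_type))
        pos = skip_value(payload, pos, wire_type)
    return unknown


# Field 1 = 150 (varint), field 2 = "testing" (length-delimited).
PAYLOAD = bytes.fromhex("089601" "120774657374696e67")
skipped = find_unknown_fields(PAYLOAD, known_numbers={1})
```

Collecting the unknown fields rather than silently dropping them leaves the caller free to print them, ignore them, or treat them as an error.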

mgravell added 11 commits on June 24, 2022 17:10:

- support packed primitives (96a9241)
- Merge branch 'protogen-decode' of https://github.com/protobuf-net/protobuf-net into protogen-decode (5f7dedb)
- Merge branch 'main' into protogen-decode (77ac667)
- allow name selectors in ObjectDecodeVisitor (fixes JSON need) (01a98d4)
- simplify json name usage; support enum names (4c0985f)
- apply default values (27d5aa1)
- handle enum defaults (a72936b)
- infs (562dd2c)
- refactor how ambient state is passed; implement most of maps (7a26ebb)
- simplify StepIn (43eef84)
- simplify map-key assignment (aff6509)


- add documentation for --decode command in protogen.exe (b6f9404)

mgravell mentioned this pull request

Deserialize with .proto schema provided at runtime #902

Open

mgravell added 5 commits on January 18, 2023 11:24:

- refactor and prepare for API (62fb05f)
- Merge branch 'main' into protogen-decode (a91b23b)
- merge main (a3efea9)
- add a decoder example that builds IDataRecord instances with shared schema data (fb6c55a)
- override TryGetObject (9df28d4)
