Conversation

@kho
Contributor

@kho kho commented Dec 8, 2025

What does this PR do?

This PR adds inputs_to_logits_ratio to LasrCTCConfig so that LasrForCTC can be used in an ASR pipeline with chunked decoding.
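For context, here is a minimal sketch of why a chunked CTC pipeline needs this ratio: chunk and stride lengths given in seconds have to be rounded to a whole number of output frames so that the logits from overlapping chunks can be trimmed and stitched cleanly. The helper names (`align_to_ratio`, `chunk_plan`) and the numeric values are hypothetical and only illustrate the arithmetic. They are not the LASR defaults or the pipeline's actual code.

```python
def align_to_ratio(num_samples: float, ratio: int) -> int:
    """Round a sample count to the nearest multiple of `ratio`."""
    return int(round(num_samples / ratio) * ratio)


def chunk_plan(chunk_length_s: float, stride_length_s: float,
               sampling_rate: int, inputs_to_logits_ratio: int) -> dict:
    """Convert chunk/stride durations into sample and logit-frame counts.

    `inputs_to_logits_ratio` is the number of input audio samples that
    correspond to one output logit frame.
    """
    chunk = align_to_ratio(chunk_length_s * sampling_rate, inputs_to_logits_ratio)
    stride = align_to_ratio(stride_length_s * sampling_rate, inputs_to_logits_ratio)
    return {
        "chunk_samples": chunk,
        "stride_samples": stride,
        # Because both values are multiples of the ratio, these divisions are exact.
        "chunk_frames": chunk // inputs_to_logits_ratio,
        "stride_frames": stride // inputs_to_logits_ratio,
    }


# Assumed values for illustration only: a 160-sample hop in the feature
# extractor and a 4x subsampling encoder would give a ratio of 640.
HYPOTHETICAL_RATIO = 160 * 4

plan = chunk_plan(chunk_length_s=30.0, stride_length_s=5.0,
                  sampling_rate=16_000,
                  inputs_to_logits_ratio=HYPOTHETICAL_RATIO)
print(plan)
```

Without a ratio on the config, a pipeline has no way to know how many samples map to one logit frame, so stride trimming could not be expressed in whole frames.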

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

@kho
Contributor Author

kho commented Dec 8, 2025

@eustlb make fixup is not passing yet because hop_length is not used in the modeling code.

@pcuenca pcuenca requested a review from eustlb December 10, 2025 09:11
@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: lasr

Contributor

@eustlb eustlb left a comment

hop_length should be accessed directly from the feature extractor. @kho can you confirm this suits your needs?

@kho
Contributor Author

kho commented Dec 10, 2025

Unfortunately this will break decoding with an LM. Perhaps we should just hardcode hop_length to a fixed value in LasrFeatureExtractor for now?

@kho
Contributor Author

kho commented Dec 10, 2025

@eustlb I created #42782 as a Plan B. What do you think?
