[Support Request] Investigate duplicate result submissions #8491

Open
DanielSass opened this issue Feb 11, 2025 · 3 comments

@DanielSass
Collaborator

ReportStream has been investigating duplicate submissions coming from SR bulk upload users (CDCgov/prime-reportstream#17232).

They noticed that an SR user appears to be submitting the same Test_Performed_Code (a Covid code) on every test result for a given multiplex test. Per Victor, the rows appear to be identical except for the description.

We would like to:
a) Verify that this field is not being populated by the application in a way that would cause these errors (Are we defaulting this code in an invalid way? Looking it up in our database and filling in bad data? Does our device lookup tool mislead users?)
b) Evaluate whether we can validate these codes further so that we catch these issues sooner
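
As a rough illustration of (b), one possible check is to group uploaded rows by every column except the description and flag any group with more than one row. This is only a sketch under stated assumptions: the column names (`patient_id`, `test_performed_code`, `test_description`), the placeholder code value, and the `find_near_duplicates` helper are hypothetical and are not taken from the SR bulk upload schema.

```python
import csv
import io
from collections import defaultdict


def find_near_duplicates(csv_text, ignore_columns):
    """Return groups of line numbers whose rows match on every column
    that is not listed in ignore_columns."""
    reader = csv.DictReader(io.StringIO(csv_text))
    compared = [name for name in (reader.fieldnames or []) if name not in ignore_columns]
    groups = defaultdict(list)
    # Data rows start at line 2 because line 1 is the header.
    for line_number, row in enumerate(reader, start=2):
        key = tuple((row.get(name) or "").strip().lower() for name in compared)
        groups[key].append(line_number)
    return [lines for lines in groups.values() if len(lines) > 1]


# Hypothetical upload: three rows that differ only in the description.
SAMPLE = """patient_id,test_performed_code,test_description
p1,EXAMPLE-COVID-CODE,COVID-19
p1,EXAMPLE-COVID-CODE,Flu A
p1,EXAMPLE-COVID-CODE,Flu B
p2,EXAMPLE-COVID-CODE,COVID-19
"""

print(find_near_duplicates(SAMPLE, ignore_columns={"test_description"}))
# prints [[2, 3, 4]]
```

A check like this would only surface suspicious rows for review; whether such rows should be rejected or merely warned about is a separate decision.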

@DanielSass DanielSass self-assigned this Feb 11, 2025
@mpbrown mpbrown assigned mpbrown and unassigned DanielSass Feb 12, 2025
@mpbrown
Collaborator

mpbrown commented Feb 14, 2025

This issue might be caused by unvalidated, user-provided test_ordered_code values. Awaiting a response from ReportStream on this Slack thread.

@mehansen
Collaborator

Looked at this with Dan. I think a user is trying to upload multiplex results using device data for a Covid-only device. They are using the same device manufacturer/model info and test performed LOINC on every row, but adding different disease names in a test name column. The test name column is not part of our schema, so we just pass the rows through to the Covid pipeline because the test_performed_code is a Covid LOINC.
Next steps are to:

  • @DanielSass to work with leads to notify user that the data they are uploading is incorrect
  • @mehansen to notify RS that the duplicates are due to user error and not a processing issue on our side
  • (potentially) SR team to add validation to bulk upload tool to disallow columns not in our schema
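
The last item could look something like the sketch below, which rejects an upload whose header row contains a column outside an allowed set. The `ALLOWED_COLUMNS` set, the column names in it, and the `unknown_columns` helper are assumptions made for illustration; they are not the actual bulk upload schema or code.

```python
import csv
import io

# Hypothetical subset of allowed headers; the real schema is not shown in this thread.
ALLOWED_COLUMNS = {
    "patient_id",
    "equipment_model_name",
    "test_performed_code",
    "test_result",
}


def unknown_columns(csv_text, allowed=ALLOWED_COLUMNS):
    """Return header names, in file order, that are not in the allowed set."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader, [])  # an empty upload yields an empty header
    return [name.strip() for name in header if name.strip().lower() not in allowed]


upload = "patient_id,equipment_model_name,test_performed_code,test_result,test_name\n"
extra = unknown_columns(upload)
if extra:
    print(f"Rejecting upload: unexpected column(s) {extra}")
```

Failing the whole file on an unknown header is the strictest option; returning a warning instead would be gentler on users who add their own bookkeeping columns.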

@mehansen
Collaborator

Notified RS via Slack thread here
