Update Google Docs Meta Data #1546

github-actions · 2024-10-07T15:28:43Z

Updating Google Docs Meta Data

addition of "Signal Set" column
addition of two chng signals: 7dav_inpatient_covid and 7dav_outpatient_covid
a bunch of fixes to extended ascii apostrophes and quotation marks (replaced with regular ascii equivalents)

The signal name for "covid_naat_pct_positive_7dav" was lost in an apparent accidental paste, but i fixed it here w/ a commit to the branch PR, and manually in the spreadsheet

sonarqubecloud · 2024-10-08T20:18:07Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud

melange396 · 2024-10-08T20:43:54Z

It turns out that there are still extended ascii chars in here (they are actually unicode chars)... They are findable by running:

from collections import defaultdict
highchars = defaultdict(int)
with open('db_signals.csv') as f:
    for line in f:
        for char in line:
            val = ord(char)
            if val>=127:
                highchars[val] += 1

the current db_signals.csv file gets the following results:

>>> highchars
defaultdict(<class 'int'>, {8220: 9, 8217: 30, 8221: 9})
>>> chr(8220)
'“'
>>> chr(8221)
'”'
>>> chr(8217)
'’'
>>>

I am not going to simply replace them in the file itself because of escaping concerns, so after merging this PR, i will replace them in the google spreadsheet and then run the csv sync utility (GH action) again.

melange396 · 2024-10-09T03:37:40Z

in case it helps someone in the future, heres some ugly code that i used to help compare the two versions of these files:

import csv

dev = []
with open('dev__db_signals.csv') as f:
    for r in csv.reader(f):
        dev.append(r)

new = []
with open('new__db_signals.csv') as f:
    for r in csv.reader(f):
        new.append(r)

def compare_rows(a, b):
    if len(a) != len(b):
        print("length mismatch")
    for i in range(len(a)):
        if a[i] != b[i]:
            print("    ", i, a[i].replace("\n", ""))
            print("    ", i, b[i].replace("\n", ""))

for i in range(len(dev)):
    offset = 0
    if i in (7,8):
        # skip added rows                                                                                                                                                                                          
        continue
    if i > 8:
        # account for added rows                                                                                                                                                                                   
        offset = 2
    n = new[i][:10] + new[i][11:] # skip added column @ index 10                                                                                                                                                   
    d = dev[i-offset]
    if n != d:
        print(i)
        compare_rows(n, d)

chore: update docs

aa30c1a

github-actions bot added the chore label Oct 7, 2024

github-actions bot assigned melange396 Oct 7, 2024

github-actions bot requested a review from melange396 October 7, 2024 15:28

fix lost 'covid_naat_pct_positive_7dav' signal name

97d6124

melange396 approved these changes Oct 8, 2024

View reviewed changes

melange396 merged commit a9a2535 into dev Oct 8, 2024
7 checks passed

melange396 deleted the bot/update-docs branch October 8, 2024 20:45

melange396 mentioned this pull request Oct 9, 2024

Properly decode UTF-8 from gsheet csv #1548

Merged

This was referenced Dec 5, 2024

Automate parts of metadata csv update comparison #1564

Open

Release Delphi Epidata 4.1.27 #1566

Merged

melange396 mentioned this pull request Feb 8, 2025

Update Google Docs Meta Data #1596

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update Google Docs Meta Data #1546

Update Google Docs Meta Data #1546

Uh oh!

github-actions bot commented Oct 7, 2024 •

edited by melange396

Loading

Uh oh!

sonarqubecloud bot commented Oct 8, 2024

Uh oh!

melange396 commented Oct 8, 2024

Uh oh!

Uh oh!

melange396 commented Oct 9, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Update Google Docs Meta Data #1546

Update Google Docs Meta Data #1546

Uh oh!

Conversation

github-actions bot commented Oct 7, 2024 • edited by melange396 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sonarqubecloud bot commented Oct 8, 2024

Quality Gate passed

Uh oh!

melange396 commented Oct 8, 2024

Uh oh!

Uh oh!

melange396 commented Oct 9, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

github-actions bot commented Oct 7, 2024 •

edited by melange396

Loading