It seems that when a gene symbol maps to multiple targets, duplicate rows appear in the tiga table.
For example, two targets correspond to CALCA / ENSG00000110680. The GWAS traits are associated with each of them (which seems appropriate), but each of those associations appears twice. CALCA is a good example; HLA-A has more targets and more duplicates.
SELECT
    tiga.*
FROM
    protein
    JOIN tiga ON protein.id = tiga.protein_id
WHERE
    protein.sym = 'CALCA';
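To make the duplication easier to see, a grouped count along the following lines might help. This is only a sketch: it assumes the tiga table has a trait identifier column, called `efoid` here, which is a guess on my part and may need to be swapped for the actual column name. Any row it returns is a target/trait pair that occurs more than once.

```sql
-- Sketch: count tiga rows per (target, trait) pair for CALCA.
-- ASSUMPTION: tiga.efoid is the trait identifier column; substitute the real name if it differs.
SELECT
    tiga.protein_id,
    tiga.efoid,
    COUNT(*) AS n_rows
FROM
    protein
    JOIN tiga ON protein.id = tiga.protein_id
WHERE
    protein.sym = 'CALCA'
GROUP BY
    tiga.protein_id,
    tiga.efoid
HAVING
    COUNT(*) > 1;  -- keep only pairs that appear more than once
```

If the duplicates are exact copies, every pair for both CALCA targets should come back with n_rows = 2. Changing the symbol to 'HLA-A' should show the same pattern across more targets.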