When GWAS hits are discussed, the nearest protein-coding gene is used as a kind of shorthand functional annotation for each significantly associated locus (if I understand correctly).
I’m wondering how many significant loci have been conclusively functionally linked to their nearest protein-coding gene, as opposed to more distant genes or intergenic regions.
Do we use this framework because the functional implications of variants in a protein-coding gene are the most feasible for wet lab biologists to study, as Benjamin Neale mentioned yesterday? Or does it capture some biological reality?