Rogue Scholar

Ontology Merging, Semantic Web, Semantic Mappings, Bioinformatics, Ontologies, Natural Sciences

Inference over Semantic Mappings with SeMRA

Published April 28, 2025

Author Charles Tapley Hoyt

Assembling and inferring missing semantic mappings is a timely problem in biomedical data and knowledge integration. I’ve been developing the Semantic Mapping Assembler and Reasoner (SeMRA) as a generic toolkit for this. In this blog post, I highlight its inference capabilities. SeMRA implements the chaining and inference rules described in the SSSOM specification.
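To give a feel for what chaining and inversion over mappings mean, here is a minimal sketch in plain Python. It does not use SeMRA's actual API: the function names, the triple representation, and the CURIEs `a:1`, `b:1`, and `c:1` are all hypothetical, and only the `skos:exactMatch` predicate is handled, on the assumption that it is symmetric and transitive.

```python
from itertools import product

# Mappings are modeled as (subject, predicate, object) triples of CURIE strings.
# This is an illustrative data model, not SeMRA's own.
EXACT = "skos:exactMatch"


def infer_inverses(mappings):
    """Return the flipped triples implied by the symmetry of exact matches."""
    return {(o, p, s) for s, p, o in mappings if p == EXACT} - set(mappings)


def infer_chains(mappings):
    """Return new triples implied by one step of chaining exact matches.

    If A exactMatch B and B exactMatch C, then A exactMatch C.
    """
    inferred = set()
    for (s1, p1, o1), (s2, p2, o2) in product(mappings, repeat=2):
        if p1 == p2 == EXACT and o1 == s2 and s1 != o2:
            inferred.add((s1, EXACT, o2))
    return inferred - set(mappings)


# Hypothetical identifiers, used only for illustration
mappings = {
    ("a:1", EXACT, "b:1"),
    ("b:1", EXACT, "c:1"),
}
chained = infer_chains(mappings)
inverted = infer_inverses(mappings)
```

Here `chained` holds the single new mapping from `a:1` to `c:1`, and `inverted` holds the two flipped mappings. A real reasoner would also need to track provenance and confidence for each inferred mapping and repeat the chaining step until no new mappings appear, both of which this sketch leaves out.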

Python, Mypy, Static Typing, Natural Sciences

I wish I could unpack Callables in Python type annotations

https://doi.org/10.59350/tcz2x-n4d84

Published April 23, 2025

Author Charles Tapley Hoyt

Following the theme of my previous two posts, I’ve run into another typing conundrum where I want to unpack a pre-existing Callable into a class with Generic[P, T], where P is a parameter specification variable (i.e., a ParamSpec). After figuring out the right way to declare a generic featuring a ParamSpec, I updated the class-resolver package to use the shiny new (and more accurate) annotations.

Python, Mypy, Static Typing, Natural Sciences

Using ParamSpec with Python Generics

https://doi.org/10.59350/a9srr-an019

Published April 22, 2025

Author Charles Tapley Hoyt

I’ve been working on applying strict static typing to my Python package class-resolver and ran into an interesting way of using generics in combination with parameter specification variables (i.e., ParamSpecs). Normally, if you want to type annotate a function, you use the Callable, which works like the following: from collections.abc import Callable #: the [int] represents a function that takes in a single integer, #: and returns a single

Python, Mypy, Static Typing, Natural Sciences

A dilemma with PEP-696 default generics when using optional static typing in Python

https://doi.org/10.59350/3zq9w-my741

Published April 19, 2025

Author Charles Tapley Hoyt

This post describes an issue I’ve had with writing correct types when using PEP-696 defaults in typing.TypeVar. I posted the exploration in a companion repository on GitHub. The motivation behind this comes from my work in biomedical data integration and the semantic web.

ChEBI, ChEMBL, UBERON, Experimental Factor Ontology, EFO, Natural Sciences

The EFO_ID column in ChEMBL’s drug indications table isn’t what you think it is

https://doi.org/10.59350/mmrpx-qda35

Published April 17, 2025

Author Charles Tapley Hoyt

ChEMBL periodically curates clinical trial information into its DRUG_INDICATION table. However, there are some weird inconsistencies in the way it references disease concepts in external vocabularies. This blog post is an exploration of that table.

Clinical Trials, ClinicalTrials.gov, Ontologies, OBI, ChEBI, Natural Sciences

Data Modeling and Integration with Clinical Trials

https://doi.org/10.59350/jkdtn-kgs07

Published January 23, 2025

Author Charles Tapley Hoyt

I’ve recently worked with clinical studies from ClinicalTrials.gov and other international registries.

Books, Natural Sciences

Books I Read in 2024

https://doi.org/10.59350/psy2m-adk43

Published January 18, 2025

Author Charles Tapley Hoyt

Here are the books I read in 2024. If I were Dudley Dursley, I’d be very upset that I read one fewer new book than in 2023. But then, I’d remember that I re-read a lot of Cosmere in 2024 to prepare for Wind and Truth, which was great.

Wikidata, Bibliometrics, Open Data, Natural Sciences

Exploring Event Venues in Wikidata

https://doi.org/10.59350/53dah-9vf82

Published January 17, 2025

Author Charles Tapley Hoyt

I was working on making data about scholarly conferences more FAIR and a big question crossed my mind: what are all the conference venues? This post is about some queries I wrote for Wikidata, data issues I found, a few drive-by curations that I did while looking for an answer, and my ideas for the future.

Funding, Open Source, Natural Sciences

Notes on Open Source Funding

https://doi.org/10.59350/eckhy-09r58

Published December 3, 2024

Author Charles Tapley Hoyt

This stub post contains my notes about funding for open source software. It doesn’t follow a story like a lot of my posts, and is more like an ever-evolving notes sheet.

Reading, Automation, Natural Sciences

Downloading Audio from Soundcloud

https://doi.org/10.59350/683zj-mfk55

Published December 3, 2024

Author Charles Tapley Hoyt

Brandon Sanderson has been releasing a few chapters a week of his upcoming novel, Wind and Truth, on his publisher’s website leading up to its December 6th release. This includes the audiobook chapters, but they’re posted to Soundcloud and there’s no good way to listen at 1.6x speed. This post is a note sheet on how to download audio from Soundcloud and prepare it for my audiobook reader.

Python, Packaging, Cookiecutter, Documentation, Natural Sciences

Dependency Groups and ReadTheDocs

https://doi.org/10.59350/3v5dd-78w25

Published November 19, 2024

Author Charles Tapley Hoyt

PEP 735 introduced dependency groups in packaging metadata. They are complementary to optional dependencies in that they might not correspond to features in the package, but rather be something like development or release dependencies. I am slowly working towards updating my cookiecutter template cookiecutter-snekpack to use PEP 735. So far, uv and tox have released support, so all that’s left is ReadTheDocs.

Biopragmatics
