Distributed, fault tolerant batch processing for Natural Language Applications and Search, using remote partitioning
-
Updated
Jan 13, 2023 - Java
Distributed, fault tolerant batch processing for Natural Language Applications and Search, using remote partitioning
A comprehensive Python package for healthcare data engineering, designed to extract, transform, and feature engineer patient data from CogStack-based EHR datalakes. Enables patient-level aggregation, longitudinal time series construction (up to 25 years retrospective), flexiblefeature engineering (biochemistry, demographics, medications, diagnoses
Add a description, image, and links to the cogstack topic page so that developers can more easily learn about it.
To associate your repository with the cogstack topic, visit your repo's landing page and select "manage topics."