run_pulls

processor.run_pulls()

Pull All Data

Usage

Runs the PHL, WDRS, and internal table pulls. Access to Azure KeyVault is required to retrieve the WDRS server information; see the README for details.

Returns: the entire table and the respnet tables; each returned object is listed below.

Returns

phl_df : pl.DataFrame: a Polars dataframe containing PHL data
received_submissions_df : pl.DataFrame: a Polars dataframe containing a receipt of the PHL data in JSON format
base_cols : list: a list of column names used throughout the process for easy reference
respnet : pl.DataFrame: a Polars dataframe containing joined respnet tables
respnet_wizard : pl.DataFrame: a Polars dataframe containing the respnet wizard table
respnet_investigation : pl.DataFrame: a Polars dataframe containing the respnet investigation table
con : duckdb.DuckDBPyConnection: a duckdb connection, used to query internal tables
net_drive : str: the network drive path
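
One way to picture the return values listed above is as attributes on a single result object. The sketch below is not the package's implementation: `PullResults` is a hypothetical stand-in, and every value is a placeholder so the example runs without WDRS or Azure KeyVault access. Only the field names and documented types come from the list above.

```python
from dataclasses import dataclass, fields
from typing import Any


@dataclass
class PullResults:
    """Hypothetical container mirroring the documented return values."""

    phl_df: Any                    # documented as pl.DataFrame
    received_submissions_df: Any   # documented as pl.DataFrame
    base_cols: list                # column names reused across processing
    respnet: Any                   # documented as pl.DataFrame (joined tables)
    respnet_wizard: Any            # documented as pl.DataFrame
    respnet_investigation: Any     # documented as pl.DataFrame
    con: Any                       # documented as duckdb.DuckDBPyConnection
    net_drive: str                 # network drive path


# Placeholder values only (assumption): the real pulls populate these fields.
result = PullResults(
    phl_df=None,
    received_submissions_df=None,
    base_cols=["col_a", "col_b"],
    respnet=None,
    respnet_wizard=None,
    respnet_investigation=None,
    con=None,
    net_drive="<network drive path>",
)

# List the documented attribute names in declaration order.
field_names = [f.name for f in fields(result)]
print(field_names)
```

Grouping the objects this way lets each one be read by name (for example `result.base_cols`) instead of by position in a tuple.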

Examples

Instantiate the class, which runs this function during initialization:

from wadoh_subtyping import processor

# ---- run processor ---- #
# the processor contains general pulls and objects used in all processing (WDRS pulls, duckdb pull, etc)
instance = processor.data_processor()

You can then access specific objects, such as individual tables read in during initialization:

result = processor.run_pulls()

respnet_table = result.respnet_table