run_pulls
processor.run_pulls()Pull All Data
Usage
Runs PHL, WDRS, and internal table pulls. You need access to Azure KeyVault to get the WDRS server info. See README for more info.
Returns: Entire table and respnet tables
Returns
phl_df : pl.DataFrame-
a Polars dataframe containing PHL data
received_submissions_df : pl.DataFrame-
a Polars dataframe containing a receipt of the PHL data in json format
base_cols : list-
a list of col names used throughout the process for easy reference
respnet : pl.DataFrame-
a Polars dataframe containing joined respnet tables
respnet_wizard : pl.DataFrame-
a Polars dataframe containing the respnet wizard table
respnet_investigation : pl.DataFrame-
a Polars dataframe containing the respnet investigation table
con : duckdb.DuckDBPyConnection-
a duckdb connection, used to query internal tables
net_drive : str-
the network drive path
Examples
Call in the class and it will execute this init function:
from wadoh_subtyping import processor
# ---- run processor ---- #
# the processor contains general pulls and objects used in all processing (WDRS pulls, duckdb pull, etc)
instance = processor.data_processor()Then you can call in specific objects if you want, like specific tables init read in:
result = processor.run_pulls()
respnet_table = result.respnet_table