run_pulls

processor.run_pulls()

Pull All Data

Usage

Runs PHL, WDRS, and internal table pulls. You need access to Azure KeyVault to get the WDRS server info. See README for more info.

Returns: Entire table and respnet tables

Returns

phl_df : pl.DataFrame

a Polars dataframe containing PHL data

received_submissions_df : pl.DataFrame

a Polars dataframe containing a receipt of the PHL data in json format

base_cols : list

a list of col names used throughout the process for easy reference

respnet : pl.DataFrame

a Polars dataframe containing joined respnet tables

respnet_wizard : pl.DataFrame

a Polars dataframe containing the respnet wizard table

respnet_investigation : pl.DataFrame

a Polars dataframe containing the respnet investigation table

con : duckdb.DuckDBPyConnection

a duckdb connection, used to query internal tables

net_drive : str

the network drive path

Examples

Call in the class and it will execute this init function:

from wadoh_subtyping import processor

# ---- run processor ---- #
# the processor contains general pulls and objects used in all processing (WDRS pulls, duckdb pull, etc)
instance = processor.data_processor()

Then you can call in specific objects if you want, like specific tables init read in:

result = processor.run_pulls()

respnet_table = result.respnet_table