wrangle_phl
processor.wrangle_phl(pull_res)Wrangle PHL
Usage
To be called after the read_pulls() function.
Parameters
pull_res :-
a class containing all the table pulls. see processor.read_pulls()
Returns
transformed_df : pl.DataFrame-
a Polars dataframe containing transformed PHL data
qa_issues : pl.DataFrame-
a Polars dataframe containing records with QA issues
no_qa_issues : pl.DataFrame-
a Polars dataframe with records that are cleared to send to fuzzy matching
submissions_to_fuzzy : pl.DataFrame-
records to be sent to fuzzy matching, like no_qa_issues but with only necessary columns
Examples
from src.subtype_link import processor
result = processor.run_pulls()Now call the wrangle_phl() function
wrangled_dfs = processor.wrangle_phl(pull_res=result)And you can get the dataframes like this:
wrangled_dfs.qa_issues
wrangled_dfs.no_qa_issues