wrangle_phl

processor.wrangle_phl(pull_res)

Wrangle PHL

Usage

To be called after the read_pulls() function.

Parameters

pull_res :

a class containing all the table pulls. see processor.read_pulls()

Returns

transformed_df : pl.DataFrame

a Polars dataframe containing transformed PHL data

qa_issues : pl.DataFrame

a Polars dataframe containing records with QA issues

no_qa_issues : pl.DataFrame

a Polars dataframe with records that are cleared to send to fuzzy matching

submissions_to_fuzzy : pl.DataFrame

records to be sent to fuzzy matching, like no_qa_issues but with only necessary columns

Examples

from src.subtype_link import processor

result = processor.run_pulls()

Now call the wrangle_phl() function

wrangled_dfs = processor.wrangle_phl(pull_res=result)

And you can get the dataframes like this:

wrangled_dfs.qa_issues

wrangled_dfs.no_qa_issues