Updates for issue 90 by xiaoliz0 · Pull Request #105 · InPreD/PRONTO

xiaoliz0 · 2026-04-22T13:51:24Z

Change filtering criteria:

Do NOT filter out pathogenic germline variants.
Always report TERT.

Genes that meet either of these two conditions are reported with the highest priority in the PRONTO report.
These "rescued" variants get a separate column, "Filter rescued", at the far right of the tables so that the rescue can be verified. Its values are "Yes" or empty.
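A minimal sketch of the rescue rule described above may help reviewers reason about it. It is not the PRONTO implementation: the `Gene`, `Origin` and `Depth` keys, the depth-based stand-in filter, and both function names are assumptions made up for illustration. Only the "Filter rescued" column and its "Yes"/empty values come from the request.

```python
def meets_rescue_condition(variant):
    """True for TERT variants and for pathogenic germline variants."""
    is_tert = variant.get("Gene") == "TERT"  # "Gene" is a hypothetical key
    is_pathogenic_germline = (
        variant.get("Origin") == "germline"  # "Origin" is a hypothetical key
        and variant.get("CPSR_ClinVar_class") == "Pathogenic"
    )
    return is_tert or is_pathogenic_germline


def select_variants(variants, passes_filters):
    """Keep variants that pass the filters or meet a rescue condition."""
    kept = []
    for variant in variants:
        rescue = meets_rescue_condition(variant)
        passed = passes_filters(variant)
        if not (rescue or passed):
            continue
        row = dict(variant)
        # Flag only the variants that would otherwise have been filtered out.
        row["Filter rescued"] = "Yes" if rescue and not passed else ""
        kept.append(row)
    # Stable sort: variants meeting a rescue condition come first.
    kept.sort(key=lambda row: not meets_rescue_condition(row))
    return kept


variants = [
    {"Gene": "KRAS", "Origin": "somatic", "Depth": 50},
    {"Gene": "TERT", "Origin": "somatic", "Depth": 5},
    {"Gene": "BRCA2", "Origin": "germline",
     "CPSR_ClinVar_class": "Pathogenic", "Depth": 3},
    {"Gene": "EGFR", "Origin": "somatic", "Depth": 2},
]
# The depth threshold is a placeholder for the real Filter sections.
report = select_variants(variants, passes_filters=lambda v: v["Depth"] >= 10)
```

In this toy input, EGFR is dropped, while TERT and BRCA2 are kept and flagged even though they fail the placeholder filter.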


xiaoliz0 · 2026-04-30T12:12:34Z

@marrip I just got a further request for this issue and have updated issue 90 accordingly. More code changes will follow soon.

marrip · 2026-04-30T12:26:11Z

> @marrip I just got a further request for this issue and have updated issue 90 accordingly. More code changes will follow soon.

ok, then I will wait with the review until you tell me to start ☺️


xiaoliz0 · 2026-05-04T07:48:02Z

> > @marrip I just got a further request for this issue and have updated issue 90 accordingly. More code changes will follow soon.
>
> ok, then I will wait with the review until you tell me to start ☺️

The new commits implement the further request. Feel free to review the code. @marrip :)

marrip · 2026-05-04T10:11:56Z

will start tomorrow at the latest 🙂

marrip

hey Xiaoli! I had a couple of questions and a suggestion. I am also working on a refactoring of some of the parts but need your input first ☺️ Will continue tomorrow.

marrip · 2026-05-05T12:03:40Z

+							output_table_file_config = output_file_preMTB_table_path + "_" + output_table + ".txt"
+							if(',' in filter_column):
+								for column in filter_column.split(','):
+									all_data = read_tsv(data_file_small_variant_table,column,key_word)


here it looks like you are always overwriting all_data by using the last item in filter_column in read_tsv. Is that desired behavior?

Hmm, it comes from the filter conditions in the Filter sections of the configuration file. Each section can define multiple filters.

I understand, but it seems you are looping through those and all_data will always be filtered according to the last item in that list - you don't seem to be saving the others or am I missing something?

In case of this:

    [FILTER0-1]
    ; Specify the column name need to be filtered:
    filter_column = CPSR_ACMG_class,CPSR_ClinVar_class

you will first apply CPSR_ACMG_class and then CPSR_ClinVar_class after but only the results from CPSR_ClinVar_class are saved in all_data. The filtering with CPSR_ACMG_class seems to not be considered.
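To make the concern concrete, here is a small self-contained sketch. `filter_rows` is a hypothetical stand-in for `read_tsv` (whose real behavior is not shown in this thread), and the rows are invented. It contrasts the loop pattern from the diff with one possible alternative that keeps the matches from every column. Whether the filters should be combined as a union or applied one after another is an open question for the author.

```python
def filter_rows(rows, column, key_word):
    """Hypothetical stand-in for read_tsv: keep rows whose column contains key_word."""
    return [row for row in rows if key_word in row.get(column, "")]


rows = [
    {"id": "v1", "CPSR_ACMG_class": "Pathogenic", "CPSR_ClinVar_class": "Benign"},
    {"id": "v2", "CPSR_ACMG_class": "Benign", "CPSR_ClinVar_class": "Pathogenic"},
    {"id": "v3", "CPSR_ACMG_class": "Benign", "CPSR_ClinVar_class": "Benign"},
]
filter_column = "CPSR_ACMG_class,CPSR_ClinVar_class"
key_word = "Pathogenic"

# Pattern from the diff: every iteration replaces all_data, so only the
# result for the last column survives the loop.
for column in filter_column.split(","):
    all_data = filter_rows(rows, column, key_word)
last_only_ids = [row["id"] for row in all_data]

# One possible alternative (an assumption about the intent): collect the
# matches from every column and skip rows that were already added.
combined = []
for column in filter_column.split(","):
    for row in filter_rows(rows, column, key_word):
        if row not in combined:
            combined.append(row)
combined_ids = [row["id"] for row in combined]
```

With these rows, the first loop ends up with `v2` only, while the accumulating version keeps both `v1` and `v2`.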

Thanks Martin! I will check this further. I currently have trouble logging in to the development server and need to resolve that first.

marrip · 2026-05-05T12:04:27Z

+					if(filter_section == "0"):
+						all_data_filter = []
+						top_filter = int(cfg.get("INPUT", "top_filter")) + 1
+						for top_filter_num in range(1,top_filter):	


I would like to refactor this section to make it easier to read ☺️

marrip · 2026-05-05T12:07:57Z

+							clear_blank_line(output_table_file_config_pre,output_table_file_config)
+							all_data_filter.append(all_data)
+
+						all_data_filter = sum(all_data_filter, [])


what does this do?
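For context on the question, `sum(list_of_lists, [])` is a standard Python idiom that flattens a list by one level: it starts from the empty list and concatenates each inner list in turn. The sketch below uses invented sample data and also shows `itertools.chain.from_iterable`, which gives the same result and is usually easier to read.

```python
from itertools import chain

# Invented sample: one list of rows per filter table.
all_data_filter = [["row1\n", "row2\n"], ["row3\n"], []]

# [] + ["row1\n", "row2\n"] + ["row3\n"] + []
flattened = sum(all_data_filter, [])

# Equivalent result without repeated list concatenation.
same_result = list(chain.from_iterable(all_data_filter))
```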

marrip · 2026-05-05T12:20:01Z

+								if(len(all_data_filter[i]) < header_length):
+									count = header_length - len(all_data_filter[i])
+									all_data_filter[i] = [[item.replace('\n', '') for item in cell] for cell in all_data_filter[i]]
+									all_data_filter[i].pop()
+									for j in range(1, count):
+										all_data_filter[i].append(' \t')
+									all_data_filter[i].append('\n')


could you explain to me what this section does? It replaces any \n, removes the last item and places empty fields in the table and finishes off with \n. Why is this necessary?
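If the intent of that section is to pad shorter rows so that every row has as many fields as the header, a simpler form could look like the sketch below. This is an assumption about the goal, not a reproduction of the diff. `pad_row`, the header names and the sample row are hypothetical.

```python
def pad_row(cells, header_length, fill=""):
    """Strip line breaks from each cell and right-pad the row with empty fields."""
    cleaned = [cell.replace("\n", "") for cell in cells]
    missing = max(0, header_length - len(cleaned))
    return cleaned + [fill] * missing


header = ["Gene", "Variant", "Depth", "Filter rescued"]  # hypothetical columns
row = ["TERT", "c.-124C>T\n"]

padded = pad_row(row, len(header))
# Join once at write time instead of storing '\t' and '\n' inside the cells.
line = "\t".join(padded) + "\n"
```

Note that in the diff, `range(1, count)` runs `count - 1` times, so it is worth checking that the number of appended fields matches what is intended.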

marrip · 2026-05-06T09:41:25Z

Looking at the remaining changes it seems that a lot of the fixes are to handle different column numbers of the combined tables, replacing tabs with linebreaks or vice versa and making data unique. I would suggest we rework this and use pandas instead which would make reading, filtering, combining and writing to file a lot easier. What do you think?
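As a rough illustration of the pandas suggestion, the sketch below reads two small tab-separated tables with different column counts, filters one of them on several columns, combines them, removes duplicates and writes the result back out as TSV. The table contents, the `Gene` column and the choice to deduplicate on it are assumptions for the example. The real PRONTO tables and rules may differ.

```python
import io

import pandas as pd

# Invented tables; the second one has an extra "Filter rescued" column.
tsv_a = (
    "Gene\tCPSR_ACMG_class\tCPSR_ClinVar_class\n"
    "TERT\tBenign\tBenign\n"
    "BRCA1\tPathogenic\tBenign\n"
)
tsv_b = (
    "Gene\tCPSR_ACMG_class\tCPSR_ClinVar_class\tFilter rescued\n"
    "BRCA1\tPathogenic\tBenign\tYes\n"
    "TP53\tBenign\tPathogenic\tYes\n"
)
table_a = pd.read_csv(io.StringIO(tsv_a), sep="\t", dtype=str)
table_b = pd.read_csv(io.StringIO(tsv_b), sep="\t", dtype=str)

# Keep rows where ANY of the configured columns matches the key word.
filter_columns = ["CPSR_ACMG_class", "CPSR_ClinVar_class"]
mask = table_b[filter_columns].eq("Pathogenic").any(axis=1)
filtered_b = table_b[mask]

# concat aligns columns by name, so differing column counts need no manual
# padding; missing fields become NaN and are replaced by empty strings.
combined = pd.concat([table_a, filtered_b], ignore_index=True).fillna("")
combined = combined.drop_duplicates(subset=["Gene"], keep="last")
combined = combined.reset_index(drop=True)

# Write tab-separated output (to a buffer here; a file path works the same way).
buffer = io.StringIO()
combined.to_csv(buffer, sep="\t", index=False)
output_text = buffer.getvalue()
```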


xiaoliz0 added 2 commits April 22, 2026 15:36

Remove some comment lines.

62d3f99

xiaoliz0 linked an issue Apr 22, 2026 that may be closed by this pull request

National request for data filter (big change) #90

Open

New updates from molecular biology group: The column "Filter_rescued"…

d3b2ccf

… should not appear in the tables appearing on the right of slides in report, but only printed this column in the summary table in slide 8.

xiaoliz0 requested review from marrip and tonjegul April 30, 2026 11:09

xiaoliz0 and others added 3 commits April 30, 2026 14:30

Merge branch 'main' into develop_issue90

2c29b0a

Implement new updates for the request: Only print last column for the…

170bfbe

… rescued variants which are not include in Filter0-3.(Last table in the report)

Merge branch 'develop_issue90' of https://github.com/InPreD/PRONTO in…

df6490e

…to develop_issue90

Fix some codes after merging. Fix some updated codes from Martin.

00b0738

marrip reviewed May 5, 2026


xiaoliz0 and others added 4 commits May 6, 2026 13:53

Update Script/PRONTO.py

59815d0

This is removed in main branch, not sure why it is existing here. Co-authored-by: Martin Rippin <74295098+marrip@users.noreply.github.com>

Add a fake sample for testing PRONTO new functions.

d831764

Merge branch 'develop_issue90' of https://github.com/InPreD/PRONTO in…

fc9187c

…to develop_issue90

Add files via upload

873c97a
