<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="methods-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">Gates Open Res</journal-id>
            <journal-title-group>
                <journal-title>Gates Open Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2572-4754</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/gatesopenres.13179.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Method Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Predicting TB treatment outcomes using baseline risk and treatment response markers: developing the PredictTB early treatment completion criteria</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 1 approved, 2 not approved]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Chen</surname>
                        <given-names>Ray Y.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-6344-1442</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Wang</surname>
                        <given-names>Jing</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Liang</surname>
                        <given-names>Lili</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a4">4</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Xie</surname>
                        <given-names>Yingda L.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-5587-4731</uri>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a5">5</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Malherbe</surname>
                        <given-names>Stephanus T.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a6">6</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Winter</surname>
                        <given-names>Jill</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a7">7</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Via</surname>
                        <given-names>Laura E.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Yu</surname>
                        <given-names>Xiang</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Vincent</surname>
                        <given-names>Joel</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Armstrong</surname>
                        <given-names>Derek</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a8">8</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Walzl</surname>
                        <given-names>Gerhard</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a6">6</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Alland</surname>
                        <given-names>David</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a5">5</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Barry 3rd</surname>
                        <given-names>Clifton E.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Dodd</surname>
                        <given-names>Lori E.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a9">9</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Tuberculosis Research Section, Laboratory of Clinical Immunology and Microbiology, Division of Intramural Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, 20892, USA</aff>
                <aff id="a2">
                    <label>2</label>Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Cape Town, South Africa</aff>
                <aff id="a3">
                    <label>3</label>Clinical Monitoring Research Program Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA</aff>
                <aff id="a4">
                    <label>4</label>Henan Provincial Chest Hospital, Zhengzhou, Henan, China</aff>
                <aff id="a5">
                    <label>5</label>Department of Medicine and the Public Health Research Institute, Rutgers, New Jersey Medical School, ICPH Building, Room 2232, 225 Warren Street, Newark, NJ, 07103, USA</aff>
                <aff id="a6">
                    <label>6</label>DST-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Sciences, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa</aff>
                <aff id="a7">
                    <label>7</label>Catalysis Foundation for Health, 2010 Crow Canyon Pl. STE 100, San Ramon, CA, 94583, USA</aff>
                <aff id="a8">
                    <label>8</label>Johns Hopkins University School of Medicine, Baltimore, MD, 21231, USA</aff>
                <aff id="a9">
                    <label>9</label>Biostatistics Research Branch, Division of Clinical Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:ray.chen@nih.gov">ray.chen@nih.gov</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>14</day>
                <month>10</month>
                <year>2020</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2020</year>
            </pub-date>
            <volume>4</volume>
            <elocation-id>157</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>1</day>
                    <month>10</month>
                    <year>2020</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2020 Chen RY et al.</copyright-statement>
                <copyright-year>2020</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
                <license>
                    <license-p>The author(s) is/are employees of the US Government and therefore domestic copyright protection in USA does not apply to this work. The work may be protected under the copyright laws of other jurisdictions when used in those jurisdictions.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://gatesopenresearch.org/articles/4-157/pdf"/>
            <abstract>
                <p>Standard treatment of drug-sensitive pulmonary tuberculosis requires six months of treatment. Several randomized clinical trials have attempted to shorten treatment to four months using various strategies but thus far all have failed. The PredictTB trial is an ongoing international randomized clinical trial testing a treatment shortening strategy whereby only drug-sensitive pulmonary TB patients who meet the study early treatment completion criteria are randomized to four vs. six months of treatment. The PredictTB early treatment completion criteria were developed based on a cohort of 92 pulmonary tuberculosis patients treated programmatically through the local tuberculosis treatment program in Cape Town, South Africa, with FDG-PET/CT scans also performed at baseline and week 4 of treatment. Patients were followed for one year after the end of therapy for programmatic treatment outcomes. This methodology paper describes how the PET/CT scans and GeneXpert cycle threshold data of this cohort were analyzed to develop the early treatment completion algorithm currently being used in the PredictTB trial.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>pulmonary tuberculosis</kwd>
                <kwd>drug sensitive</kwd>
                <kwd>predict tb</kwd>
                <kwd>PET/CT</kwd>
                <kwd>treatment shortening</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1">
                    <funding-source>National Cancer Institute</funding-source>
                </award-group>
                <award-group id="fund-2" xlink:href="http://dx.doi.org/10.13039/100006492">
                    <funding-source>Division of Intramural Research, National Institute of Allergy and Infectious Diseases</funding-source>
                </award-group>
                <award-group id="fund-3" xlink:href="http://dx.doi.org/10.13039/100000002">
                    <funding-source>National Institutes of Health</funding-source>
                </award-group>
                <award-group id="fund-4" xlink:href="http://www.gatesfoundation.org">
                    <funding-source>Gates Foundation</funding-source>
                    <award-id>OPP51919</award-id>
                    <award-id>OPP1155128</award-id>
                </award-group>
                <funding-statement>This study was supported in part by the Gates Foundation [OPP51919, OPP1155128], the National Cancer Institute, National Institutes of Health, under Contract No. 75N91019D00024, Task Order No. 75N91019F00130, and the Division of Intramural Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health. The content of this publication does not necessarily reflect the views or policies of the U.S. Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the U.S. Government.</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec>
            <title>Background</title>
            <p>Multiple clinical trials over the last 40 years attempting to shorten therapy for pulmonary, drug-sensitive tuberculosis from six to four months have failed
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup>. Despite this, the various four-month treatment arms consistently cured about 80&#x2013;85% of patients. The premise of the ongoing PredictTB clinical trial is that this sub-population of lower risk patients who are cured at four months can be identified prospectively
                <sup>
                    <xref ref-type="bibr" rid="ref-5">5</xref>
                </sup>. Two subsequent secondary analyses of the three fluoroquinolone treatment shortening trials
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup> support this premise by finding that a subset of lower risk participants, those without cavity on baseline chest-x-ray, acid-fast bacilli (AFB) smear less than 2+ at baseline, or AFB smear negative at month 2 of treatment were associated with successful four-month treatment outcomes
                <sup>
                    <xref ref-type="bibr" rid="ref-6">6</xref>,
                    <xref ref-type="bibr" rid="ref-7">7</xref>
                </sup>. Another trial prospectively attempted a similar risk stratification by shortening treatment for those without cavity on baseline chest x-ray, a measure of baseline risk,  and those with negative sputum cultures at two months of treatment, a measure of treatment response
                <sup>
                    <xref ref-type="bibr" rid="ref-8">8</xref>
                </sup>. Although this trial also failed, the treatment success rate in the four-month arm achieved 93%, higher than other four-month treatment trials that did not risk stratify participants. The PredictTB trial was developed on the hypothesis that more precise methods of evaluating baseline risk and treatment response will successfully identify lower risk participants cured with four months of treatment.</p>
            <p>The value of early chest x-ray changes to predict TB treatment outcomes has been recognized for over 60 years
                <sup>
                    <xref ref-type="bibr" rid="ref-9">9</xref>
                </sup>. Cavities on baseline chest x-ray
                <sup>
                    <xref ref-type="bibr" rid="ref-10">10</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-12">12</xref>
                </sup> and residual cavity at cure
                <sup>
                    <xref ref-type="bibr" rid="ref-13">13</xref>
                </sup> have been associated with poor treatment outcomes. Computed tomography scans are more sensitive than x-ray. Further, changes on 2-deoxy-2-[
                <sup>18</sup>F]fluoro-D-glucose (FDG)-positron emission tomography/computed tomography (PET/CT) scans have been associated with treatment outcomes in nonhuman primates
                <sup>
                    <xref ref-type="bibr" rid="ref-14">14</xref>,
                    <xref ref-type="bibr" rid="ref-15">15</xref>
                </sup>. Results in patients have been limited to small numbers due to the difficulty of collecting PET/CT imaging in this setting. However, a study of patients with pulmonary, multidrug-resistant tuberculosis (MDR-TB) showed a relationship with long-term treatment outcomes
                <sup>
                    <xref ref-type="bibr" rid="ref-16">16</xref>
                </sup>. In drug sensitive TB (DS-TB), distinct response patterns on PET/CT scan from baseline to month 6 on treatment correlated with treatment outcomes
                <sup>
                    <xref ref-type="bibr" rid="ref-17">17</xref>
                </sup>.</p>
            <p>Month 2 sputum culture conversion has traditionally been considered the best biomarker of treatment outcome available
                <sup>
                    <xref ref-type="bibr" rid="ref-18">18</xref>
                </sup> despite a meta-analysis demonstrating poor sensitivity and specificity in predicting outcomes
                <sup>
                    <xref ref-type="bibr" rid="ref-19">19</xref>
                </sup>. This has been confirmed by additional analyses of the REMoxTB treatment shortening trial
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>
                </sup>, where month 2 culture conversion status poorly predicted final outcomes
                <sup>
                    <xref ref-type="bibr" rid="ref-20">20</xref>
                </sup>. Time-delays associated with culture conversion tests can be as long as 6-8 weeks, resulting in delayed assessments of the patient&#x2019;s bacillary status. Immediately available test results are likely to predict outcomes better than delayed results because the results reflect the patient&#x2019;s current status rather than a historical status from 1-2 months ago and thus poor results can be acted upon immediately. The value of contemporaneous results has been shown with CD4 cell counts in HIV
                <sup>
                    <xref ref-type="bibr" rid="ref-21">21</xref>
                </sup>, as well as in TB where a month 6 culture conversion status predicted final outcomes significantly better than month 2 culture conversion status
                <sup>
                    <xref ref-type="bibr" rid="ref-22">22</xref>
                </sup>. A point-of-care test that measures TB bacterial load has the potential to predict outcomes better than a delayed culture conversion result.</p>
            <p>The aim of this analysis is to describe how the early treatment completion criteria used in the ongoing PredictTB clinical trial (NIH IRB #16IN133; 
                <ext-link ext-link-type="uri" xlink:href="https://clinicaltrials.gov/ct2/show/NCT02821832?term=NCT02821832&amp;draw=2&amp;rank=1">NCT02821832</ext-link>) were developed. In this analysis, we assess the relationship between measures of baseline risk and treatment response with poor treatment outcomes in a cohort of pulmonary DS-TB patients treated programmatically in South Africa. Baseline risk is assessed using quantitative markers from FDG-PET/CT scans at baseline. Early treatment response is assessed by changes in these markers on FDG-PET/CT scan at week 4 of treatment. Adherence is also assessed, as is a quantitative measure of residual bacterial load using sputum Xpert MTB/RIF cycle threshold at week 16.</p>
        </sec>
        <sec sec-type="methods">
            <title>Methods</title>
            <sec>
                <title>Dataset</title>
                <p>The dataset we used to develop our algorithm were the PET/CT scans (DICOM format), the Xpert MTB/RIF cycle threshold values, and the final treatment outcomes from a cohort of 92 pulmonary DS-TB patients treated programmatically in Cape Town, South Africa with PET/CT scans performed at baseline, week 4, and at the end-of-treatment (Catalysis cohort; end-of-treatment scans not used in this analysis)
                    <sup>
                        <xref ref-type="bibr" rid="ref-17">17</xref>
                    </sup>. Ninety-nine participants were included in the original study but seven participants did not have a complete set of PET/CT scans, Xpert MTB/RIF cycle threshold results, and treatment outcomes available. Data were collected under written informed consent and the study was reviewed by the Stellenbosch University ethics committee (approval number N10/01/013). The Catalysis cohort dataset used for this analysis was de-identified and no additional ethical approval was required. We developed a risk stratification algorithm for the PredictTB trial
                    <sup>
                        <xref ref-type="bibr" rid="ref-5">5</xref>
                    </sup> that predicted participants who would successfully complete TB treatment early at four months (compared to the standard six months) with the following aims: 1) capture all treatment failure and recurrent TB patients as high risk; and 2) stratify 50% of all patients as low-risk. Although previous four-month treatment trials consistently cured 80-85% of patients, we conservatively lowered this estimate to target 50% of patients as eligible for treatment shortening. Patients included in this analysis from the Catalysis cohort were contacted at &#x2265;1 year after treatment completion for final treatment outcomes, including cures, treatment failures, and retreatments. Retreatment outcomes were defined programmatically as patients who restarted TB treatment for any reason and may include true relapses, re-infections, or nontuberculous infections with symptoms that mimic TB. Culture confirmation was not routinely obtained nor was mycobacterial genetic strain-typing performed to differentiate relapse from re-infection.</p>
            </sec>
            <sec>
                <title>Criteria development and rationale</title>
                <p>We patterned our risk stratification algorithm on the Johnson 
                    <italic toggle="yes">et al.</italic> trial that, although was stopped early by its Data and Safety Monitoring Board as a failure, increased the treatment success rate in the four-month arm to 93%
                    <sup>
                        <xref ref-type="bibr" rid="ref-8">8</xref>
                    </sup>. This study used a measure of baseline disease burden (cavity on baseline chest x-ray) as well as a measure of treatment response (sputum culture conversion at week 8). For baseline disease burden, we used disease severity measured on PET/CT scan. For treatment response, we measured change in disease severity on the week 4 PET/CT scan. As we developed the specific risk stratification algorithm thresholds, it became clear that we would not be able to accomplish both aims simultaneously. Any algorithm sensitive enough to capture all unfavorable outcomes as high risk was poorly specific, with well below 50% remaining as low risk. Any algorithm specific enough to capture 50% as low risk was not sensitive enough to capture all or nearly all unfavorable outcomes as high risk. We realized that we could not capture all treatment failure and retreatment patients as high risk in our risk stratification algorithm because some patients who failed may have failed due to patient-related factors, such as poor treatment adherence, that we could not predict because we only had adherence data based on monthly pill counts, which can be inaccurate
                    <sup>
                        <xref ref-type="bibr" rid="ref-23">23</xref>,
                        <xref ref-type="bibr" rid="ref-24">24</xref>
                    </sup>. For example, a patient with less severe baseline disease and a good response after one month of treatment and therefore predicted to be low risk may subsequently fail treatment due to poor adherence after the initial month of therapy. Retreatment TB patients were even more complex because, in addition to the possibility of poor adherence, retreatment TB was defined as programmatic restart of TB treatment by the local TB clinic for any reason. In most cases, retreated patients were not confirmed as true TB by culture, which also prevented strain typing to differentiate relapse with the same strain from re-infection with a different strain. Re-infection contributed to about 50% of recurrent TB cases in a previous analysis from Cape Town
                    <sup>
                        <xref ref-type="bibr" rid="ref-25">25</xref>
                    </sup> and may not be differentiated from relapse in an algorithm based on radiology. A rigorous analysis of risk criteria would have required a larger prospective dataset of cures, treatment failures, and confirmed relapses with PET/CT scans to facilitate model training, testing and validation. Because of these limitations of the Catalysis dataset for our purposes, particularly our lack of clearly defined poor treatment outcomes, we shifted our aims to emphasize: 1) stratifying about 50% of the cohort as low-risk and eligible for treatment shortening, while 2) capturing as many treatment failures over retreatments as possible because failures were felt to be more reliably determined than retreatments.</p>
            </sec>
            <sec>
                <title>Measurement of criteria</title>
                <p>Each PET/CT scan was read using MIM software version 6 (MIM Software Inc, Cleveland, Ohio USA; freely available alternatives include ITK-SNAP, 3D Slicer, and MeVisLab), with all diseased areas of the lung included in regions of interest (ROI). Data exported from each scan included cavity air measurements and Hounsfield unit (HU) histograms of volumes of each ROI on CT scans and total lesion glycolysis (TLG) for each ROI on PET scans. Hounsfield units are a measure of density, with air about -1000 HU, normal lung around -700 to -950 HU, water at 0 HU, and bone ranging from +500 to +1000 HU. TB lesion density ranges from near normal lung to about +200 HU, with dense consolidations measuring around -100 to +200 HU. As there are few other densities in this &#x201c;hard&#x201d; region in normal lung (for example, blood vessels), the volumes measured in this range almost completely represent TB lesion density so we focused on this &#x201c;hard&#x201d; HU range for this analysis (compared to &#x201c;softer&#x201d; TB lesions below  100 HU). For PET scans, we determined total lesion glycolysis (TLG) in diseased lung regions, which is a measure of the amount of FDG uptake and is calculated by the mean standardized uptake value in each lung region multiplied by the volume of that region. However, neither CT hard volume nor PET TLG have previously been validated as markers of treatment outcome in TB, whereas previous studies have found chest x-ray cavity size to correlate with unfavorable outcomes
                    <sup>
                        <xref ref-type="bibr" rid="ref-10">10</xref>&#x2013;
                        <xref ref-type="bibr" rid="ref-12">12</xref>
                    </sup>. Therefore, for this analysis, cavity air volume measured at baseline and change in cavity air at week 4 were weighted more heavily than CT hard volume and PET TLG.</p>
                <p>In addition to PET/CT scan quantitation, two other variables were incorporated into the risk stratification algorithm. Instead of the month 2 sputum culture traditionally used to predict treatment outcome, the Xpert MTB/RIF cycle threshold assay at week 16 was included in our early treatment completion algorithm. Xpert MTB/RIF correlates well with sputum smear and culture results, with excellent sensitivity but poor specificity
                    <sup>
                        <xref ref-type="bibr" rid="ref-26">26</xref>
                    </sup>. When the assay cycle threshold is incorporated, the balance between sensitivity and specificity improves correlations with sputum smear
                    <sup>
                        <xref ref-type="bibr" rid="ref-27">27</xref>,
                        <xref ref-type="bibr" rid="ref-28">28</xref>
                    </sup> or culture
                    <sup>
                        <xref ref-type="bibr" rid="ref-29">29</xref>
                    </sup>. We applied this test at week 16 as a measure of residual bacterial load at the time of potential treatment completion because this is a point-of-care test with immediate results. The second variable incorporated at week 16 is an adherence dose count requirement of about 90% (minimum 100 out of possible 112 doses [7 doses/week x 16 weeks]) because at least 90% has been correlated with better treatment outcomes
                    <sup>
                        <xref ref-type="bibr" rid="ref-6">6</xref>,
                        <xref ref-type="bibr" rid="ref-30">30</xref>
                    </sup>.</p>
            </sec>
            <sec>
                <title>Statistical analysis</title>
                <p>	Statistical analyses were conducted in R (version 3.6.1).  Primary analyses compared tested imaging markers (cavity air, hard volume, and TLG) measured at baseline and one-month after treatment initiation between cures vs failures (and retreatments) using Wilcoxon rank-sum tests. As an exploratory analysis with limited statistical power, statistical significance was defined by p &lt;0.05, without adjustment for multiplicity. Non-parametric receiver operating characteristic (ROC) curves were generated using R packages, pROC and ROCR
                    <sup>
                        <xref ref-type="bibr" rid="ref-31">31</xref>,
                        <xref ref-type="bibr" rid="ref-32">32</xref>
                    </sup>. Sensitivity and specificity estimates were computed as binomial proportions, along with 95% confidence intervals using a normal approximation.</p>
            </sec>
        </sec>
        <sec sec-type="results">
            <title>Results</title>
            <p>Among the 92 patients that we analyzed from the Catalysis cohort, 73 were cured (asymptomatic two years after the end of treatment), eight failed treatment, and 11 programmatically restarted TB treatment during follow-up. For the PET/CT imaging analysis, the baseline and week 4 PET/CT scans were each read by a single reader, with overall summary statistics presented in 
                <xref ref-type="table" rid="T1">Table 1</xref>. At baseline, cured patients were significantly different from treatment failure patients in CT cavity air volume, with CT hard volume and PET TLG differences being borderline significant (P=0.059 for both). At week 4, only the difference in total cavity air remained significantly different. In contrast, the results for patients who were cures and retreatments were not significantly different from each other in any parameter at baseline or at week 4, making it very difficult to differentiate these two cohorts using these parameters. The comparison of treatment failures with retreatments was similar to that of treatment failures with cures. We therefore developed our criteria based primarily on differences between the cured and failure cohorts.</p>
            <table-wrap id="T1" orientation="portrait" position="anchor">
                <label>Table 1. </label>
                <caption>
                    <title>Summary statistics of PET/CT scan read results at baseline and change at week 4.</title>
                    <p>Wilcoxon rank-sum test was performed to assess the difference in image features by outcome groups.</p>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th colspan="2" rowspan="1"/>
                            <th align="center" colspan="1" rowspan="1">All
                                <break/>(n=92)
                                <break/>Median (IQR)</th>
                            <th align="center" colspan="1" rowspan="1">Cure
                                <break/>(n=73)
                                <break/>Median (IQR)</th>
                            <th align="center" colspan="1" rowspan="1">Failure
                                <break/>(n=8)
                                <break/>Median (IQR)</th>
                            <th align="center" colspan="1" rowspan="1">Retreatment
                                <break/>(n=11)
                                <break/>Median (IQR)</th>
                            <th align="center" colspan="1" rowspan="1">P-value
                                <break/>(Cure
                                <break/>vs.
                                <break/>Failure)</th>
                            <th align="center" colspan="1" rowspan="1">P-value
                                <break/>(Cure vs.
                                <break/>Retreatment)</th>
                            <th align="center" colspan="1" rowspan="1">P-value (Failure vs.
                                <break/>Retreatment)</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="4">Baseline</td>
                            <td align="left" colspan="1" rowspan="1">Largest cavity
                                <break/>air volume (mL)</td>
                            <td align="center" colspan="1" rowspan="1">7.2 (1.7, 17.4)</td>
                            <td align="center" colspan="1" rowspan="1">7.2 (1.8, 16.6)</td>
                            <td align="center" colspan="1" rowspan="1">32.1 (12.9, 58.2)</td>
                            <td align="center" colspan="1" rowspan="1">5.1 (0.6, 7.4)</td>
                            <td align="center" colspan="1" rowspan="1">0.008</td>
                            <td align="center" colspan="1" rowspan="1">0.212</td>
                            <td align="center" colspan="1" rowspan="1">0.004</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Total cavity air
                                <break/>volume (mL)</td>
                            <td align="center" colspan="1" rowspan="1">7.2 (1.8, 20.9)</td>
                            <td align="center" colspan="1" rowspan="1">7.2 (1.9, 19.6)</td>
                            <td align="center" colspan="1" rowspan="1">36.4 (19.8, 73.7)</td>
                            <td align="center" colspan="1" rowspan="1">5.1 (0.7, 7.4)</td>
                            <td align="center" colspan="1" rowspan="1">0.004</td>
                            <td align="center" colspan="1" rowspan="1">0.189</td>
                            <td align="center" colspan="1" rowspan="1">0.003</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">CT hard
                                <break/>volume (mL)</td>
                            <td align="center" colspan="1" rowspan="1">56.5 (32.7, 111.4)</td>
                            <td align="center" colspan="1" rowspan="1">56.2 (31.1, 109.7)</td>
                            <td align="center" colspan="1" rowspan="1">142.3 (67.5, 157.8)</td>
                            <td align="center" colspan="1" rowspan="1">51.1 (33.1, 67.6)</td>
                            <td align="center" colspan="1" rowspan="1">0.059</td>
                            <td align="center" colspan="1" rowspan="1">0.353</td>
                            <td align="center" colspan="1" rowspan="1">0.012</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">PET total lesion
                                <break/>glycolysis</td>
                            <td align="center" colspan="1" rowspan="1">522.6 (292.2, 995.4)</td>
                            <td align="center" colspan="1" rowspan="1">519.5 (279.1, 947.4)</td>
                            <td align="center" colspan="1" rowspan="1">1127.2 (732.1, 1469.2)</td>
                            <td align="center" colspan="1" rowspan="1">451.9 (349.0, 568.8)</td>
                            <td align="center" colspan="1" rowspan="1">0.059</td>
                            <td align="center" colspan="1" rowspan="1">0.419</td>
                            <td align="center" colspan="1" rowspan="1">0.051</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="4">Percent
                                <break/>change at
                                <break/>week 4</td>
                            <td align="left" colspan="1" rowspan="1">Largest cavity
                                <break/>air volume (mL)</td>
                            <td align="center" colspan="1" rowspan="1">-61.8 (-81.0, -32.2)</td>
                            <td align="center" colspan="1" rowspan="1">-63.0 (-81.0, -34.8)</td>
                            <td align="center" colspan="1" rowspan="1">-39.6 (-53.9, -24.7)</td>
                            <td align="center" colspan="1" rowspan="1">-63.4 (-90.4, -16.3)</td>
                            <td align="center" colspan="1" rowspan="1">0.065</td>
                            <td align="center" colspan="1" rowspan="1">0.851</td>
                            <td align="center" colspan="1" rowspan="1">0.321</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Total cavity air
                                <break/>volume (mL)</td>
                            <td align="center" colspan="1" rowspan="1">-61.2 (-78.4, -33.6)</td>
                            <td align="center" colspan="1" rowspan="1">-63.3 (-78.8, -38.6)</td>
                            <td align="center" colspan="1" rowspan="1">-40.1 (-51.2, -20.3)</td>
                            <td align="center" colspan="1" rowspan="1">-63.4 (-89.0, -16.3)</td>
                            <td align="center" colspan="1" rowspan="1">0.033</td>
                            <td align="center" colspan="1" rowspan="1">0.825</td>
                            <td align="center" colspan="1" rowspan="1">0.167</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">CT hard
                                <break/>volume (mL)</td>
                            <td align="center" colspan="1" rowspan="1">-16.7 (-31.1, -4.9)</td>
                            <td align="center" colspan="1" rowspan="1">-17.2 (-34.4, -4.8)</td>
                            <td align="center" colspan="1" rowspan="1">-8.2 (-22.6, -0.4)</td>
                            <td align="center" colspan="1" rowspan="1">-21.4 (-29.6, -6.4)</td>
                            <td align="center" colspan="1" rowspan="1">0.376</td>
                            <td align="center" colspan="1" rowspan="1">0.724</td>
                            <td align="center" colspan="1" rowspan="1">0.310</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">PET total lesion
                                <break/>glycolysis</td>
                            <td align="center" colspan="1" rowspan="1">-17.7 (-31.3, -2.6)</td>
                            <td align="center" colspan="1" rowspan="1">-20.3 (-31.7, -8.1)</td>
                            <td align="center" colspan="1" rowspan="1">-8.6 (-21.6, 16.7)</td>
                            <td align="center" colspan="1" rowspan="1">-8.0 (-22.1, -2.0)</td>
                            <td align="center" colspan="1" rowspan="1">0.125</td>
                            <td align="center" colspan="1" rowspan="1">0.308</td>
                            <td align="center" colspan="1" rowspan="1">0.545</td>
                        </tr>
                    </tbody>
                </table>
                <table-wrap-foot>
                    <fn>
                        <p>PET, positron emission tomography; CT, computed tomography; IQR, interquartile range.</p>
                    </fn>
                </table-wrap-foot>
            </table-wrap>
            <p>To identify specific thresholds that predicted cure vs treatment failure, ROC curves were drawn for each variable (
                <xref ref-type="fig" rid="f1">Figure 1</xref>). When using the optimal ROC thresholds of all baseline and week 4 PET/CT criteria into a single algorithm, the combined criteria predicted cure with 100% sensitivity, capturing all eight failures and 11 retreatments as high risk. However, specificity was very poor at 19.2% and only 14/81 (17.3%) of subjects with both baseline and week 4 PET/CT scans were classified as low risk (
                <xref ref-type="table" rid="T2">Table 2</xref>). This is well below our target of 50%, classifying too many as false positive high risk, resulting in an algorithm that is neither practical nor scalable. We therefore adjusted the thresholds to be more specific at the cost of sensitivity to approach our 50% target.</p>
            <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                <label>Figure 1. </label>
                <caption>
                    <title>ROC curves of radiological biomarkers in predicting failure vs. cure.</title>
                    <p>Panel 
                        <bold>A</bold>&#x2013;
                        <bold>D</bold>: baseline; Panel 
                        <bold>E</bold>&#x2013;
                        <bold>H</bold>: %change of week 4 from baseline; AUCs along with 95% CIs were added as blue text at the bottom of each plot; optimal threshold and the corresponding sensitivity and specificity were labeled on the curve. ROC, receiver operating characteristic; AUC, area-under-the-curve; CI, confidence intervals; TLG, total lesion glycolysis.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14379/0667647c-a512-40b6-8c0e-fd64f8eb2436_figure1.gif"/>
            </fig>
            <table-wrap id="T2" orientation="portrait" position="anchor">
                <label>Table 2. </label>
                <caption>
                    <title>Using optimal AUC cutoffs of imaging features to predict Failure vs. Cure (N=81).</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th colspan="1" rowspan="1"/>
                            <th align="center" colspan="1" rowspan="1">Sensitivity (95% CI)</th>
                            <th align="center" colspan="1" rowspan="1">Specificity (95% CI)</th>
                            <th align="center" colspan="1" rowspan="1">% of Total Subjects</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline largest cavity air &lt;10.5ml</td>
                            <td align="center" colspan="1" rowspan="1">75% (34.9%, 96.8%)</td>
                            <td align="center" colspan="1" rowspan="1">63% (50.9%, 74%)</td>
                            <td align="center" colspan="1" rowspan="1">48/81 (59.3%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline hard &lt;135.6ml</td>
                            <td align="center" colspan="1" rowspan="1">62.5% (24.5%, 91.5%)</td>
                            <td align="center" colspan="1" rowspan="1">79.5% (68.4%, 88%)</td>
                            <td align="center" colspan="1" rowspan="1">61/81 (75.3%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline TLG &lt;876.8</td>
                            <td align="center" colspan="1" rowspan="1">62.5% (24.5%, 91.5%)</td>
                            <td align="center" colspan="1" rowspan="1">68.5% (56.6%, 78.9%)</td>
                            <td align="center" colspan="1" rowspan="1">53/81 (65.4%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline combined criteria</td>
                            <td align="center" colspan="1" rowspan="1">87.5% (47.3%, 99.7%)</td>
                            <td align="center" colspan="1" rowspan="1">53.4% (41.4%, 65.2%)</td>
                            <td align="center" colspan="1" rowspan="1">40/81 (49.4%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline largest cavity air &lt;10.5ml &amp; %change
                                <break/>&lt;-64%</td>
                            <td align="center" colspan="1" rowspan="1">100% (63.1%, 100%)</td>
                            <td align="center" colspan="1" rowspan="1">39.7% (28.5%, 51.9%)</td>
                            <td align="center" colspan="1" rowspan="1">29/81 (35.8%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline hard &lt;135.6ml &amp; %change &lt;-10.7%</td>
                            <td align="center" colspan="1" rowspan="1">87.5% (47.3%, 99.7%)</td>
                            <td align="center" colspan="1" rowspan="1">53.4% (41.4%, 65.2%)</td>
                            <td align="center" colspan="1" rowspan="1">40/81 (49.4%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline TLG &lt;876.8 &amp; %change &lt;-10.7%</td>
                            <td align="center" colspan="1" rowspan="1">87.5% (47.3%, 99.7%)</td>
                            <td align="center" colspan="1" rowspan="1">49.3% (37.4%, 61.3%)</td>
                            <td align="center" colspan="1" rowspan="1">37/81 (45.7%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Final combined criteria</td>
                            <td align="center" colspan="1" rowspan="1">100% (63.1%, 100%)</td>
                            <td align="center" colspan="1" rowspan="1">19.2% (10.9%, 30.1%)</td>
                            <td align="center" colspan="1" rowspan="1">14/81 (17.3%)</td>
                        </tr>
                    </tbody>
                </table>
                <table-wrap-foot>
                    <fn>
                        <p>AUC, area-under-the-curve; CI, confidence interval; TLG, total lesion glycolysis.</p>
                    </fn>
                </table-wrap-foot>
            </table-wrap>
            <p>Because prior clinical trials data already demonstrated that baseline cavity was a risk factor for poor outcomes
                <sup>
                    <xref ref-type="bibr" rid="ref-8">8</xref>,
                    <xref ref-type="bibr" rid="ref-10">10</xref>
                </sup>, and because cavity size was the strongest predictor of poor outcome in our ROC curves (cavity air area-under-the-curve (AUC) &gt; CT hard volume and PET TLG AUCs), we built the algorithm around this parameter first. We defined cavity as largest single cavity size rather than total cavity size because we posited that a single large cavity induced higher risk and may take longer to heal than multiple smaller cavities (i.e. one 30 mL cavity had a higher risk for poor outcome than two 15 mL cavities). In examining the baseline cavity size threshold, a 10.5 mL threshold captured 6/8 (75%) of treatment failures as high risk but only 46/73 (63.0%) cures as low risk. By increasing the threshold to 30 mL, the algorithm would miss one additional failure (now only 5/8 [62.5%] as high risk; increasing beyond 30 mL would lose more than one additional failure) but specificity would increase to 86.3% (63/73 cures now classified as low risk; 
                <xref ref-type="table" rid="T3">Table 3</xref>). Overall, using 30 mL instead of 10.5 mL as the baseline cavity threshold increases the proportion of cured (N=73) and failure (N=8) patients defined as low risk from 48/81 (59.3%) to 66/81 (81.5%). Changing the week 4 cavity volume reduction threshold from 64% to 20% results in a similar sensitivity/specificity tradeoff. After applying both baseline and week 4 cavity change thresholds, only 29/81 (35.8%) would have been classified as low risk (before adding any other imaging criteria) using the 64% threshold but 58/81 (71.6%) were low risk with the 20% cavity reduction threshold (
                <xref ref-type="table" rid="T3">Table 3</xref>). In contrast to prior data on the risk from baseline cavities, quantitation of CT disease volumes and PET TLG has not previously been validated. Therefore, weighing cavity size to threshold about half (28.4% of total) of the target 50% as high risk seemed appropriate, allowing the remaining criteria (CT hard volume, PET TLG, Xpert cycle threshold, and adherence) to threshold the other half.</p>
            <table-wrap id="T3" orientation="portrait" position="anchor">
                <label>Table 3. </label>
                <caption>
                    <title>Compare different cavity air thresholds to predict Failure vs. Cure (N=81).</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1"/>
                            <th align="center" colspan="1" rowspan="1">Sensitivity (95% CI)</th>
                            <th align="center" colspan="1" rowspan="1">Specificity (95% CI)</th>
                            <th align="center" colspan="1" rowspan="1">% of Total Subjects</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline cavity air &lt;30ml</td>
                            <td align="center" colspan="1" rowspan="1">62.5% (24.5%, 91.5%)</td>
                            <td align="center" colspan="1" rowspan="1">86.3% (76.2%, 93.2%)</td>
                            <td align="center" colspan="1" rowspan="1">66/81 (81.5%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline cavity air &lt;10.5ml</td>
                            <td align="center" colspan="1" rowspan="1">75% (34.9%, 96.8%)</td>
                            <td align="center" colspan="1" rowspan="1">63% (50.9%, 74%)</td>
                            <td align="center" colspan="1" rowspan="1">48/81 (59.3%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline cavity air &lt;30ml &amp; %change cavity air &lt;-20%
                                <break/>(Predict baseline and one-month cavity criteria)</td>
                            <td align="center" colspan="1" rowspan="1">75% (34.9%, 96.8%)</td>
                            <td align="center" colspan="1" rowspan="1">76.7% (65.4%, 85.8%)</td>
                            <td align="center" colspan="1" rowspan="1">58/81 (71.6%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline cavity air &lt;30ml &amp; %change cavity air &lt;-64%</td>
                            <td align="center" colspan="1" rowspan="1">100% (63.1%, 100%)</td>
                            <td align="center" colspan="1" rowspan="1">54.8% (42.7%, 66.5%)</td>
                            <td align="center" colspan="1" rowspan="1">40/81 (49.4%)</td>
                        </tr>
                    </tbody>
                </table>
                <table-wrap-foot>
                    <fn>
                        <p>CI, confidence interval.</p>
                    </fn>
                </table-wrap-foot>
            </table-wrap>
            <p>Similar to the cavity size thresholds, the cutoffs for hard CT volume and PET TLG were adjusted by decreasing sensitivity but increasing specificity from the optimal ROC parameters (
                <xref ref-type="fig" rid="f1">Figure 1</xref>, 
                <xref ref-type="table" rid="T2">Table 2</xref>) to stratify about 50% of the total cohort as low risk, aiming to capture as many treatment failure and retreatment patients as possible within the 50%. 
                <xref ref-type="fig" rid="f2">Figure 2</xref> demonstrates the patient stratifications when baseline CT hard volume &lt;200 mL and TLG &lt;1500 units were used as low risk criteria at baseline. When combined with largest cavity air &lt;30 mL, 60/81 (74.1%) patients were classified as low risk at baseline (
                <xref ref-type="table" rid="T4">Table 4</xref>). When applied to the entire Catalysis cohort, including the retreatment patients, 6/8 (75%) failures but only 1/11 retreatments were classified as high risk at baseline, suggesting that treatment failure patients may be more correlated with severity of baseline disease compared to retreatment TB patients. Week 4 change criteria for CT hard volume and PET TLG were similarly adjusted and allowed for slight increases to account for potential paradoxical treatment responses that were ultimately still favorable. The final week 4 criteria selected allowed up to a 10% increase in hard volume and 30% increase in TLG at week 4 to remain low risk, resulting in 46/81 (56.8%) classified as low risk after applying both baseline and week 4 PET/CT criteria (
                <xref ref-type="table" rid="T4">Table 4</xref>). The week 4 criteria captured only one additional failure but two additional retreatments, suggesting that retreatment TB patients may be more correlated with poor treatment response at week 4 rather than severity of disease at baseline.</p>
            <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                <label>Figure 2. </label>
                <caption>
                    <title>Hard volume vs. total lesion glycolysis (TLG).</title>
                    <p>Left: baseline; Right: % change at four weeks from baseline. Left plot: Six failures and one retreated were caught by baseline criteria (five failures and one retreated have cavity air &gt;=30 and two have TLG&gt;1500); Right plot: Two retreated subjects were selected by week 4 cavity air criteria (decrease of cavity air &lt; 20%). Two additional retreated cases and one failure were caught by Week 16 Xpert &lt;30.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14379/0667647c-a512-40b6-8c0e-fd64f8eb2436_figure2.gif"/>
            </fig>
            <table-wrap id="T4" orientation="portrait" position="anchor">
                <label>Table 4. </label>
                <caption>
                    <title>Radiological markers to predict Cure vs Failure (N=81).</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="center" colspan="1" rowspan="1"/>
                            <th align="center" colspan="1" rowspan="1">Sensitivity (95% CI)</th>
                            <th align="center" colspan="1" rowspan="1">Specificity (95% CI)</th>
                            <th align="center" colspan="1" rowspan="1">% of Total Subjects</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline cavity criteria</td>
                            <td align="center" colspan="1" rowspan="1">62.5% (24.5%, 91.5%)</td>
                            <td align="center" colspan="1" rowspan="1">86.3% (76.2%, 93.2%)</td>
                            <td align="center" colspan="1" rowspan="1">66/81 (81.5%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline hard criteria</td>
                            <td align="center" colspan="1" rowspan="1">0% (0%, 36.9%)</td>
                            <td align="center" colspan="1" rowspan="1">91.8% (83%, 96.9%)</td>
                            <td align="center" colspan="1" rowspan="1">75/81 (92.6%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline TLG criteria</td>
                            <td align="center" colspan="1" rowspan="1">25% (3.2%, 65.1%)</td>
                            <td align="center" colspan="1" rowspan="1">93.2% (84.7%, 97.7%)</td>
                            <td align="center" colspan="1" rowspan="1">74/81 (91.4%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline Predict criteria</td>
                            <td align="center" colspan="1" rowspan="1">75% (34.9%, 96.8%)</td>
                            <td align="center" colspan="1" rowspan="1">79.5% (68.4%, 88%)</td>
                            <td align="center" colspan="1" rowspan="1">60/81 (74.1%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline and one-month cavity criteria</td>
                            <td align="center" colspan="1" rowspan="1">75% (34.9%, 96.8%)</td>
                            <td align="center" colspan="1" rowspan="1">76.7% (65.4%, 85.8%)</td>
                            <td align="center" colspan="1" rowspan="1">58/81 (71.6%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline and one-month hard criteria</td>
                            <td align="center" colspan="1" rowspan="1">25% (3.2%, 65.1%)</td>
                            <td align="center" colspan="1" rowspan="1">79.5% (68.4%, 88%)</td>
                            <td align="center" colspan="1" rowspan="1">64/81 (79%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Baseline and one-month TLG criteria</td>
                            <td align="center" colspan="1" rowspan="1">37.5% (8.5%, 75.5%)</td>
                            <td align="center" colspan="1" rowspan="1">84.9% (74.6%, 92.2%)</td>
                            <td align="center" colspan="1" rowspan="1">67/81 (82.7%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Final Predict Radiological criteria</td>
                            <td align="center" colspan="1" rowspan="1">87.5% (47.3%, 99.7%)</td>
                            <td align="center" colspan="1" rowspan="1">61.6% (49.5%, 72.8%)</td>
                            <td align="center" colspan="1" rowspan="1">46/81 (56.8%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Final Predict criteria (+Week16 Xpert)</td>
                            <td align="center" colspan="1" rowspan="1">87.5% (47.3%, 99.7%)</td>
                            <td align="center" colspan="1" rowspan="1">54.8% (42.7%, 66.5%)</td>
                            <td align="center" colspan="1" rowspan="1">41/81 (50.6%)</td>
                        </tr>
                    </tbody>
                </table>
                <table-wrap-foot>
                    <fn>
                        <p>CI, confidence interval; TLG, total lesion glycolysis.</p>
                    </fn>
                </table-wrap-foot>
            </table-wrap>
            <p>	Finally, we included a measure of residual TB bacterial load in sputum in the early treatment completion criteria based on an analysis that Xpert cycle threshold around 30 at weeks 8 and 24 correlated with culture negativity and patient treatment outcomes
                <sup>
                    <xref ref-type="bibr" rid="ref-29">29</xref>
                </sup>. We incorporated this measure at week 16 as a safety mechanism to ensure that participants with higher sputum bacterial load (cycle threshold &lt;30) did not stop treatment early. Combining the baseline and week 4 criteria stratified 41/81 (50.6%) of all cured and failure patients as low risk (
                <xref ref-type="table" rid="T4">Table 4</xref>). When applied to the entire Catalysis cohort, including retreatment patients, 1/8 (12.5%) failures and 6/11 (54.5%) retreatments were captured as low risk. The PredictTB early treatment completion criteria when the trial started is shown in 
                <xref ref-type="table" rid="T5a">Table 5a</xref>. </p>
            <table-wrap id="T5a" orientation="portrait" position="anchor">
                <label>Table 5a. </label>
                <caption>
                    <title>Predict TB early treatment completion criteria at the start of the trial.</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1">Early completion
                                <break/>criteria:</th>
                            <th align="left" colspan="1" rowspan="1">Determined at Week 16 &#x2013; unless known to have failed a radiologic criterion at baseline or
                                <break/>week 4.</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Radiologic criteria</td>
                            <td align="left" colspan="1" rowspan="1">Baseline PET/CT:
                                <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;No total lung collapse of a single side, AND
                                <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;No pleural effusion, AND
                                <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;No single cavity air volume on CT scan &gt;30 mL, AND
                                <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;CT scan hard volume (-100 to +100 HU density) &lt;200 mL, AND
                                <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;PET total lesion glycolysis &lt;1500 units
                                <break/>Week 4 PET/CT:
                                <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;All individual cavities decrease by &gt;20% (unless cavity &lt;2 mL), AND
                                <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;CT scan hard volume does not increase by &gt;10% unless the increase is &lt;5 mL, AND
                                <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;PET total lesion glycolysis does not increase by &gt;30% unless the increase is &lt;50 units</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Bacterial load criterion</td>
                            <td align="left" colspan="1" rowspan="1">Week 16 Xpert cycle threshold &#x2265;30
                                <xref ref-type="other" rid="tf1">*</xref>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Adherence criterion</td>
                            <td align="left" colspan="1" rowspan="1">Minimum of 100 doses received by week 16</td>
                        </tr>
                    </tbody>
                </table>
                <table-wrap-foot>
                    <fn>
                        <p id="tf1">*If the week 16 solid medium sputum culture is subsequently found to be positive for 
                            <italic toggle="yes">Mtb</italic> in a participant randomized to Arm B or C, this participant will be called in for evaluation and to provide sputum for a repeat culture. If the initial positive culture is confirmed by a second culture positive for 
                            <italic toggle="yes">Mtb</italic>, this participant will be considered to have met the study endpoint as a treatment failure and will be referred for continued treatment.</p>
                        <p>TB, tuberculosis; PET, positron emission tomography; CT, computed tomography; HU, Hounsfield unit.</p>
                    </fn>
                </table-wrap-foot>
            </table-wrap>
            <p>The initial criteria were established as described above, acknowledging that early changes may be needed once the trial began if the actual proportion of PredictTB study participants stratified to the low- and high-risk arms were not close to the 50:50 target. Indeed, after about nine months of enrollment, only 23.4% of participants to reach week 16 were stratified as low risk (Arms B and C), with the remainder stratified to the high-risk arm (Arm A). This was less than half of the estimated 50% we expected to be low risk, which had major implications for the cost and duration of the study (total sample size, study duration, and cost would need to be increased to achieve required sample size in Arms B and C) as well as the scientific relevance of the study (if trial successful, it would only apply to less than 25% of TB patients, diminishing relevance). After discussion with our study Data and Safety Monitoring Board (DSMB), we revised our study early treatment completion criteria.</p>
            <sec>
                <title>Revising the early treatment completion criteria</title>
                <p>We considered how to change both the Xpert cycle threshold cutoff and the PET/CT radiology thresholds. For the Xpert cycle threshold, the original cutoff at week 16 was based on a cohort study in South Africa with MGIT culture results, the only data available to us at the time. We adopted a stringent cycle threshold value of 30 based upon analysis of these data for subjects to be randomized to Arms B and C. Xpert detects bacterial DNA but does not determine the viability of detected DNA (i.e., detected bacteria may be dead). For PredictTB, however, LJ culture is used to determine primary study outcomes. For this re-analysis, we received unpublished results from TBTC study 29, which collected cycle threshold values and LJ culture results (Rada Savic, personal communication). In evaluating the change, we considered the chance of missing an LJ+ result, as well as the sensitivity and specificity of various cycle threshold cutoffs. In contrast to positive and negative predictive values, sensitivity and specificity do not depend on the underlying proportion of culture positive results, which varies over time and from study to study. That said, patient safety was a driving factor so we considered how many positive cultures might be missed for various cutoffs. This was defined as the probability of being LJ+ given a Xpert cycle threshold value less than the cutoff, i.e., P(LJ+ | Ct-). We assumed what we considered were high proportions of LJ+ cultures (i.e., 10% and 5% at week 16 of treatment in the lower risk cohort of arm B/C) when making this decision. In contrast to TBTC study 29, which randomized all-comers and did not stratify participants by risk, the PredictTB study further excludes poorly adherent participants, those with too severe disease at baseline, or those not responding appropriately to treatment at one month. As a result, the expected LJ+ rates of 10% and 5% were considered to be very high. 
                    <xref ref-type="table" rid="T6">Table 6</xref> describes these proportions for the sensitivity and specificity estimates from TBTC study 29. Based on these estimates, a cycle threshold of 30 was expected to miss 2.1% of LJ+ results, while a threshold of 28 would miss 2.5%, assuming a 10% LJ+ rate. This translated to an increase in less than one participant being missed among those randomized to arm C. That is, if the background LJ+ rate was 10%, 3.3 (of 155 randomized to stop treatment at week 16) true LJ+ participants may be missed with a cycle threshold of 30, and 3.9 may be missed with a cycle threshold of 28. If the underlying LJ+ rate was 5%, this becomes 1.6 missed LJ+ participants with cycle threshold 30 and 1.9 missed LJ+ participants with cycle threshold 28. If the true underlying LJ+ rate was even lower (as we would expect it to be), the difference between the two cycle threshold values becomes even smaller. Of the 12 participants already enrolled in the PredictTB study with week 16 Xpert cycle threshold results at the time of this analysis, eight had negative results, two had cycle thresholds below 28 (18.2 and 25.2) and two had cycle thresholds above 28 (28.4 and 28.5). Thus changing the Xpert cycle threshold cutoff from 30 to 28 would potentially (depending on radiology criteria) have retained an additional two participants in arms B and C.</p>
                <p>To further correct the arm imbalance, we also changed the baseline and week 4 radiologic criteria. Prior studies validated that cavity on baseline CXR is a risk factor for treatment relapse. In our analyses of prior data, cavity size was also the strongest factor in predicting poor treatment outcome so we did not adjust this criterion. The data for CT hard volume and PET TLG as risk factors for poor treatment outcomes, however, were weak. 
                    <xref ref-type="fig" rid="f3a">Figure 3a</xref> shows the distribution of participants stratified to Arm A at baseline by the original radiology criteria. The numbers in the circles represent the number of participants that fell into arm A according to the defined criteria. The hard volume and total activity criteria were relatively well correlated in capturing participants, with only five participants moved to Arm A based on a single criterion, hard volume or PET TLG. Therefore, instead of arbitrarily increasing the hard volume and PET TLG cutoffs, we changed the criteria from requiring both hard volume 
                    <bold>AND</bold> total activity to be below the thresholds to be considered low risk, to only requiring one criterion. That is, participants with either hard volume 
                    <bold>OR</bold> total activity below the threshold at both baseline and week 4 would be considered low risk. The thresholds themselves did not change. Applying this change to the PET/CT criteria results in the revised Venn diagram in 
                    <xref ref-type="fig" rid="f3b">Figure 3b</xref>, which is the same as 
                    <xref ref-type="fig" rid="f3a">Figure 3a</xref> except for the five participants moved to Arm A based on hard volume or PET activity alone are no longer considered high risk. The revised early treatment completion criteria incorporating both Xpert cycle threshold and radiologic criteria changes are shown in 
                    <xref ref-type="table" rid="T5b">Table 5b</xref>.</p>
                <fig fig-type="figure" id="f3a" orientation="portrait" position="float">
                    <label>Figure 3a. </label>
                    <caption>
                        <title>Venn diagram of the 
                            <italic toggle="yes">original</italic> baseline PET/CT criteria by which participants were stratified to Arm A.</title>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14379/0667647c-a512-40b6-8c0e-fd64f8eb2436_figure3a.gif"/>
                </fig>
                <fig fig-type="figure" id="f3b" orientation="portrait" position="float">
                    <label>Figure 3b. </label>
                    <caption>
                        <title>Venn diagram of the 
                            <italic toggle="yes">revised</italic> baseline PET/CT criteria by which participants are stratified to Arm A.</title>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14379/0667647c-a512-40b6-8c0e-fd64f8eb2436_figure3b.gif"/>
                </fig>
                <table-wrap id="T5b" orientation="portrait" position="anchor">
                    <label>Table 5b. </label>
                    <caption>
                        <title>Amended Predict TB early treatment completion criteria to correct arm imbalance between Arm A and Arms B/C.</title>
                        <p>Changes are highlighted in yellow.</p>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1">Early completion
                                    <break/>criteria:</th>
                                <th align="left" colspan="1" rowspan="1">Determined at Week 16 &#x2013; unless known to have failed a radiologic criterion at baseline or week 4.</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">Radiologic criteria</td>
                                <td align="left" colspan="1" rowspan="1">Baseline PET/CT:
                                    <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;No total lung collapse of a single side, AND
                                    <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;No pleural effusion, AND
                                    <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;No single cavity air volume on CT scan &gt;30 mL, AND
                                    <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;CT scan hard volume (-100 to +100 HU density) &lt;200 mL 
                                    <styled-content style="#FFFF00" style-type="background-color">OR</styled-content> PET total lesion glycolysis &lt;1500 units
                                    <break/>Week 4 PET/CT:
                                    <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;All individual cavities decrease by &gt;20% (unless cavity &lt;2 mL), AND
                                    <break/>&#x00a0;&#x00a0;&#x00a0;&#x2022;&#x00a0;&#x00a0;CT scan hard volume does not increase by &gt;10% unless the increase is &lt;5 mL 
                                    <styled-content style="#FFFF00" style-type="background-color">OR</styled-content> PET total lesion
                                    <break/>glycolysis does not increase by &gt;30% unless the increase is &lt;50 units</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">Bacterial load criterion</td>
                                <td align="left" colspan="1" rowspan="1">Week 16 Xpert cycle threshold &#x2265;
                                    <styled-content style="#FFFF00" style-type="background-color">28</styled-content>
                                    <xref ref-type="other" rid="tf2">*</xref>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">Adherence criterion</td>
                                <td align="left" colspan="1" rowspan="1">Minimum of 100 doses received by week 16</td>
                            </tr>
                        </tbody>
                    </table>
                    <table-wrap-foot>
                        <fn>
                            <p id="tf2">*If the week 16 solid medium sputum culture is subsequently found to be positive for 
                                <italic toggle="yes">Mtb</italic> in a participant randomized to Arm B or C, this participant will be called in for evaluation and to provide sputum for a repeat culture. If the initial positive culture is confirmed by a second culture positive for 
                                <italic toggle="yes">Mtb</italic>, this participant will be considered to have met the study endpoint as a treatment failure and will be referred for continued treatment.</p>
                            <p>TB, tuberculosis; PET, positron emission tomography; CT, computed tomography; HU, Hounsfield unit.</p>
                        </fn>
                    </table-wrap-foot>
                </table-wrap>
                <table-wrap id="T6" orientation="portrait" position="anchor">
                    <label>Table 6. </label>
                    <caption>
                        <title>Sensitivity and specificity estimates from TBTC study 29 for various Xpert cycle threshold cutoffs, along with estimates of missed LJ+ and missed LJ- results for assumed (week 16) culture-positivity rates of 10% and 5%.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="center" colspan="1" rowspan="1" valign="top">Xpert cycle
                                    <break/>threshold
                                    <break/>Cutoff</th>
                                <th align="center" colspan="1" rowspan="1" valign="top">Sensitivity:
                                    <break/>P(Ct&lt;c|LJ+)</th>
                                <th align="center" colspan="1" rowspan="1" valign="top">Specificity:
                                    <break/>P(Ct&gt;c|LJ-)</th>
                                <th align="center" colspan="1" rowspan="1" valign="top">Chance of missed
                                    <break/>LJ+ P(LJ+|Ct-)
                                    <break/>with 10% LJ+ rate</th>
                                <th align="center" colspan="1" rowspan="1" valign="top">Chance of missed LJ+
                                    <break/>P(LJ+|Ct-)
                                    <break/>with 5% LJ+ rate</th>
                                <th align="center" colspan="1" rowspan="1" valign="top">Chance of missed LJ+
                                    <break/>P(LJ+|Ct-)
                                    <break/>with 2.5% LJ+ rate</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="center" colspan="1" rowspan="1" valign="top">31</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.93</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.43</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.018</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.008</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.004</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1" valign="top">30</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.91</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.46</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.021</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.010</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.005</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1" valign="top">29</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.89</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.49</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.024</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.012</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.006</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1" valign="top">28</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.88</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.52</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.025</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.012</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.006</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1" valign="top">27</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.86</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.55</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.028</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.013</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.006</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1" valign="top">26</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.84</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.60</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.029</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.014</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.007</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1" valign="top">25</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.79</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.66</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.034</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.016</td>
                                <td align="center" colspan="1" rowspan="1" valign="top">0.008</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>The revised early treatment completion criteria were accepted by the NIAID DSMB on March 16, 2018 and implemented after local regulatory approvals at the Henan, China sites on May 19, 2018 and at the Western Cape, South Africa sites on June 15, 2018. Only 12 (3.9%) of the total sample size of 310 to the low risk arms were recruited under the original early treatment completion criteria. The revised criteria re-balanced the arm proportions very nicely, approaching 50:50 in Arms A and B/C. The data used to develop both the original and revised early treatment completion criteria are deposited on 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.7910/DVN/97HYQ5">Harvard Dataverse</ext-link> (see 
                    <italic toggle="yes">Data availability</italic>).</p>
            </sec>
        </sec>
        <sec sec-type="discussion">
            <title>Discussion</title>
            <p>	Previously conducted treatment shortening studies for DS-TB suggested that approximately 80-85% of patients are cured with four months of treatment
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup>. Shortening treatment only in lower risk participants who had no cavity on baseline chest x-ray and had sputum culture converted to negative by two months of treatment resulted in the four-month treatment success proportion increasing to 93% in one trial but this was still significantly worse than six months of treatment
                <sup>
                    <xref ref-type="bibr" rid="ref-8">8</xref>
                </sup>. The PredictTB trial tests an alternate risk stratification criteria based on FDG-PET/CT disease burden at baseline, the change in PET/CT disease burden at week 4 of treatment, and a marker of residual bacterial load and adherence dose count at the end of treatment, hypothesizing that this combination will identify patients with tuberculosis who are cured with four months of standard treatment
                <sup>
                    <xref ref-type="bibr" rid="ref-5">5</xref>
                </sup>. Risk signatures based on transcriptomics have recently been shown to correlate with treatment outcomes
                <sup>
                    <xref ref-type="bibr" rid="ref-33">33</xref>,
                    <xref ref-type="bibr" rid="ref-34">34</xref>
                </sup>.</p>
            <p>The development of the PredictTB early treatment completion criteria was based on a cohort of 92 DS-TB patients programmatically treated in Cape Town, South Africa on whom we had PET/CT scans at baseline and week 4 of treatment, Xpert cycle threshold data, and programmatic treatment outcomes (Catalysis cohort). Because these patients were treated programmatically, treatment was not directly observed and we were thus unable to determine the proportion of treatment failures due to poor adherence or differentiate true relapsed disease patients from those re-infected. The lack of these data confounded our attempts to develop early treatment completion criteria that captured treatment failure and true relapse patients with meaningful sensitivity and specificity. We resorted to developing criteria that stratified about 50% of patients as high risk, trusting that the most severely diseased patients at baseline, those with poor treatment responses at week 4, and those under the Xpert cycle threshold cutoff at week 16 were captured as higher risk and therefore not eligible for treatment shortening.</p>
            <p>A major limitation of developing our algorithm was the lack of sufficient relapse data to validate our early treatment completion criteria. This limitation is challenging to overcome, given the small numbers of available patient data anywhere with microbiological, strain-type confirmed relapses and the even smaller numbers of these with FDG-PET/CT scan data. We acknowledge the risk of overfitting our data (i.e., producing a risk model that may not be generalizable because it was fit only to the data on which it was developed) and in fact, after the study started, it became clear that our criteria were too conservative, stratifying &gt;75% of participants as high risk and therefore not eligible for treatment shortening. Without immediate correction, we would likely run out of funding before the end of the trial due to the increased total sample size needed to achieve the required lower risk cohort sample size. Even worse, we would end up with a trial result that was applicable only to the 20&#x2013;25% of patients stratified to the lower risk arms and therefore not relevant to the majority TB patients. Our amended early treatment completion criteria, however, have been stratifying participants at roughly 50:50 to the high- vs. low-risk arms.</p>
            <p>	The PredictTB trial early treatment completion criteria were developed to identify those with the most severe disease at baseline (potentially at higher risk for treatment failure) and with a poor week 4 treatment response (potentially at higher risk of relapse), along with a marker of residual bacterial load and an adherence dose count at treatment completion. These criteria are currently stratifying about 50% of patients to the higher risk arm and 50% of patients to the two lower risk arms, which is the target goal. Whether or not this will successfully identify a lower risk cohort that can be successfully cured with four months of standard therapy awaits the results of the trial, expected in 2022.</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <sec>
                <title>Underlying data</title>
                <p>Harvard Dataverse: Replication Data for PredictTB Early Treatment Completion Criteria. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.7910/DVN/97HYQ5">https://doi.org/10.7910/DVN/97HYQ5</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-35">35</xref>
                    </sup>.</p>
                <p>Data are available under the terms of the 
                    <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/publicdomain/zero/1.0/">Creative Commons Zero "No rights reserved" data waiver</ext-link> (CC0 1.0 Public domain dedication).</p>
            </sec>
        </sec>
    </body>
    <back>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Fox</surname>
                            <given-names>W</given-names>
                        </name>
</person-group>:
                    <article-title>Whither short-course chemotherapy?</article-title>
                    <source>

                        <italic toggle="yes">Br J Dis Chest.</italic>
</source>
                    <year>1981</year>;<volume>75</volume>(<issue>4</issue>):<fpage>331</fpage>&#x2013;<lpage>57</lpage>.
                    <pub-id pub-id-type="pmid">7030377</pub-id>
                    <pub-id pub-id-type="doi">10.1016/0007-0971(81)90022-x</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gillespie</surname>
                            <given-names>SH</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Crook</surname>
                            <given-names>AM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>McHugh</surname>
                            <given-names>TD</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Four-month moxifloxacin-based regimens for drug-sensitive tuberculosis.</article-title>
                    <source>

                        <italic toggle="yes">N Engl J Med.</italic>
</source>
                    <year>2014</year>;<volume>371</volume>(<issue>17</issue>):<fpage>1577</fpage>&#x2013;<lpage>87</lpage>.
                    <pub-id pub-id-type="pmid">25196020</pub-id>
                    <pub-id pub-id-type="doi">10.1056/NEJMoa1407426</pub-id>
                    <pub-id pub-id-type="pmcid">4277680</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jindani</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Harrison</surname>
                            <given-names>TS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nunn</surname>
                            <given-names>AJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>High-Dose Rifapentine with Moxifloxacin for Pulmonary Tuberculosis.</article-title>
                    <source>

                        <italic toggle="yes">N Engl J Med.</italic>
</source>
                    <year>2014</year>;<volume>371</volume>(<issue>17</issue>):<fpage>1599</fpage>&#x2013;<lpage>608</lpage>.
                    <pub-id pub-id-type="pmid">25337749</pub-id>
                    <pub-id pub-id-type="doi">10.1056/NEJMoa1314210</pub-id>
                    <pub-id pub-id-type="pmcid">4233406</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Merle</surname>
                            <given-names>CS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fielding</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sow</surname>
                            <given-names>OB</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A Four-Month Gatifloxacin-Containing Regimen for Treating Tuberculosis.</article-title>
                    <source>

                        <italic toggle="yes">N Engl J Med.</italic>
</source>
                    <year>2014</year>;<volume>371</volume>(<issue>17</issue>):<fpage>1588</fpage>&#x2013;<lpage>98</lpage>.
                    <pub-id pub-id-type="pmid">25337748</pub-id>
                    <pub-id pub-id-type="doi">10.1056/NEJMoa1315817</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chen</surname>
                            <given-names>RY</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Via</surname>
                            <given-names>LE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dodd</surname>
                            <given-names>LE</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Using biomarkers to predict TB treatment duration (Predict TB): a prospective, randomized, noninferiority, treatment shortening clinical trial.</article-title>
                    <source>

                        <italic toggle="yes">Gates Open Res.</italic>
</source>
                    <year>2017</year>;<volume>1</volume>:<fpage>9</fpage>.
                    <pub-id pub-id-type="pmid">29528048</pub-id>
                    <pub-id pub-id-type="doi">10.12688/gatesopenres.12750.1</pub-id>
                    <pub-id pub-id-type="pmcid">5841574</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Imperial</surname>
                            <given-names>MZ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nahid</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Phillips</surname>
                            <given-names>PPJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A patient-level pooled analysis of treatment-shortening regimens for drug-susceptible pulmonary tuberculosis.</article-title>
                    <source>

                        <italic toggle="yes">Nat Med.</italic>
</source>
                    <year>2018</year>;<volume>24</volume>(<issue>11</issue>):<fpage>1708</fpage>&#x2013;<lpage>15</lpage>.
                    <pub-id pub-id-type="pmid">30397355</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41591-018-0224-2</pub-id>
                    <pub-id pub-id-type="pmcid">6685538</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Romanowski</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Balshaw</surname>
                            <given-names>RF</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Benedetti</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Predicting tuberculosis relapse in patients treated with the standard 6-month regimen: an individual patient data meta-analysis.</article-title>
                    <source>

                        <italic toggle="yes">Thorax.</italic>
</source>
                    <year>2019</year>;<volume>74</volume>(<issue>3</issue>):<fpage>291</fpage>&#x2013;<lpage>297</lpage>.
                    <pub-id pub-id-type="pmid">30420407</pub-id>
                    <pub-id pub-id-type="doi">10.1136/thoraxjnl-2017-211120</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Johnson</surname>
                            <given-names>JL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hadad</surname>
                            <given-names>DJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dietze</surname>
                            <given-names>R</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Shortening treatment in adults with noncavitary tuberculosis and 2-month culture conversion.</article-title>
                    <source>

                        <italic toggle="yes">Am J Respir Crit Care Med.</italic>
</source>
                    <year>2009</year>;<volume>180</volume>(<issue>6</issue>):<fpage>558</fpage>&#x2013;<lpage>63</lpage>.
                    <pub-id pub-id-type="pmid">19542476</pub-id>
                    <pub-id pub-id-type="doi">10.1164/rccm.200904-0536OC</pub-id>
                    <pub-id pub-id-type="pmcid">2742745</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Fox</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sutherland</surname>
                            <given-names>I</given-names>
                        </name>
</person-group>:
                    <article-title>A five-year assessment of patients in a controlled trial of streptomycin, para-aminosalicylic acid, and streptomycin plus para-aminosalicylic acid, in pulmonary tuberculosis.</article-title>
                    <source>

                        <italic toggle="yes">Q J Med.</italic>
</source>
                    <year>1956</year>;<volume>25</volume>(<issue>98</issue>):<fpage>221</fpage>&#x2013;<lpage>43</lpage>.
                    <pub-id pub-id-type="pmid">13323251</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-10">
                <label>10</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Benator</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bhattacharya</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bozeman</surname>
                            <given-names>L</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Rifapentine and isoniazid once a week versus rifampicin and isoniazid twice a week for treatment of drug-susceptible pulmonary tuberculosis in HIV-negative patients: a randomised clinical trial.</article-title>
                    <source>

                        <italic toggle="yes">Lancet.</italic>
</source>
                    <year>2002</year>;<volume>360</volume>(<issue>9332</issue>):<fpage>528</fpage>&#x2013;<lpage>34</lpage>.
                    <pub-id pub-id-type="pmid">12241657</pub-id>
                    <pub-id pub-id-type="doi">10.1016/s0140-6736(02)09742-8</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nettles</surname>
                            <given-names>RE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mazo</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Alwood</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Risk factors for relapse and acquired rifamycin resistance after directly observed tuberculosis treatment: a comparison by HIV serostatus and rifamycin use.</article-title>
                    <source>

                        <italic toggle="yes">Clin Infect Dis.</italic>
</source>
                    <year>2004</year>;<volume>38</volume>(<issue>5</issue>):<fpage>731</fpage>&#x2013;<lpage>6</lpage>.
                    <pub-id pub-id-type="pmid">14986259</pub-id>
                    <pub-id pub-id-type="doi">10.1086/381675</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yew</surname>
                            <given-names>WW</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chan</surname>
                            <given-names>CK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chau</surname>
                            <given-names>CH</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Outcomes of patients with multidrug-resistant pulmonary tuberculosis treated with ofloxacin/levofloxacin-containing regimens.</article-title>
                    <source>

                        <italic toggle="yes">Chest.</italic>
</source>
                    <year>2000</year>;<volume>117</volume>(<issue>3</issue>):<fpage>744</fpage>&#x2013;<lpage>51</lpage>.
                    <pub-id pub-id-type="pmid">10713001</pub-id>
                    <pub-id pub-id-type="doi">10.1378/chest.117.3.744</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-13">
                <label>13</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sonnenberg</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Murray</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Glynn</surname>
                            <given-names>JR</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>HIV-1 and recurrence, relapse, and reinfection of tuberculosis after cure: a cohort study in South African mineworkers.</article-title>
                    <source>

                        <italic toggle="yes">Lancet.</italic>
</source>
                    <year>2001</year>;<volume>358</volume>(<issue>9294</issue>):<fpage>1687</fpage>&#x2013;<lpage>93</lpage>.
                    <pub-id pub-id-type="pmid">11728545</pub-id>
                    <pub-id pub-id-type="doi">10.1016/S0140-6736(01)06712-5</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-14">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Coleman</surname>
                            <given-names>MT</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Maiello</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tomko</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Early Changes by (18)Fluorodeoxyglucose positron emission tomography coregistered with computed tomography predict outcome after Mycobacterium tuberculosis infection in cynomolgus macaques.</article-title>
                    <source>

                        <italic toggle="yes">Infect Immun.</italic>
</source>
                    <year>2014</year>;<volume>82</volume>(<issue>6</issue>):<fpage>2400</fpage>&#x2013;<lpage>4</lpage>.
                    <pub-id pub-id-type="pmid">24664509</pub-id>
                    <pub-id pub-id-type="doi">10.1128/IAI.01599-13</pub-id>
                    <pub-id pub-id-type="pmcid">4019174</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-15">
                <label>15</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lin</surname>
                            <given-names>PL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Coleman</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Carney</surname>
                            <given-names>JP</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Radiologic responses in cynomolgous macaques for assessing tuberculosis chemotherapy regimens.</article-title>
                    <source>

                        <italic toggle="yes">Antimicrob Agents Chemother.</italic>
</source>
                    <year>2013</year>;<volume>57</volume>(<issue>9</issue>):<fpage>4237</fpage>&#x2013;<lpage>4244</lpage>.
                    <pub-id pub-id-type="pmid">23796926</pub-id>
                    <pub-id pub-id-type="doi">10.1128/AAC.00277-13</pub-id>
                    <pub-id pub-id-type="pmcid">3754323</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-16">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chen</surname>
                            <given-names>RY</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dodd</surname>
                            <given-names>LE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lee</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>PET/CT imaging correlates with treatment outcome in patients with multidrug-resistant tuberculosis.</article-title>
                    <source>

                        <italic toggle="yes">Sci Transl Med.</italic>
</source>
                    <year>2014</year>;<volume>6</volume>(<issue>265</issue>):<fpage>265ra166</fpage>.
                    <pub-id pub-id-type="pmid">25473034</pub-id>
                    <pub-id pub-id-type="doi">10.1126/scitranslmed.3009501</pub-id>
                    <pub-id pub-id-type="pmcid">5567784</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-17">
                <label>17</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Malherbe</surname>
                            <given-names>ST</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Shenai</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ronacher</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Persisting positron emission tomography lesion activity and 
                        <italic toggle="yes">Mycobacterium tuberculosis</italic> mRNA after tuberculosis cure.</article-title>
                    <source>

                        <italic toggle="yes">Nat Med.</italic>
</source>
                    <year>2016</year>;<volume>22</volume>(<issue>10</issue>):<fpage>1094</fpage>&#x2013;<lpage>1100</lpage>.
                    <pub-id pub-id-type="pmid">27595324</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nm.4177</pub-id>
                    <pub-id pub-id-type="pmcid">5053881</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-18">
                <label>18</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wallis</surname>
                            <given-names>RS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kim</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cole</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Tuberculosis biomarkers discovery: developments, needs, and challenges.</article-title>
                    <source>

                        <italic toggle="yes">Lancet Infect Dis.</italic>
</source>
                    <year>2013</year>;<volume>13</volume>(<issue>4</issue>):<fpage>362</fpage>&#x2013;<lpage>72</lpage>.
                    <pub-id pub-id-type="pmid">23531389</pub-id>
                    <pub-id pub-id-type="doi">10.1016/S1473-3099(13)70034-3</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-19">
                <label>19</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Horne</surname>
                            <given-names>DJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Royce</surname>
                            <given-names>SE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gooze</surname>
                            <given-names>L</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Sputum monitoring during tuberculosis treatment for predicting outcome: systematic review and meta-analysis.</article-title>
                    <source>

                        <italic toggle="yes">Lancet Infect Dis.</italic>
</source>
                    <year>2010</year>;<volume>10</volume>(<issue>6</issue>):<fpage>387</fpage>&#x2013;<lpage>94</lpage>.
                    <pub-id pub-id-type="pmid">20510279</pub-id>
                    <pub-id pub-id-type="doi">10.1016/S1473-3099(10)70071-2</pub-id>
                    <pub-id pub-id-type="pmcid">3046810</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-20">
                <label>20</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Phillips</surname>
                            <given-names>PP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mendel</surname>
                            <given-names>CM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Burger</surname>
                            <given-names>DA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Limited role of culture conversion for decision-making in individual patient care and for advancing novel regimens to confirmatory clinical trials.</article-title>
                    <source>

                        <italic toggle="yes">BMC Med.</italic>
</source>
                    <year>2016</year>;<volume>14</volume>:<fpage>19</fpage>.
                    <pub-id pub-id-type="pmid">26847437</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s12916-016-0565-y</pub-id>
                    <pub-id pub-id-type="pmcid">4743210</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-21">
                <label>21</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Brennan</surname>
                            <given-names>AT</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Maskew</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sanne</surname>
                            <given-names>I</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The interplay between CD4 cell count, viral load suppression and duration of antiretroviral therapy on mortality in a resource-limited setting.</article-title>
                    <source>

                        <italic toggle="yes">Trop Med Int Health.</italic>
</source>
                    <year>2013</year>;<volume>18</volume>(<issue>5</issue>):<fpage>619</fpage>&#x2013;<lpage>31</lpage>.
                    <pub-id pub-id-type="pmid">23419157</pub-id>
                    <pub-id pub-id-type="doi">10.1111/tmi.12079</pub-id>
                    <pub-id pub-id-type="pmcid">3625450</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kurbatova</surname>
                            <given-names>EV</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cegielski</surname>
                            <given-names>JP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lienhardt</surname>
                            <given-names>C</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Sputum culture conversion as a prognostic marker for end-of-treatment outcome in patients with multidrug-resistant tuberculosis: a secondary analysis of data from two observational cohort studies.</article-title>
                    <source>

                        <italic toggle="yes">Lancet Respir Med.</italic>
</source>
                    <year>2015</year>;<volume>3</volume>(<issue>3</issue>):<fpage>201</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="pmid">25726085</pub-id>
                    <pub-id pub-id-type="doi">10.1016/S2213-2600(15)00036-3</pub-id>
                    <pub-id pub-id-type="pmcid">4401426</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>El Alili</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vrijens</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Demonceau</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A scoping review of studies comparing the medication event monitoring system (MEMS) with alternative methods for measuring medication adherence.</article-title>
                    <source>

                        <italic toggle="yes">Br J Clin Pharmacol.</italic>
</source>
                    <year>2016</year>;<volume>82</volume>(<issue>1</issue>):<fpage>268</fpage>&#x2013;<lpage>79</lpage>.
                    <pub-id pub-id-type="pmid">27005306</pub-id>
                    <pub-id pub-id-type="doi">10.1111/bcp.12942</pub-id>
                    <pub-id pub-id-type="pmcid">4917812</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Valencia</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Le&#x00f3;n</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Losada</surname>
                            <given-names>I</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>How do we measure adherence to anti-tuberculosis treatment?</article-title>
                    <source>

                        <italic toggle="yes">Expert Rev Anti Infect Ther.</italic>
</source>
                    <year>2017</year>;<volume>15</volume>(<issue>2</issue>):<fpage>157</fpage>&#x2013;<lpage>65</lpage>.
                    <pub-id pub-id-type="pmid">27910715</pub-id>
                    <pub-id pub-id-type="doi">10.1080/14787210.2017.1264270</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Marx</surname>
                            <given-names>FM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dunbar</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Enarson</surname>
                            <given-names>DA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The temporal dynamics of relapse and reinfection tuberculosis after successful treatment: a retrospective cohort study.</article-title>
                    <source>

                        <italic toggle="yes">Clin Infect Dis.</italic>
</source>
                    <year>2014</year>;<volume>58</volume>(<issue>12</issue>):<fpage>1676</fpage>&#x2013;<lpage>83</lpage>.
                    <pub-id pub-id-type="pmid">24647020</pub-id>
                    <pub-id pub-id-type="doi">10.1093/cid/ciu186</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-26">
                <label>26</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Friedrich</surname>
                            <given-names>SO</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rachow</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Saathoff</surname>
                            <given-names>E</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Assessment of the sensitivity and specificity of Xpert MTB/RIF assay as an early sputum biomarker of response to tuberculosis treatment.</article-title>
                    <source>

                        <italic toggle="yes">Lancet Respir Med.</italic>
</source>
                    <year>2013</year>;<volume>1</volume>(<issue>6</issue>):<fpage>462</fpage>&#x2013;<lpage>70</lpage>.
                    <pub-id pub-id-type="pmid">24429244</pub-id>
                    <pub-id pub-id-type="doi">10.1016/S2213-2600(13)70119-X</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-27">
                <label>27</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lange</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Khan</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kalmambetova</surname>
                            <given-names>G</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Diagnostic accuracy of the Xpert &#x00ae; MTB/RIF cycle threshold level to predict smear positivity: a meta-analysis.</article-title>
                    <source>

                        <italic toggle="yes">Int J Tuberc Lung Dis.</italic>
</source>
                    <year>2017</year>;<volume>21</volume>(<issue>5</issue>):<fpage>493</fpage>&#x2013;<lpage>502</lpage>.
                    <pub-id pub-id-type="pmid">28399963</pub-id>
                    <pub-id pub-id-type="doi">10.5588/ijtld.16.0702</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-28">
                <label>28</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Beynon</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Theron</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Respeito</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Correlation of Xpert MTB/RIF with measures to assess 
                        <italic toggle="yes">Mycobacterium tuberculosis</italic> bacillary burden in high HIV burden areas of Southern Africa.</article-title>
                    <source>

                        <italic toggle="yes">Sci Rep.</italic>
</source>
                    <year>2018</year>;<volume>8</volume>(<issue>1</issue>):<fpage>5201</fpage>.
                    <pub-id pub-id-type="pmid">29581435</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41598-018-23066-2</pub-id>
                    <pub-id pub-id-type="pmcid">5980110</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-29">
                <label>29</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Shenai</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ronacher</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Malherbe</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Bacterial Loads Measured by the Xpert MTB/RIF Assay as Markers of Culture Conversion and Bacteriological Cure in Pulmonary TB.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2016</year>;<volume>11</volume>(<issue>8</issue>):<fpage>e0160062</fpage>.
                    <pub-id pub-id-type="pmid">27508390</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0160062</pub-id>
                    <pub-id pub-id-type="pmcid">4980126</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-30">
                <label>30</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Podewils</surname>
                            <given-names>LJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gler</surname>
                            <given-names>MT</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Quelapio</surname>
                            <given-names>MI</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Patterns of treatment interruption among patients with multidrug-resistant TB (MDR TB) and association with interim and final treatment outcomes.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2013</year>;<volume>8</volume>(<issue>7</issue>):<fpage>e70064</fpage>.
                    <pub-id pub-id-type="pmid">23922904</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0070064</pub-id>
                    <pub-id pub-id-type="pmcid">3726487</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-31">
                <label>31</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Robin</surname>
                            <given-names>X</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Turck</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hainard</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>pROC: an open-source package for R and S+ to analyze and compare ROC curves.</article-title>
                    <source>

                        <italic toggle="yes">BMC Bioinformatics.</italic>
</source>
                    <year>2011</year>;<volume>12</volume>:<fpage>77</fpage>.
                    <pub-id pub-id-type="pmid">21414208</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1471-2105-12-77</pub-id>
                    <pub-id pub-id-type="pmcid">3068975</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-32">
                <label>32</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sing</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sander</surname>
                            <given-names>O</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Beerenwinkel</surname>
                            <given-names>N</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>ROCR: visualizing classifier performance in R.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2005</year>;<volume>21</volume>(<issue>20</issue>):<fpage>3940</fpage>&#x2013;<lpage>1</lpage>.
                    <pub-id pub-id-type="pmid">16096348</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/bti623</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-33">
                <label>33</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Thompson</surname>
                            <given-names>EG</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Du</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Malherbe</surname>
                            <given-names>ST</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Host blood RNA signatures predict the outcome of tuberculosis treatment.</article-title>
                    <source>

                        <italic toggle="yes">Tuberculosis (Edinb).</italic>
</source>
                    <year>2017</year>;<volume>107</volume>:<fpage>48</fpage>&#x2013;<lpage>58</lpage>.
                    <pub-id pub-id-type="pmid">29050771</pub-id>
                    <pub-id pub-id-type="doi"> 10.1016/j.tube.2017.08.004</pub-id>
                    <pub-id pub-id-type="pmcid">5658513</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-34">
                <label>34</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Penn-Nicholson</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mbandi</surname>
                            <given-names>SK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Thompson</surname>
                            <given-names>E</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>RISK6, a 6-gene transcriptomic signature of TB disease risk, diagnosis and treatment response.</article-title>
                    <source>

                        <italic toggle="yes">Sci Rep.</italic>
</source>
                    <year>2020</year>;<volume>10</volume>(<issue>1</issue>):<fpage>8629</fpage>.
                    <pub-id pub-id-type="pmid">32451443</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41598-020-65043-8</pub-id>
                    <pub-id pub-id-type="pmcid">7248089</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-35">
                <label>35</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chen</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>:
                    <article-title>"Replication Data for PredictTB Early Treatment Completion Criteria". Harvard Dataverse, V1, UNF: 6:wckzF/sNge+t4N9AnwnfpA== [fileUNF].</article-title>
                    <year> 2020</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.doi.org/10.7910/DVN/97HYQ5">http://www.doi.org/10.7910/DVN/97HYQ5</ext-link>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report29920">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/gatesopenres.14379.r29920</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Menzies</surname>
                        <given-names>Richard</given-names>
                    </name>
                    <xref ref-type="aff" rid="r29920a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r29920a1">
                    <label>1</label>McGill University International TB Centre, Montreal, QC, Canada</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>17</day>
                <month>2</month>
                <year>2021</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2021 Menzies R</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport29920" related-article-type="peer-reviewed-article" xlink:href="10.12688/gatesopenres.13179.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>reject</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This is an interesting paper in which the authors describe development of risk stratification that they then apply to an ongoing randomized trial. The fundamental objective is to predict the subgroup of patients with active TB with high treatment success rates with shortened therapy, i.e. 4 months of therapy, and identifying those who require longer treatment.</p>
            <p> </p>
            <p> 
                <bold>Major comments:</bold>
            </p>
            <p> </p>
            <p> The problem in treatment of drug-sensitive TB is the relatively high relapse rate with standard 6-month treatment, not failure. In most systematic reviews, the failure rate is less than 1%, but the relapse rate is around 4% or 5% in patients with DS-TB treated with the standard therapy of INH, Rifampin, PZA and Ethambutol for 2 months followed by 4 months of INH and Rifampin (2HRZE/4HR). And in unselected patients receiving 4 months - the relapse rate is much higher (10-15%). A number of risk factors for relapse have been identified: intermittent therapy, particularly in the first 2 months of therapy, cavities (on plain chest radiograph) present at initiation of treatment, or at 2 months or at 5 months, smear positivity at baseline or at 2 months, and culture positivity at 2 months. All of these risk factors predict higher rates of relapse; currently therapy in patients with these risk factors is prolonged until 9 months. One would imagine that the counterfactual would be true, i.e. patients without these risk factors should benefit from shorter treatment, but this remains a hypothesis that has not yet been borne out in trials.</p>
            <p> </p>
            <p> There are two major problems with the objectives of this study: (1) The investigators are assessing different information in an attempt to refine the prediction of relapse. This is good, since prediction of relapse is imperfect at present. BUT, they have not made use of the available prediction tools that are already known, but rather have used ONLY the different prediction tools - findings on CT scan and PET reactivity. So we do not know if CT or PET findings actually improve accuracy of prediction when added to the microbiologic and radiographic information that is currently available to clinicians in all settings. (2) They focused on treatment failure rather than treatment relapse. The authors justified this because relapse was more difficult to diagnose under programmatic conditions. But unfortunately this does not change the fact that an analysis of treatment failure is simply looking at the wrong outcome for treatment shortening. The study should remain focused on the key problem for TB treatment shortening - which is relapse after the end of treatment.</p>
            <p> </p>
            <p> 
                <bold>Specific comments:</bold>
            </p>
            <p> </p>
            <p> There are a number of points of information that would be helpful in order to be able to better interpret this paper. This information includes: 
                <list list-type="order">
                    <list-item>
                        <p>Patients had drug-sensitive TB, but how many of them had prior treatment? This is an important variable to consider, because it may predict adherence behaviour but also because there is some evidence that previously treated patients may be at greater risk of failure because of some degree of drug resistance, which is not always evident with phenotypic DST.</p>
                    </list-item>
                    <list-item>
                        <p>Were all isolates from these patients tested for all first-line agents or only for Rifampin? If only for Rifampin, they could miss important resistance to other first-line drugs which would substantially increase the risk of both failure and relapse, but also make the results of less interest because treatment shortening should not be considered if there is any first-line drug resistance.</p>
                    </list-item>
                    <list-item>
                        <p>What was the regimen given? I&#x2019;m assuming it was 2HRZE/4HR but this should be specified. Also, was it daily or intermittent? Was DOT used and, if so, how was DOT given? Were patients hospitalized for any significant time at the beginning of therapy?</p>
                    </list-item>
                    <list-item>
                        <p>Patient characteristics are missing. In particular, we need to know if they were HIV-infected and, if so, if they were on effective ART.</p>
                    </list-item>
                    <list-item>
                        <p>Adherence &#x2013; the authors mentioned that pill counts are done, which they acknowledge has limitations. Again, this raises the issue of DOT and why DOT was not used? If they were relying on pill counts, what was the percent adherence based on pill counts? This would be important particularly to look at the difference in those who were cured, failed or had retreatment.</p>
                    </list-item>
                    <list-item>
                        <p>Failure &#x2013; very high failure rates are noted here. 8 out of 92 failed which is 9%. As noted earlier, generally drug-sensitive patients with standard regimen have less than 1% failure rate. Why so many failures?</p>
                    </list-item>
                    <list-item>
                        <p>Was there evidence of acquired drug resistance? Could this have been in fact re-infection or new infection with drug-resistant strains?</p>
                    </list-item>
                    <list-item>
                        <p>Relapse &#x2013; also very high. 13%, or 11 out of 84, relapsed compared to usually 4-5% in most series. I understand they were not all culture confirmed, but how many were culture confirmed, and of these, how many have acquired drug resistance or evidence of re-infection? Even the basic information of whether they were HIV-infected or not would be helpful in order to better interpret this information.</p>
                    </list-item>
                </list> </p>
            <p> Recommendations: 
                <list list-type="order">
                    <list-item>
                        <p>First provide the missing information as described above.</p>
                    </list-item>
                    <list-item>
                        <p>Some explanation as to the high failure and relapse rates would be important.</p>
                    </list-item>
                    <list-item>
                        <p>In Table 1, add microbiologic findings, particularly smear and culture status at baseline or pre-treatment and at 2 months. Also add plain chest radiography results so that we can see whether CT or PET findings are adding to these other known predictors.</p>
                    </list-item>
                    <list-item>
                        <p>Incorporate the known predictive findings (plain chest Xray, smear and culture) into all predictive algorithms. That should be the starting point. Then assess whether adding CT and PET findings improves the prediction.</p>
                    </list-item>
                    <list-item>
                        <p>PET findings, I think it would be helpful to have a simple summary of the PET findings at baseline and if the repeat simply is improving or not, or whether the PET has improved in some places but worsened in others. These qualitative interpretations would be helpful because that is the info in reports that clinicians receive, and would give a&#x00a0;better sense of whether PET scan findings are in fact predictive.</p>
                    </list-item>
                </list>
            </p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Partly</p>
            <p>Is the description of the method technically sound?</p>
            <p>No</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Clinical trials, Epidemiology - in TB</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to state that I do not consider it to be of an acceptable scientific standard, for reasons outlined above.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report30223">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/gatesopenres.14379.r30223</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Chadha</surname>
                        <given-names>Vineet K.</given-names>
                    </name>
                    <xref ref-type="aff" rid="r30223a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-4457-4842</uri>
                </contrib>
                <aff id="r30223a1">
                    <label>1</label>National TB Institute, Bengaluru, Karnataka, India</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>16</day>
                <month>2</month>
                <year>2021</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2021 Chadha VK</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport30223" related-article-type="peer-reviewed-article" xlink:href="10.12688/gatesopenres.13179.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>reject</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>In the &#x2018;Background&#x2019; section, the authors as a justification for their analysis have referred to high treatment success rates with shorter drug regimen but have conveniently ignored the higher relapse/recurrence rates with these regimens as observed during the same studies as well as in many other studies. The Predict TB trial wherein patients were stratified into shorter and standard duration therapy based on the criteria as proposed by the authors in the current manuscript has already been discontinued for the same reason of significantly higher rates of recurrence in the shorter regimen arm. This is a further testimony to the poor justification for the analysis as well as arbitrariness in the repeated mathematical jugglery to reach 50% allocation target.</p>
            <p> </p>
            <p> The small sample size of 81 patients used for the present analysis is another issue. Besides, the method used by the authors to ascertain recurrence based on symptomatic status after 2 years without any systematic effort by regular screening and sputum examination during the post treatment period also propagates a wrong method for ascertaining recurrence. I don&#x2019;t find any novelty or learning for other researchers in adopting their method in any of their endeavors. &#x00a0;</p>
            <p> </p>
            <p> The manuscript should therefore not pass peer-review.</p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Partly</p>
            <p>Is the description of the method technically sound?</p>
            <p>No</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>No</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Partly</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>No</p>
            <p>Reviewer Expertise:</p>
            <p>Epidemiology</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to state that I do not consider it to be of an acceptable scientific standard, for reasons outlined above.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report30195">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/gatesopenres.14379.r30195</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Vorster</surname>
                        <given-names>Mariza</given-names>
                    </name>
                    <xref ref-type="aff" rid="r30195a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-5643-1553</uri>
                </contrib>
                <aff id="r30195a1">
                    <label>1</label>Department of Nuclear Medicine, University of Pretoria, Pretoria, South Africa</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>25</day>
                <month>1</month>
                <year>2021</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2021 Vorster M</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport30195" related-article-type="peer-reviewed-article" xlink:href="10.12688/gatesopenres.13179.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>In this paper the authors set out to explain how image findings on 
                <sup>18</sup>F-FDG PET/CT and GeneXpert cycle threshold data have been analyzed in order to develop the early treatment completion algorithm that is currently applied in the ongoing PredictTB trial. The PredictTB trial aims to prospectively identify the sub-group of patients that are at a lower risk and who could successfully complete treatment at 4 months.</p>
            <p> </p>
            <p> 
                <sup>18</sup>F-FDG PET/CT was acquired at baseline and again at week 4 of the treatment regimen for the 92 patients with pulmonary TB. Patient adherence and residual TB bacterial load was assessed by means of Sputum Xpert MTB/RIF cycle threshold at week 16. Patients included were followed up for a period of at least one year to assess outcomes. Measurements included determination of baseline disease burden as well as treatment response evaluation.</p>
            <p> </p>
            <p> Semi-quantitative parameters included Total Lesion Glycolysis (TLG) on PET and Hounsfield units and air volumes on CT. The software used and other aspects of methodology were described in sufficient detail to allow replication. Details on the statistical analysis are adequate.</p>
            <p> </p>
            <p> The results indicated that 73 patients (out of 92 included) were asymptomatic at 2 years after completion of therapy, with 8 treatment failures and 11 retreated patients. Baseline imaging parameters demonstrated statistically significant differences between cure and treatment failure and between failure vs re-treatment with regards to the largest cavity air volume and total cavity air volume. None of the other imaging features could reliably differentiate the various treatment response groups from one another.</p>
            <p> </p>
            <p> 
                <bold>With regards to the statistical analysis, it may be problematic to compare one large group of patients with two other groups that are significantly smaller. Therefore, important differences could potentially emerge with bigger patient numbers in the other groups.</bold>
            </p>
            <p> </p>
            <p> In addition, ROCs were drawn for each variable and thresholds adjusted in order to determine the best possible sensitivity vs specificity trade-off. The authors detail the effects of varying thresholds in various figures and tables. These indicated that any algorithm with high sensitivity for high-risk patients (with unfavourable outcomes), suffered from low specificity for low-risk patients (and vice versa). The algorithm was therefore adapted to stratify 50% of patients to the higher risk arm and 50% to the lower risk arm.</p>
            <p> </p>
            <p> Study limitations included the lack of directly observed treatment, with subsequent inability to differentiate poor adherence from true treatment failure and true relapse from re-infections and are detailed in the discussion.</p>
            <p> </p>
            <p> The developed algorithm is used in the ongoing PredictTB trial and its prediction performance remains to be seen.</p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the method technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Gallium-68 based PET, Theranostics, Infection and Inflammation imaging</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
        <sub-article article-type="response" id="comment3411-30195">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Chen</surname>
                            <given-names>Ray</given-names>
                        </name>
                        <aff/>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>None.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>3</day>
                    <month>2</month>
                    <year>2021</year>
                </pub-date>
            </front-stub>
            <body>
                <p>We thank the reviewer for these positive comments. With respect to the smaller number of unfavorable outcomes, we use statistical methodology that conditions on outcomes (e.g., methods appropriate for case-control studies), which appropriately accounts for different sample sizes in the two groups. With respect to the total sample size, as stated in our limitations paragraph, we acknowledge the challenge of drawing definitive conclusions from small numbers. This became evident early on in the trial by the imbalance in arm distribution, requiring a modification to the early treatment completion criteria. However, these were the only data we had when we began the trial. With the data generated in the PredictTB trial, we may find that additional modifications to the early treatment completion criteria are necessary.</p>
            </body>
        </sub-article>
    </sub-article>
</article>
