<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">Gates Open Res</journal-id>
            <journal-title-group>
                <journal-title>Gates Open Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2572-4754</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/gatesopenres.13116.2</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Research Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Estimating the power to detect a change caused by a vaccine from time series data</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 2; peer review: 2 approved]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Weinberger</surname>
                        <given-names>Daniel M.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-1178-8086</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Warren</surname>
                        <given-names>Joshua L.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a2">2</xref>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, CT, 06520, USA</aff>
                <aff id="a2">
                    <label>2</label>Public Health Modeling Unit, Yale School of Public Health, New Haven, CT, 06520, USA</aff>
                <aff id="a3">
                    <label>3</label>Department of Biostatistics, Yale School of Public Health, New Haven, CT, 06520, USA</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:daniel.weinberger@yale.edu">daniel.weinberger@yale.edu</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>DMW has received consulting fees from Pfizer, Merck GSK, and Affinivax for work outside of this manuscript. JLW has no conflicts to disclose.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>19</day>
                <month>10</month>
                <year>2020</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2020</year>
            </pub-date>
            <volume>4</volume>
            <elocation-id>27</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>26</day>
                    <month>8</month>
                    <year>2020</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2020 Weinberger DM and Warren JL</copyright-statement>
                <copyright-year>2020</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://gatesopenresearch.org/articles/4-27/pdf"/>
            <abstract>
                <p>When evaluating the effects of vaccination programs, it is common to estimate changes in rates of disease before and after vaccine introduction. There are a number of related approaches that attempt to adjust for trends unrelated to the vaccine and to detect changes that coincide with introduction. However, characteristics of the data can influence the ability to estimate such a change. These include, but are not limited to, the number of years of available data prior to vaccine introduction, the expected strength of the effect of the intervention, the strength of underlying secular trends, and the amount of unexplained variability in the data. Sources of unexplained variability include model misspecification, epidemics due to unidentified pathogens, and changes in ascertainment or coding practice among others. In this study, we present a simple simulation framework for estimating the power to detect a decline and the precision of these estimates. We use real-world data from a pre-vaccine period to generate simulated time series where the vaccine effect is specified 
                    <italic toggle="yes">a priori</italic>. We present an interactive web-based tool to implement this approach. We also demonstrate the use of this approach using observed data on pneumonia hospitalization from the states in Brazil from a period prior to introduction of pneumococcal vaccines to generate the simulated time series. We relate the power of the hypothesis tests to the number of cases per year and the amount of unexplained variability in the data and demonstrate how fewer years of data influence the results.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>time series analysis</kwd>
                <kwd>program evaluation</kwd>
                <kwd>vaccine evaluation</kwd>
                <kwd>power calculation</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1" xlink:href="http://dx.doi.org/10.13039/100000060">
                    <funding-source>National Institute of Allergy and Infectious Diseases</funding-source>
                    <award-id>R01AI123208</award-id>
                    <award-id>R01137093</award-id>
                </award-group>
                <award-group id="fund-2" xlink:href="http://dx.doi.org/10.13039/100000865">
                    <funding-source>Gates Foundation</funding-source>
                    <award-id>OPP1176267</award-id>
                </award-group>
                <funding-statement>This work is supported by the Gates Foundation [OPP1176267]; and the National Institute of Allergy and Infectious Disease [R01AI123208; R01137093].</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
        <notes>
            <sec sec-type="version-changes">
                <label>Revised</label>
                <title>Amendments from Version 1</title>
                <p>The analyses were changed so that now only the post vaccine data are simulated. Therefore the calculations of power are conditional on the observed pre-vaccine data. The user interface now also allows the user to adjust the number of pre- and post-vaccine time points. Specific responses are below.</p>
            </sec>
        </notes>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>After a new vaccine is introduced, it is often necessary to evaluate the effect of the intervention on disease rates. This is typically done by evaluating changes in the average number of cases or the trend in cases before and after vaccine introduction
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>. However, this type of analysis is challenging because it can be difficult to distinguish changes in disease rates caused by the vaccine from changes resulting from random variations, epidemics, changes in healthcare utilization, or changes in reporting practices. Additionally, there is often no &#x2018;ground-truth&#x2019; against which estimates can be compared to determine whether an estimate is credible. These issues are a threat to the validity of any vaccine evaluation study, even when using large nationwide databases. However, the problems are especially acute when moving from larger to smaller populations, where the signal to noise ratio will be lower.</p>
            <p>While it is intuitive that having more noise in the data makes it more difficult to detect a change, it is not clear how much data are needed to effectively quantify a vaccine-associated change if one exists. The power to detect a decline will depend on many factors, including the magnitude of the expected effect (higher power with a greater expected decline), the number of cases per unit time, and the number of years of pre- and post-vaccine data. Because the specific characteristics of datasets can vary, it is difficult to make general statements about power. However, simulation-based methods can be used to evaluate and compare power in different datasets based on the pre-vaccine time series and the magnitude of the expected effects
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>,
                    <xref ref-type="bibr" rid="ref-3">3</xref>
                </sup>.</p>
            <p>In this study, we present a simple web-based tool that can be used to input any disease time series and obtain an estimate of the power for that series to detect a specified vaccine-associated decline. This is accomplished by extracting characteristics of the time series (e.g., seasonality, trends, unexplained variability) from the pre-vaccine period and simulating a set of time series that have similar characteristics and have a vaccine impact that is specified 
                <italic toggle="yes">a priori</italic>. We demonstrate the application of this approach on observed pre-vaccine data on pneumonia hospitalizations from the 26 states plus the federal district in Brazil and relate characteristics of the time series to the power.</p>
        </sec>
        <sec sec-type="methods">
            <title>Methods</title>
            <sec>
                <title>Data</title>
                <p>We used state-level hospitalization data from Brazil, which have been described in detail previously
                    <sup>
                        <xref ref-type="bibr" rid="ref-4">4</xref>
                    </sup>. These de-identified data are drawn from the Unified Health System (SIH-SUS, Ministry of Health), which captures ~70% of the population in Brazil. The raw data can be obtained directly by contacting the Ministry of Health in Brazil. The formatted time series data are available in the Github repository for this study.  Each hospitalization is assigned a unique ICD10 code. For these analyses, we focused on data on &lt;12 month old children and 80+ year old adults for the pre-vaccine period 2003&#x2013;2009. These two populations provide a useful contrast. The time series for the infants was relatively stable prior to vaccine introduction, while the data for the 80+ year old adults had a notable increasing trend before vaccine introduction. Both sets of time series exhibit strong seasonality with a peak in the winter. </p>
            </sec>
            <sec>
                <title>Simulating data based on observed time series from Brazil</title>
                <p>The goal for this exercise was to simulate a set of time series with characteristics that resembled the observed hospitalization data from the pre-vaccine period but that had specified vaccine effects added in. We then sought to estimate the vaccine effect using the same model that was used to generate the data and evaluate the power to detect the effect. This provides a best-case scenario where the underlying model is correctly specified. </p>
                <p>The first step in this process was to extract characteristics of the time series from the pre-vaccine period (trend, seasonality, and amount of unexplained variation). For each state, we fit a regression model to the data from the pre-vaccine period (2003&#x2013;2009). The outcome variable was the number of pneumonia hospitalizations (coded as J12-18) per month, and the covariates were an index variable for time (to capture any linear trends in the data) and 12-month and 6-month harmonic variables (to capture seasonality)
                    <sup>
                        <xref ref-type="bibr" rid="ref-5">5</xref>
                    </sup>. We used a Poisson regression model with a Gaussian observation-level random intercept to account for overdispersion in the data such that</p>
                <p>
                    <disp-formula id="e1">
                        <mml:math display="block" id="math1">
                            <mml:msub>
                                <mml:mtext>Y</mml:mtext>
                                <mml:mtext>t</mml:mtext>
                            </mml:msub>
                            <mml:mspace width="0.2em"/>
                            <mml:mo>~</mml:mo>
                            <mml:mtext>Poisson</mml:mtext>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:msup>
                                <mml:mi>e</mml:mi>
                                <mml:mrow>
                                    <mml:msub>
                                        <mml:mi>&#x03bc;</mml:mi>
                                        <mml:mi>t</mml:mi>
                                    </mml:msub>
                                    <mml:mo>+</mml:mo>
                                    <mml:msub>
                                        <mml:mi>&#x03d5;</mml:mi>
                                        <mml:mi>t</mml:mi>
                                    </mml:msub>
                                </mml:mrow>
                            </mml:msup>
                            <mml:mo stretchy="false">)</mml:mo>
                            <mml:mo>,</mml:mo>
                        </mml:math>
                    </disp-formula>
                </p>
                <p>
                    <disp-formula id="e2">
                        <mml:math display="block" id="math2">
                            <mml:msub>
                                <mml:mtext>&#x03bc;</mml:mtext>
                                <mml:mtext>t</mml:mtext>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:msub>
                                <mml:mtext>&#x03b2;</mml:mtext>
                                <mml:mn>0</mml:mn>
                            </mml:msub>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mtext>&#x03b2;</mml:mtext>
                                <mml:mn>1</mml:mn>
                            </mml:msub>
                            <mml:mo>*</mml:mo>
                            <mml:msub>
                                <mml:mtext>time</mml:mtext>
                                <mml:mtext>t</mml:mtext>
                            </mml:msub>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mtext>&#x03b2;</mml:mtext>
                                <mml:mn>2</mml:mn>
                            </mml:msub>
                            <mml:mo>*</mml:mo>
                            <mml:mtext>cos</mml:mtext>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:msub>
                                <mml:mtext>&#x03b8;</mml:mtext>
                                <mml:mtext>t</mml:mtext>
                            </mml:msub>
                            <mml:mo>/</mml:mo>
                            <mml:mn>12</mml:mn>
                            <mml:mo stretchy="false">)</mml:mo>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mtext>&#x03b2;</mml:mtext>
                                <mml:mn>3</mml:mn>
                            </mml:msub>
                            <mml:mo>*</mml:mo>
                            <mml:mtext>cos</mml:mtext>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:msub>
                                <mml:mtext>&#x03b8;</mml:mtext>
                                <mml:mtext>t</mml:mtext>
                            </mml:msub>
                            <mml:mo>/</mml:mo>
                            <mml:mn>12</mml:mn>
                            <mml:mo stretchy="false">)</mml:mo>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mtext>&#x03b2;</mml:mtext>
                                <mml:mn>4</mml:mn>
                            </mml:msub>
                            <mml:mo>*</mml:mo>
                            <mml:mtext>sin</mml:mtext>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:msub>
                                <mml:mtext>&#x03b8;</mml:mtext>
                                <mml:mtext>t</mml:mtext>
                            </mml:msub>
                            <mml:mo>/</mml:mo>
                            <mml:mn>6</mml:mn>
                            <mml:mo stretchy="false">)</mml:mo>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mtext>&#x03b2;</mml:mtext>
                                <mml:mn>5</mml:mn>
                            </mml:msub>
                            <mml:mo>*</mml:mo>
                            <mml:mtext>sin</mml:mtext>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:msub>
                                <mml:mtext>&#x03b8;</mml:mtext>
                                <mml:mtext>t</mml:mtext>
                            </mml:msub>
                            <mml:mo>/</mml:mo>
                            <mml:mn>6</mml:mn>
                            <mml:mo stretchy="false">)</mml:mo>
                            <mml:mo>,</mml:mo>
                            <mml:mspace width="0.2em"/>
                            <mml:mtext>and</mml:mtext>
                        </mml:math>
                    </disp-formula>
                </p>
                <p>
                    <disp-formula id="e3">
                        <mml:math display="block" id="math3">
                            <mml:msub>
                                <mml:mtext>&#x03b8;</mml:mtext>
                                <mml:mtext>t</mml:mtext>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:mn>2</mml:mn>
                            <mml:mo>*</mml:mo>
                            <mml:mtext>&#x03c0;</mml:mtext>
                            <mml:mo>*</mml:mo>
                            <mml:msub>
                                <mml:mtext>time</mml:mtext>
                                <mml:mtext>t</mml:mtext>
                            </mml:msub>
                            <mml:mtext>.</mml:mtext>
                        </mml:math>
                    </disp-formula>
                </p>
                <p>The models were fit using the glmer function in the lme4 package in R, version 3.6.1.</p>
                <p>The next step is to use the fitted model to simulate time series of counts of hospitalization for the post-vaccine period with similar characteristics as the pre-vaccine period, with a specified vaccine-associated decline added to the simulated data. Using the estimated regression coefficients 
                    <inline-formula>
                        <mml:math display="inline" id="M">
                            <mml:mrow>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:msub>
                                    <mml:mover accent="true">
                                        <mml:mtext>&#x03b2;</mml:mtext>
                                        <mml:mo>^</mml:mo>
                                    </mml:mover>
                                    <mml:mtext>k</mml:mtext>
                                </mml:msub>
                                <mml:mo stretchy="false">)</mml:mo>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula> and their estimated variance/covariance matrix, we generated 500 independent random draws of the parameters from a multivariate normal distribution for each state and age group combination. These were combined with the design matrix to obtain simulated 
                    <inline-formula>
                        <mml:math display="inline" id="M1">
                            <mml:mrow>
                                <mml:msub>
                                    <mml:mover accent="true">
                                        <mml:mi>&#x03bc;</mml:mi>
                                        <mml:mo>^</mml:mo>
                                    </mml:mover>
                                    <mml:mi>t</mml:mi>
                                </mml:msub>
                                <mml:mo>.</mml:mo>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula> Random draws of 
                    <inline-formula>
                        <mml:math display="inline" id="M2">
                            <mml:mrow>
                                <mml:msub>
                                    <mml:mover accent="true">
                                        <mml:mi>&#x03d5;</mml:mi>
                                        <mml:mo>^</mml:mo>
                                    </mml:mover>
                                    <mml:mi>t</mml:mi>
                                </mml:msub>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula> were independently generated from a normal distribution with a mean of 0 and a standard deviation equal to the standard deviation estimated for 
                    <italic toggle="yes">&#x03d5;
                        <sub>t</sub>
                    </italic> from the fitted model. To incorporate a known vaccine effect, we assumed that the time series declined by a specified amount over a 24-month period. We generated time series where the maximum vaccine effect after 24 months ranged from a 10% &#x2013; 50% reduction (rate ratio of 0.5&#x2013;0.9). To capture these declines, we generated a vector, 
                    <italic toggle="yes">
                        <bold>
                            <italic toggle="yes">v</italic>
                        </bold>
                    </italic>
                    <sub>t</sub>, with entries equal to 0 at the time of vaccine introduction and decreasing linearly to the log(Rate-Ratio-Final) over 24 months. Simulated counts, 
                    <italic toggle="yes">Y</italic>
                    <sub>
                        <italic toggle="yes">t</italic>,sim</sub>, were generated by taking a random draw from the Poisson distribution with mean 
                    <inline-formula>
                        <mml:math display="inline" id="M3">
                            <mml:mrow>
                                <mml:msup>
                                    <mml:mi>e</mml:mi>
                                    <mml:mrow>
                                        <mml:msub>
                                            <mml:mover accent="true">
                                                <mml:mi>&#x03bc;</mml:mi>
                                                <mml:mo>^</mml:mo>
                                            </mml:mover>
                                            <mml:mi>t</mml:mi>
                                        </mml:msub>
                                        <mml:mo>+</mml:mo>
                                        <mml:msub>
                                            <mml:mover accent="true">
                                                <mml:mi>&#x03d5;</mml:mi>
                                                <mml:mo>^</mml:mo>
                                            </mml:mover>
                                            <mml:mi>t</mml:mi>
                                        </mml:msub>
                                        <mml:mo>+</mml:mo>
                                        <mml:msub>
                                            <mml:mi>v</mml:mi>
                                            <mml:mi>t</mml:mi>
                                        </mml:msub>
                                    </mml:mrow>
                                </mml:msup>
                                <mml:mo>.</mml:mo>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula> The simulated counts reflect uncertainty in the regression parameters, unexplained variability in the data, as well as uncertainty from the observation process. The simulated counts from the post-vaccine period were combined with the observed counts from the pre-vaccine period. Therefore, the power calculations are conditional on the observed pre-vaccine data.</p>
            </sec>
            <sec>
                <title>Estimation of vaccine effects</title>
                <p>We next estimated the vaccine effect using a regression model similar to the one used to generate the data. The outcome was the (simulated) number of counts per month. As above, we adjusted for seasonality using 6- and 12-month harmonic terms, and secular trends were captured using an index for time. The vaccine effect was quantified using a linear spline term that began at the time of vaccine introduction and continued for 24 months before stabilizing. An observation-level random intercept was included to capture overdispersion of the count data. Using the fitted model, we calculated the estimated rate ratio 24 months after vaccine introduction as 24*(coefficient for the vaccine effect term). To evaluate how many years of pre-vaccine data are needed to estimate the effects, we sequentially removed the first 1, 2, or 3 years of data and evaluated the effect on power. Coverage of the 95% confidence intervals (alpha=0.05) were used to assess power.</p>
            </sec>
            <sec>
                <title>Data and availability</title>
                <p>All of the time series data and code used in these analyses are available from a Github repository 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/weinbergerlab/PoissonITS_power">https://github.com/weinbergerlab/PoissonITS_power</ext-link>. The interactive tool, along with a sample dataset, can be accessed at 
                    <ext-link ext-link-type="uri" xlink:href="https://weinbergerlab.shinyapps.io/ITS_Poisson_Power">https://weinbergerlab.shinyapps.io/ITS_Poisson_Power</ext-link>.</p>
            </sec>
        </sec>
        <sec sec-type="results">
            <title>Results</title>
            <sec>
                <title>Interactive tool to estimate power</title>
                <p>Because the power to detect a change in a time series is influenced by the expected effect size, the amount of unexplained variation in the data, and the number of years of data available, it can be difficult to make general statements about power. However, observed time series from the pre-vaccine period can be used to simulate time series to perform a best-case power calculation. This can provide an indication of whether it is worth performing an analysis or whether collecting additional data (e.g., additional pre-vaccine time points) could be helpful. We provide a simple &#x2018;point-and-click&#x2019; interface where the user provides a time series in a csv or Excel format, indicates which columns contain the date variable, the outcome, and any potential controls, and the date at which the intervention is introduced (
                    <xref ref-type="fig" rid="f1">Figure 1</xref>). Controls are time series that share important characteristics with the outcome time series. Relevant controls could include population size, all-cause hospitalizations, or other specific causes of disease that share similar trends and are not influenced by the intervention.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>Figure 1. </label>
                    <caption>
                        <title>User interface for power calculation.</title>
                        <p>The user uploads a time series, specifies the expected decline in terms of a rate ratio, specifies the key variables (date, outcome of interest, and controls), the date of the intervention, and the number of simulations to generate. A sample dataset can be downloaded by clicking the button at the top of the screen.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14384/30132de9-7331-40c9-a565-d69d06d32d40_figure1.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Characteristics of the state-level data from Brazil</title>
                <p>As a demonstration of this approach, we apply this simulation framework to data from Brazil, disaggregated to different subnational levels (state, region). The size of the population varies drastically by state, from 450,000 to 41 million individuals (in 2010). On average there were 30-1900 hospitalizations due to pneumonia per month per state among children &lt;12 m and 12-1100 hospitalizations per month per state among adults 80+ years of age during the pre-vaccine period. The time series for the &lt;12m old children were highly seasonal but without a strong long-term trend, while the time series for the 80+ year olds increased markedly starting in the pre-vaccine period. We simulated time series for each of the states that had similar characteristics to the observed time series in the pre-vaccine period but with vaccine effects of different magnitudes (
                    <xref ref-type="fig" rid="f2">Figure 2</xref>).</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>Figure 2. </label>
                    <caption>
                        <p>Sample simulated monthly time series of hospitalizations due to all-cause pneumonia for adults 80&#x00b1; years of age from a small state (
                            <bold>A,B</bold>) and a large state (
                            <bold>C,D</bold>) in Brazil with a 50% decline post-vaccination (
                            <bold>A,C</bold>) or 10% (
                            <bold>B,D</bold>).</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14384/30132de9-7331-40c9-a565-d69d06d32d40_figure2.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Effect of number of cases and random noise on ability to accurately to detect a decline</title>
                <p>We first evaluate the relationship between the amount of unexplained variability in the data and the ability to accurately estimate the effect of the vaccine. There is a clear relationship between the amount of unexplained variability in the data and the power to detect a vaccine-associated change (
                    <xref ref-type="fig" rid="f3">Figure 3A</xref>). This trend was consistent across all of the states between both children and adults.</p>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>Figure 3. </label>
                    <caption>
                        <p>Relationship between power to detect a decline associated with vaccine introduction and (
                            <bold>A</bold>) the amount of unexplained variation in the time series or (
                            <bold>B</bold>) the average number of cases per month for different specified magnitudes of vaccine effects. The labels at the top of the panel indicate the magnitude of the expected vaccine effect, with an incidence rate ratio (IRR) of 0.5 representing a 50% decline associated with the vaccine and a IRR of 0.9 equal to a 10% decline. Each dot represents the power for one state in Brazil. The black triangles represent estimates for adults 80+ years of age, and the gray circles represent estimates for children &lt;12 months of age.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14384/30132de9-7331-40c9-a565-d69d06d32d40_figure3.gif"/>
                </fig>
                <p>Plotting the estimated power against the average number of hospitalizations in the state/region, there is also a relationship, but the trend differed between children and adults (
                    <xref ref-type="fig" rid="f3">Figure 3B</xref>). This is because the amount of unexplained variability was higher in the &lt;12m old children than in the 80+ year old adults (
                    <italic toggle="yes">Extended data:</italic> Figure S1)</p>
            </sec>
            <sec>
                <title>Effect of number of years of data on ability to accurately to detect a decline</title>
                <p>With fewer years of baseline data, the power to detect a change in disease rates associated with the vaccine also declines. For datasets with little unexplained variability, even with just 12 months of pre-vaccine data, there could be high power to detect a vaccine-associated decline of 20%. However, when there is more unexplained variability in the time series, power declines with shorter pre-vaccine periods (
                    <xref ref-type="fig" rid="f4">Figure 4</xref>). These declines in power are particularly dramatic for time series with intermediate levels of unexplained variability (
                    <xref ref-type="fig" rid="f4">Figure 4</xref>).</p>
                <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                    <label>Figure 4. </label>
                    <caption>
                        <title>Effect of shortened baseline periods on the relationship between the power to detect a decline associated with vaccine introduction and the amount of unexplained variation in the time series.</title>
                        <p>Each dot represents the power for one state/age group in Brazil. Dots with lighter colors had fewer years of data.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14384/30132de9-7331-40c9-a565-d69d06d32d40_figure4.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Demonstration of the interface</title>
                <p>As a demonstration of the point-and-click interface, we use hospitalization data from Chile among children &lt;24 months (raw data available from 
                    <ext-link ext-link-type="uri" xlink:href="http://www.deis.cl/">http://www.deis.cl/</ext-link>)
                    <sup>
                        <xref ref-type="bibr" rid="ref-6">6</xref>
                    </sup>. This sample time series can be downloaded directly from the interface. The outcome variable is the number of hospitalizations per month due to all-cause pneumonia (J12_18) for 2003&#x2013;2014. The number of non-respiratory hospitalizations per month (ach_noj) is included as a control time series. If no control is present, this field can be left blank. The date of vaccine introduction is set to January 1, 2011. The program generates a specified number of simulated post-vaccine time series (N) based on the pre-intervention data (
                    <xref ref-type="fig" rid="f5">Figure 5A</xref>). With two years of post-vaccine data (vaccine introduction in 2011, evaluating through 2012), the 500 estimates of the rate ratio are centered on the true value (indicated by a red dashed line), with a moderate degree of uncertainty (
                    <xref ref-type="fig" rid="f5">Figure 5B</xref>). This yields 63% power to detect a rate ratio of 0.8 (
                    <xref ref-type="fig" rid="f5">Figure 5B</xref>). Compared with the analyses of the Brazil series, the power is reasonable given the amount of unexplained variability in the data but could be increased (
                    <xref ref-type="fig" rid="f5">Figure 5C</xref>). This can be seen in the simulation by increasing the length of the evaluation period by a year (e.g., through 2013) (Extended data: Figure S2). The power in this instance would increase from 63% to 83% with an improvement in the precision of the estimates.</p>
                <fig fig-type="figure" id="f5" orientation="portrait" position="float">
                    <label>Figure 5. </label>
                    <caption>
                        <title>Sample output from the interactive interface using hospitalization data for Chile under 2 years of age (data can be obtained by clicking the download button on the interface).</title>
                        <p>The upper left panel shows the 100 simulated time series. The upper right panel shows the estimates of the rate ratio for each of the 100 simulations. The true specified rate ratio (0.8) is denoted by a red dashed line. 63% of the estimates had 95% confidence intervals that did not cross 1. The bottom left panel shows the estimate of power for this study (red dot) compared with the estimates from the Brazil states with different length baseline periods.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14384/30132de9-7331-40c9-a565-d69d06d32d40_figure5.gif"/>
                </fig>
            </sec>
        </sec>
        <sec sec-type="discussion">
            <title>Discussion</title>
            <p>In this study, we describe a simple interface for conducting simulations to evaluate the power to detect a vaccine-associated decline from time series data. This approach provides analysts a simple best-case scenario for determining whether they are likely to detect specified vaccine effects with the data on hand or whether collecting additional pre- or post- vaccine data would be beneficial. This type of tool should be used when planning analyses and prior to conducting a formal evaluation analysis with the data on hand.</p>
            <p>By analyzing subnational data from Brazil, we demonstrate how power varies with the number of cases and the degree of unexplained variability in the data. Reducing unexplained variability in the data by using time-varying covariates can help to increase power. Such covariate could include other causes of disease/hospitalization/death or known correlates of changes in disease rates (e.g., percent of the population with access to healthcare).</p>
            <p>These analyses evaluate power based on the statistical characteristics of the time series. As with any analysis, failure to correctly control for relevant trends will also introduce important biases and could greatly outweigh the issues related to statistical characteristics of the data. For instance, if there is a non-linear trend that is not well-captured by an interrupted time series analysis, the vaccine effect could be substantially over- or under-estimated.</p>
            <p>The estimates generated with this approach represent a &#x2018;best-case&#x2019; scenario where we know the exact date of vaccine introduction and where all non-vaccine-associated changes are linear and can be controlled with a simple model. In reality, numerous factors can influence pneumonia hospitalization rates. The use of control variables can help to adjust for these, but often remain unexplained factors that cannot be easily adjusted.</p>
            <p>In the statistical model used here, the control variable(s) are included as covariates in the regression framework. An alternative approach would be to pick a single covariate and include it as an offset term
                <sup>
                    <xref ref-type="bibr" rid="ref-7">7</xref>
                </sup>. There are advantages and disadvantages to using the control variables as covariates rather than an offset term (which effectively includes the control variable as a covariate but fixes the regression coefficient to one). Error in the control time series can lead to inadequate control of underlying trends
                <sup>
                    <xref ref-type="bibr" rid="ref-8">8</xref>
                </sup>. This is particularly an issue with data from small regions with few cases of disease. Using a control as an offset ensures that the trend is captured. However, using the control variable as an offset term can introduce biases when the control itself experiences major changes that are caused by a factor that does not reflect the outcome. By allowing the regression coefficient to be estimated instead of fixed at one, it guards against this by giving less weight to controls that do not reflect changes in the outcome.</p>
            <p>In our analyses here and for the online tool, we use independent observation-level random effects. When analyzing time series data, there is often correlation between observations across time. Adjusting for seasonality and time-varying covariates can reduce this autocorrelation considerably in many settings, though residual autocorrelation can persist. An alternative approach would be to use an autoregressive model for the random effects. We have found that the autoregressive model of order one (AR(1)) for the random effects can lead to identifiability issues (similar to issues observed with spatial models
                <sup>
                    <xref ref-type="bibr" rid="ref-9">9</xref>
                </sup>) where introduction of the correlated random effects biases the estimation of trend or control variables. Depending on the characteristics of the underlying trends in the time series, this can bias estimates of vaccine impact.</p>
            <p>The analyses presented here assume a gradual decline in disease rates following an intervention, which plateaus after 24 months. This would be a reasonable assumption for a new vaccine being introduced for routine use. In the case of a vaccine campaign, the change might be much more abrupt and resemble a step change. The interactive app allows the user to specify the length of time that passes before disease rates stabilize.</p>
            <p>We summarize the results of these simulations in terms of statistical power (i.e., what percentage of simulations yielded a statistically significant effect when an actual non-zero effect was present). In practice, we typically avoid describing evaluations of vaccine impact made using observational time series data in terms of statistical significance. It is often more informative to instead describe the estimate of vaccine impact and the strength of the evidence/precision of the estimates. These types of analyses are rarely used for making dichotomous policy decision (e.g., licensure), so using an arbitrary threshold for declaring whether a vaccine &#x2018;works&#x2019; is not needed.</p>
            <p>In conclusion, we present a simple framework for evaluating the power to detect vaccine-associated declines of a specified magnitude. This approach can help in planning for an evaluation study and for understanding differences between studies.</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <sec>
                <title>Underlying data</title>
                <p>The Brazilian dataset can be accessed by contacting the Ministry of Health (Minist&#x00e9;rio da Sa&#x00fa;de) directly via 
                    <ext-link ext-link-type="uri" xlink:href="http://portalms.saude.gov.br/">http://portalms.saude.gov.br</ext-link>.</p>
                <p>The Chilean dataset can be accessed from the Chilean Department of Statistics website 
                    <ext-link ext-link-type="uri" xlink:href="http://www.deis.cl">http://www.deis.cl</ext-link> Time series data and code available from: 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/weinbergerlab/PoissonITS_power">https://github.com/weinbergerlab/PoissonITS_power</ext-link>
                </p>
                <p>Archived data and code as at time of publication: 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/weinbergerlab/PoissonITS_power">https://github.com/weinbergerlab/PoissonITS_power</ext-link>
                </p>
                <p>License: CC0</p>
            </sec>
            <sec>
                <title>Extended data</title>
                <p>Figshare: Estimating the power to detect a change caused by a vaccine from time series data,</p>
                <p>This project contains the following extended data:</p>
                <list list-type="bullet">
                    <list-item>
                        <label>-</label>
                        <p>
                            <bold>Figure S1.</bold> Relationship between amount of unexplained variability in the data and the average number of cases per month for each of the Brazilian states for children &lt;12 months of age (gray circles) and adults 80+ years of age (black triangles). 
                            <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.11908143">https://doi.org/10.6084/m9.figshare.11908143</ext-link>
                            <sup>
                                <xref ref-type="bibr" rid="ref-10">10</xref>
                            </sup>
                        </p>
                    </list-item>
                    <list-item>
                        <label/>
                        <p>-	
                            <bold>Figure S2.</bold> Sample output from the interactive interface using hospitalization data for Chile under 2 years of age, where the date of introduction is shifted earlier by 12 months (to January 1, 2010). In comparison to 
                            <xref ref-type="fig" rid="f4">Figure 4</xref>, (where a date of introduction of January 1, 2011 is used), the estimates are more precise, and the power is higher. 
                            <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.11908158.v2">https://doi.org/10.6084/m9.figshare.11908158.v2</ext-link>
                            <sup>
                                <xref ref-type="bibr" rid="ref-11">11</xref>
                            </sup>
                        </p>
                    </list-item>
                </list>
                <p>Data are available under the terms of the 
                    <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/legalcode">Creative Commons Attribution 4.0 International license</ext-link> (CC-BY 4.0).</p>
            </sec>
        </sec>
        <sec>
            <title>Software availability</title>
            <p>Interactive tool available from: 
                <ext-link ext-link-type="uri" xlink:href="https://weinbergerlab.shinyapps.io/ITS_Poisson_Power">https://weinbergerlab.shinyapps.io/ITS_Poisson_Power</ext-link>
            </p>
            <p>Source code available from: 
                <ext-link ext-link-type="uri" xlink:href="https://github.com/weinbergerlab/PoissonITS_power">https://github.com/weinbergerlab/PoissonITS_power</ext-link>
            </p>
            <p>Archived source code as at time of publication: 
                <ext-link ext-link-type="uri" xlink:href="https://zenodo.org/badge/latestdoi/177850456">https://zenodo.org/badge/latestdoi/177850456</ext-link>
            </p>
            <p>License: CC-BY 4.0</p>
        </sec>
    </body>
    <back>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bernal</surname>
                            <given-names>JL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cummins</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gasparrini</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>Interrupted time series regression for the evaluation of public health interventions: a tutorial.</article-title>
                    <source>

                        <italic toggle="yes">Int J Epidemiol.</italic>
</source>
                    <year>2017</year>;<volume>46</volume>(<issue>1</issue>):<fpage>348</fpage>&#x2013;<lpage>55</lpage>.
                    <pub-id pub-id-type="pmid">27283160</pub-id>
                    <pub-id pub-id-type="doi">10.1093/ije/dyw098</pub-id>
                    <pub-id pub-id-type="pmcid">5407170</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wagner</surname>
                            <given-names>AK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ross-Degnan</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Simulation-based power calculation for designing interrupted time series analyses of health policy interventions.</article-title>
                    <source>

                        <italic toggle="yes">J Clin Epidemiol.</italic>
</source>
                    <year>2011</year>;<volume>64</volume>(<issue>11</issue>):<fpage>1252</fpage>&#x2013;<lpage>61</lpage>.
                    <pub-id pub-id-type="pmid">21640554</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.jclinepi.2011.02.007</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ye</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Barton</surname>
                            <given-names>BA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Simulation-based power and sample size calculation for designing interrupted time series analyses of count outcomes in evaluation of health policy interventions.</article-title>
                    <source>

                        <italic toggle="yes">Contemp Clin Trials Commun.</italic>
</source>
                    <year>2020</year>;<volume>17</volume>: 100474.
                    <pub-id pub-id-type="pmid">31886433</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.conctc.2019.100474</pub-id>
                    <pub-id pub-id-type="pmcid">6920506</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Warren</surname>
                            <given-names>JL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Shioda</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>K&#x00fc;r&#x00fc;m</surname>
                            <given-names>E</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Impact of Pneumococcal Conjugate Vaccines on Pneumonia Hospitalizations in High- and Low-Income Subpopulations in Brazil.</article-title>
                    <source>

                        <italic toggle="yes">Clin Infect Dis.</italic>
</source>
                    <year>2017</year>;<volume>65</volume>(<issue>11</issue>):<fpage>1813</fpage>&#x2013;<lpage>8</lpage>.
                    <pub-id pub-id-type="pmid">29020195</pub-id>
                    <pub-id pub-id-type="doi">10.1093/cid/cix638</pub-id>
                    <pub-id pub-id-type="pmcid">5848248</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hyndman</surname>
                            <given-names>RJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Athanasopoulos</surname>
                            <given-names>G</given-names>
                        </name>
</person-group>:
                    <article-title>Forecasting: principles and practice</article-title>. chapter 5.4. OTexts,<year>2018</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://books.google.co.in/books?hl=en&amp;lr=&amp;id=_bBhDwAAQBAJ&amp;oi=fnd&amp;pg=PA1">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bruhn</surname>
                            <given-names>CA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hetterich</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Schuck-Paim</surname>
                            <given-names>C</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Estimating the population-level impact of vaccines using synthetic controls.</article-title>
                    <source>

                        <italic toggle="yes">Proc Natl Acad Sci U S A.</italic>
</source>
                    <year>2017</year>;<volume>114</volume>(<issue>7</issue>):<fpage>1524</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="pmid">28154145</pub-id>
                    <pub-id pub-id-type="doi">10.1073/pnas.1612833114</pub-id>
                    <pub-id pub-id-type="pmcid">5321019</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bottomley</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Scott</surname>
                            <given-names>JAG</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Isham</surname>
                            <given-names>V</given-names>
                        </name>
</person-group>:
                    <article-title>Analysing interrupted time series with a control.</article-title>
                    <source>

                        <italic toggle="yes">Epidemiologic Methods.</italic>
</source>
                    <year>2019</year>;<volume>8</volume>(<issue>1</issue>).
                    <pub-id pub-id-type="doi">10.1515/em-2018-0010</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Shioda</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Schuck-Paim</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Taylor</surname>
                            <given-names>RJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Challenges in estimating the impact of vaccination with sparse data.</article-title>
                    <source>

                        <italic toggle="yes"> Epidemiology.</italic>
</source>
                    <year>2019</year>;<volume>30</volume>(<issue>1</issue>):<fpage>61</fpage>&#x2013;<lpage>68</lpage>.
                    <pub-id pub-id-type="pmid">30334918</pub-id>
                    <pub-id pub-id-type="doi">10.1097/EDE.0000000000000938</pub-id>
                    <pub-id pub-id-type="pmcid">6276862</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hodges</surname>
                            <given-names>JS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Reich</surname>
                            <given-names>BJ</given-names>
                        </name>
</person-group>:
                    <article-title>Adding spatially-correlated errors can mess up the fixed effect you love.</article-title>
                    <source>

                        <italic toggle="yes">Am Stat.</italic>
</source>
                    <year>2010</year>;<volume>64</volume>(<issue>4</issue>):<fpage>325</fpage>&#x2013;<lpage>34</lpage>.
                    <pub-id pub-id-type="doi">10.1198/tast.2010.10052</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-10">
                <label>10</label>
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Weinberger</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Extended Data Figure S1.</article-title>
                    <source>

                        <italic toggle="yes">figshare.</italic>
</source>Figure.<year>2020</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.doi.org/10.6084/m9.figshare.11908143.v1">http://www.doi.org/10.6084/m9.figshare.11908143.v1</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-11">
                <label>11</label>
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Weinberger</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Extended Data Figure S2.</article-title>
                    <source>

                        <italic toggle="yes">figshare.</italic>
</source>Figure.<year>2020</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.doi.org/10.6084/m9.figshare.11908158.v2">http://www.doi.org/10.6084/m9.figshare.11908158.v2</ext-link>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report29849">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/gatesopenres.14384.r29849</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Bottomley</surname>
                        <given-names>Christian</given-names>
                    </name>
                    <xref ref-type="aff" rid="r29849a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r29849a1">
                    <label>1</label>Department of Infectious Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, UK</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>21</day>
                <month>10</month>
                <year>2020</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2020 Bottomley C</copyright-statement>
                <copyright-year>2020</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport29849" related-article-type="peer-reviewed-article" xlink:href="10.12688/gatesopenres.13116.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors have provided a thorough and thoughful reponse to my comments. The revised web-based tool seems to work well on the example dataset and I look forward to using it to plan future studies.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Partly</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report28669">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/gatesopenres.14294.r28669</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Levy</surname>
                        <given-names>Corinne</given-names>
                    </name>
                    <xref ref-type="aff" rid="r28669a1">1</xref>
                    <xref ref-type="aff" rid="r28669a3">3</xref>
                    <role>Referee</role>
                </contrib>
                <contrib contrib-type="author">
                    <name>
                        <surname>Ouldali</surname>
                        <given-names>Naim</given-names>
                    </name>
                    <xref ref-type="aff" rid="r28669a1">1</xref>
                    <xref ref-type="aff" rid="r28669a2">2</xref>
                    <role>Co-referee</role>
                </contrib>
                <aff id="r28669a1">
                    <label>1</label>ACTIV, Saint-Maur-des-Foss&#x00e9;s, France</aff>
                <aff id="r28669a2">
                    <label>2</label>Unit&#x00e9; d&#x2019;&#x00e9;pid&#x00e9;miologie clinique, Assistance Publique-H&#x00f4;pitaux de Paris, H&#x00f4;pital Robert Debr&#x00e9;, Paris, France</aff>
                <aff id="r28669a3">
                    <label>3</label>Universit&#x00e9; Paris Est, IMRB-GRC GEMINI, Cr&#x00e9;teil, France</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>14</day>
                <month>4</month>
                <year>2020</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2020 Ouldali N and Levy C</copyright-statement>
                <copyright-year>2020</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport28669" related-article-type="peer-reviewed-article" xlink:href="10.12688/gatesopenres.13116.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>In this relevant paper, the authors try to estimate the power to detect a change (a decline) caused by a vaccine from time series data. To implement this approach, they created an interactive web-based tool. They used data on pneumonia from Brazil before pneumococcal conjugate vaccine introduction to generate the simulated time series and to demonstrate the use of this approach.</p>
            <p> Interestingly, whether in times series analysis or in before after studies, the specific question of power is rarely raised and from our point of view, this is the strength of this work. This relevant information can have an added value to perform impact studies. Although the results the authors presented were expected, such as increasing power when increasing years of surveillance before the intervention, the interest of this work is to provide the demonstration of this effect, and to allow quantification of this power.</p>
            <p> </p>
            <p> For administrative data bases, we understand the usefulness of this tool. However, many surveillance systems are specifically set up to prospectively follow the evolution of an outcome after vaccine implementation, and adding a posteriori several years of surveillance before the vaccine to improve power when analyzing data is clearly not possible. With the proposed interactive tool, the dataset has to be downloaded to perform the power analysis, meaning that the study is already finished.</p>
            <p> </p>
            <p> But of course, the major interest of a power calculation lies in the fact that it is calculated a priori and not after the data were collected. Thus, we were not convinced by: &#x201c;This type of tool should be used when planning analyses and prior to conducting a formal evaluation analysis with the data on hand.&#x201d; nor by &#x201c;This approach can help in planning for an evaluation study and for understanding differences between studies.&#x201d;</p>
            <p> </p>
            <p> Several revisions should be made to strengthen the current paper: 
                <list list-type="bullet">
                    <list-item>
                        <p>First of all, the authors should add limitations of their study.</p>
                    </list-item>
                    <list-item>
                        <p>To allow a priori power comutation, one suggestion would be the possibility of adding a module to simulate data based on expected population sizes with assumed variability based on previous studies and a defined number of years. This would allow adjusting the number of baseline years needed before the intervention and would be really helpful to appropriately assess public health intervention impact.</p>
                    </list-item>
                    <list-item>
                        <p>Another model than Poisson regression could also be proposed such as ARIMA or negative binomial regression modelling.</p>
                    </list-item>
                    <list-item>
                        <p>How was the adjustment made on the control outcome?</p>
                    </list-item>
                    <list-item>
                        <p>Inclusion of an ARMA term in the model should also be considered, adjusting for seasonality may not be always sufficient to capture all the autocorrelation of the data</p>
                    </list-item>
                    <list-item>
                        <p>The evaluation of the residuals, particularly with correlograms, is also an important point</p>
                    </list-item>
                    <list-item>
                        <p>The authors should improve the clarity of their R scripts and should add explanatory text so that non-experts can better understand the model building.</p>
                    </list-item>
                </list>
            </p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Partly</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Partly</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Partly</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Pediatric infectious disease epidemiology</p>
            <p>We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report28685">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/gatesopenres.14294.r28685</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Bottomley</surname>
                        <given-names>Christian</given-names>
                    </name>
                    <xref ref-type="aff" rid="r28685a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r28685a1">
                    <label>1</label>Department of Infectious Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, UK</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>27</day>
                <month>3</month>
                <year>2020</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2020 Bottomley C</copyright-statement>
                <copyright-year>2020</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport28685" related-article-type="peer-reviewed-article" xlink:href="10.12688/gatesopenres.13116.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>In their paper Weinberger and Warren describe a simulation-based approach for calculating statistical power in time series studies of vaccine impact. They apply their methodology to data from Brazil and Chile, using the latter to demonstrate a web-based interactive tool they have developed.</p>
            <p> I agree with the premise of the paper that power calculations may have an important role to play in designing vaccine impact studies, particularly in deciding how much pre and post vaccine data needs to be collected. And the web-based tool is potentially a useful tool for researchers planning a vaccine impact study.</p>
            <p> However, my major concern is that in its current form the tool has limited flexibility to evaluate power for different amounts of pre and post vaccine data. For example, if a researcher wants to know the power associated with different lengths of post-vaccine surveillance, given they have, say, three years of pre-vaccine surveillance, can they use the tool to calculate power for these scenarios? Based on my understanding of the Chile example, the amount of post-intervention data can only be increased if the pre-intervention data is reduced. This is a significant constraint since it is hard to imagine why a researcher would not want to use all the pre-vaccine data available.</p>
            <p> My recommendation would therefore be to adapt the current version of this tool to include numbers of pre and post vaccine observations as additional parameters.&#x00a0; In theory this should be straightforward to do by:</p>
            <p> &#x00a0; 
                <list list-type="order">
                    <list-item>
                        <p>Fitting the model to pre-vaccine data</p>
                    </list-item>
                    <list-item>
                        <p>Using the resulting parameter estimates, and an assumed vaccine impact, to simulate additional pre-vaccine and post-vaccine data</p>
                    </list-item>
                    <list-item>
                        <p>Estimating vaccine impact based on the combined (observed + simulated) data</p>
                    </list-item>
                    <list-item>
                        <p>Using vaccine impact estimates from multiple simulations to estimate power.&#x00a0;&#x00a0;</p>
                    </list-item>
                </list> I have proposed combining observed and simulated data so that the estimated power is conditional on the observed data. An alternative approach would be to simulate all the pre-vaccine data to give an unconditional estimate, but I think this estimate is probably less useful given that the final analysis will almost certainly make use of the pre-vaccine data used in the power calculation.</p>
            <p> Some additional observations:&#x00a0; 
                <list list-type="order">
                    <list-item>
                        <p>In equation 3, the second harmonic is written in terms of 2 sine functions. I think this is a typo and one of these functions should be a cosine. Also, a reference for harmonic regression (e.g. chapter 5.4 in Forecasting Principles and Practice by Hyndman &amp; Athanasopoulos) might be useful.</p>
                    </list-item>
                    <list-item>
                        <p>The tool allows for the inclusion of a control series. This is a nice feature, but it wasn&#x2019;t clear to me how the relationship between the two time series is specified? Is the control included as a covariate in the regression model? In a recent paper (Bottomley et al. 2019) we argue that because the control is effectively trend + error it should be included as an offset term, otherwise confounding due to trend is not adequately accounted for.&#x00a0;</p>
                    </list-item>
                    <list-item>
                        <p>An important assumption of the model is that the observations are independent. This is a big assumption because time series are often autocorrelated. One way to relax the assumption would be to model the random effect (phi_t) as an autoregressive process. This might make fitting the model more complicated so I can see that you might not want to go down this route, but at a minimum the issue of autocorrelation should be mentioned in the discussion.</p>
                    </list-item>
                    <list-item>
                        <p>Statistical power must be defined with respect to a particular significance level. I presume the calculations are based on a 5% significance level (2-sided test), but I didn&#x2019;t see this stated anywhere.</p>
                    </list-item>
                    <list-item>
                        <p>The model assumes a linearly increasing intervention effect on the log scale.&#x00a0; This seems reasonable for a vaccine that is introduced into a childhood vaccine programme without a catchup campaign. But what about vaccines that are introduced with a catchup campaign? A step function might be more appropriate for this scenario. The assumed mode of introduction should be mentioned in the discussion or the tool could be modified to allow for different modes of introduction.</p>
                    </list-item>
                </list> Some typos in the web interface: 
                <list list-type="order">
                    <list-item>
                        <p>In the section &#x201c;Results&#x201d;, delete &#x201c;N&#x201d; from the sentence &#x201c;The second plot shows the N estimate&#x201d;.</p>
                    </list-item>
                    <list-item>
                        <p>In the section &#x201c;Sample Data&#x201d; missing &#x201c;r&#x201d; in children &#x201c;These time series represent the number of cases among childen &lt;24 months of age&#x201d;.</p>
                    </list-item>
                </list>
            </p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Partly</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-28685-1">
                    <label>1</label>
                    <mixed-citation>
                        <person-group person-group-type="author"/>:
                        <article-title>Forecasting: principles and practice</article-title>.
                        <source>
                            <italic>OTexts</italic>
                        </source>.<year>2018</year>;
                        <ext-link ext-link-type="uri" xlink:href="https://otexts.com/fpp2/">Reference source</ext-link>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-28685-2">
                    <label>2</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Analysing Interrupted Time Series with a Control</article-title>.
                        <source>
                            <italic>Epidemiologic Methods</italic>
                        </source>.<year>2019</year>;<volume>8</volume>(<issue>1</issue>) :
                        <elocation-id>10.1515/em-2018-0010</elocation-id>
                        <pub-id pub-id-type="doi">10.1515/em-2018-0010</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment3350-28685">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Weinberger</surname>
                            <given-names>Daniel</given-names>
                        </name>
                        <aff/>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>26</day>
                    <month>8</month>
                    <year>2020</year>
                </pub-date>
            </front-stub>
            <body>
                <p>We thank Dr. Bottomley for the constructive feedback on the analyses. We have responded by making all analyses conditional on the observed pre-vaccine data. Additionally, we have modified the user interface so that the user can modify the number of pre- and post-vaccine time points used when estimating power.</p>
                <p> </p>
                <p> </p>
                <p> </p>
                <p> 
                    <italic>Reviewer #1</italic>
                </p>
                <p>
                    <italic> &#x00a0;</italic>
                </p>
                <p>
                    <italic> In their paper Weinberger and Warren describe a simulation-based approach for calculating statistical power in time series studies of vaccine impact. They apply their methodology to data from Brazil and Chile, using the latter to demonstrate a web-based interactive tool they have developed.</italic>
                </p>
                <p>
                    <italic> &#x00a0;</italic>
                </p>
                <p>
                    <italic> I agree with the premise of the paper that power calculations may have an important role to play in designing vaccine impact studies, particularly in deciding how much pre and post vaccine data needs to be collected. And the web-based tool is potentially a useful tool for researchers planning a vaccine impact study.</italic>
                </p>
                <p>
                    <italic> &#x00a0;</italic>
                </p>
                <p>
                    <italic> However, my major concern is that in its current form the tool has limited flexibility to evaluate power for different amounts of pre and post vaccine data. For example, if a researcher wants to know the power associated with different lengths of post-vaccine surveillance, given they have, say, three years of pre-vaccine surveillance, can they use the tool to calculate power for these scenarios? Based on my understanding of the Chile example, the amount of post-intervention data can only be increased if the pre-intervention data is reduced. This is a significant constraint since it is hard to imagine why a researcher would not want to use all the pre-vaccine data available.&#x00a0;</italic>
                </p>
                <p>
                    <italic> My recommendation would therefore be to adapt the current version of this tool to include numbers of pre and post vaccine observations as additional parameters.&#x00a0; In theory this should be straightforward to do by:</italic> 
                    <list list-type="order">
                        <list-item>
                            <p>
                                <italic>Fitting the model to pre-vaccine data</italic>
                            </p>
                            <p>
                                <italic> &#x00a0;</italic>
                            </p>
                        </list-item>
                        <list-item>
                            <p>
                                <italic>Using the resulting parameter estimates, and an assumed vaccine impact, to simulate additional pre-vaccine and post-vaccine data</italic>
                            </p>
                            <p>
                                <italic> &#x00a0;</italic>
                            </p>
                        </list-item>
                        <list-item>
                            <p>
                                <italic>Estimating vaccine impact based on the combined (observed + simulated) data</italic>
                            </p>
                            <p>
                                <italic> &#x00a0;</italic>
                            </p>
                        </list-item>
                        <list-item>
                            <p>
                                <italic>Using vaccine impact estimates from multiple simulations to estimate power.&#x00a0;&#x00a0;</italic>
                            </p>
                        </list-item>
                    </list> 
                    <italic>I have proposed combining observed and simulated data so that the estimated power is conditional on the observed data. An alternative approach would be to simulate all the pre-vaccine data to give an unconditional estimate, but I think this estimate is probably less useful given that the final analysis will almost certainly make use of the pre-vaccine data used in the power calculation.</italic>
                </p>
                <p> 
                    <bold>&#x00a0;</bold>
                </p>
                <p> 
                    <bold>&#x00a0;</bold>
                    <bold>&#x00a0;</bold>
                </p>
                <p> 
                    <bold>This is an excellent suggestion, and we have implemented the suggested change in the Shiny App, and we have also updated the analyses in the main manuscript to be conditional on the observed pre-vaccine data. The user can now specify the number of months of post-intervention data (and pre-intervention data) using sliders, and then power is conditional on the observed data from the pre-vaccine period. The updated tool is available at 
                        <ext-link ext-link-type="uri" xlink:href="https://weinbergerlab.shinyapps.io/ITS_Poisson_Power/?_ga=2.146775735.247258148.1597801540-69302913.1596476977">https://weinbergerlab.shinyapps.io/ITS_Poisson_Powe</ext-link>r</bold>
                </p>
                <p> 
                    <bold>&#x00a0;</bold>
                </p>
                <p> 
                    <bold>&#x00a0;</bold>
                </p>
                <p> 
                    <italic>Some additional observations:&#x00a0;</italic> 
                    <list list-type="order">
                        <list-item>
                            <p>
                                <italic>In equation 3, the second harmonic is written in terms of 2 sine functions. I think this is a typo and one of these functions should be a cosine. Also, a reference for harmonic regression (e.g. chapter 5.4 in Forecasting Principles and Practice by Hyndman &amp; Athanasopoulos) might be useful.</italic>
                            </p>
                        </list-item>
                    </list> 
                    <bold>We have fixed this typo and added the reference, as suggested.</bold>
                </p>
                <p> 
                    <bold>&#x00a0;</bold> 
                    <list list-type="order">
                        <list-item>
                            <p>
                                <italic>The tool allows for the inclusion of a control series. This is a nice feature, but it wasn&#x2019;t clear to me how the relationship between the two time series is specified? Is the control included as a covariate in the regression model? In a recent paper (Bottomley et al. 2019) we argue that because the control is effectively trend + error it should be included as an offset term, otherwise confounding due to trend is not adequately accounted for.&#x00a0;</italic>
                            </p>
                        </list-item>
                    </list> 
                    <bold>Indeed, the log(control+0.5) is included as a covariate, and the user can specify multiple control variables if available. We agree that error in the control time series can lead to inadequate control of underlying trends (see Shioda et al., Epidemiology 2019). In our experience with administrative data, using a control as an offset can introduce biases when the control itself experiences major changes that are caused by a factor that does not reflect the outcome. By allowing the regression coefficient to be estimated instead of fixed at one, it guards against this effect by downweighting controls that do not reflect changes in the outcome. This is an interesting point though for future methodological work to compare these approaches, and we have added a comment on this to the discussion.&#x00a0;&#x00a0;</bold> 
                    <list list-type="order">
                        <list-item>
                            <p>
                                <italic>An important assumption of the model is that the observations are independent. This is a big assumption because time series are often autocorrelated. One way to relax the assumption would be to model the random effect (phi_t) as an autoregressive process. This might make fitting the model more complicated so I can see that you might not want to go down this route, but at a minimum the issue of autocorrelation should be mentioned in the discussion.</italic>
                            </p>
                        </list-item>
                    </list> 
                    <bold>We agree. However, there are two reasons we prefer to use an iid random effect. First, in many instances, after adjusting for seasonality and control time series, the residual autocorrelation is negligible. Second, we have found that AR(1) models can suffer from identifiability issues where inclusion of the correlated random effect biases the estimates of trend or control variables. A similar phenomenon has been described for spatial data: (Hodges JS, Reich BJ. Adding spatially-correlated errors can mess up the fixed effect you love. The American Statistician 2010; 64:325-34). Depending on the characteristics of the underlying trends in the time series, this can bias estimates of vaccine impact. We have not yet found a satisfactory solution to this problem, nor have we been able to determine the time series characteristics that lead to this identifiability issue. This is an area we are actively investigating. We have added a comment to the discussion to indicate that the use of an AR(1) random effect could lead to bias in the primary associations of interest.</bold>
                </p>
                <p> 
                    <bold>&#x00a0;</bold> 
                    <list list-type="order">
                        <list-item>
                            <p>
                                <italic>Statistical power must be defined with respect to a particular significance level. I presume the calculations are based on a 5% significance level (2-sided test), but I didn&#x2019;t see this stated anywhere.</italic>
                            </p>
                        </list-item>
                    </list> 
                    <bold>We have an alpha of 0.05 (2 sided). We have added this detail to the methods section.</bold>
                </p>
                <p> 
                    <bold>&#x00a0;</bold> 
                    <list list-type="order">
                        <list-item>
                            <p>
                                <italic>The model assumes a linearly increasing intervention effect on the log scale. This seems reasonable for a vaccine that is introduced into a childhood vaccine programme without a catchup campaign. But what about vaccines that are introduced with a catchup campaign? A step function might be more appropriate for this scenario. The assumed mode of introduction should be mentioned in the discussion or the tool could be modified to allow for different modes of introduction.</italic>
                            </p>
                        </list-item>
                    </list> 
                    <bold>We agree and have added a comment on this to the discussion. We also modified the app to allow the user to specify the length of time over which the decline occurs. So for a step change, the length of time would be set to 1.&#x00a0;</bold>
                </p>
                <p> 
                    <bold>Some typos in the web interface:</bold> 
                    <list list-type="order">
                        <list-item>
                            <p>
                                <italic>In the section &#x201c;Results&#x201d;, delete &#x201c;N&#x201d; from the sentence &#x201c;The second plot shows the N estimate&#x201d;.</italic>
                            </p>
                            <p>
                                <italic> &#x00a0;</italic>
                            </p>
                        </list-item>
                        <list-item>
                            <p>
                                <italic>In the section &#x201c;Sample Data&#x201d; missing &#x201c;r&#x201d; in children &#x201c;These time series represent the number of cases among childen &lt;24 months of age&#x201d;.</italic>
                            </p>
                        </list-item>
                    </list> 
                    <bold>Thank you, we have fixed these typos.</bold>
                </p>
            </body>
        </sub-article>
    </sub-article>
</article>
