<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">Gates Open Res</journal-id>
            <journal-title-group>
                <journal-title>Gates Open Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2572-4754</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/gatesopenres.13180.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Research Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>On the use of real-time reported mortality data in modelling and analysis during an epidemic outbreak</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 2 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Liljenberg</surname>
                        <given-names>Per</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-2597-953X</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Bill &amp; Melinda Gates Foundation, Seattle, WA, USA</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:per.liljenberg@gatesfoundation.org">per.liljenberg@gatesfoundation.org</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>11</day>
                <month>9</month>
                <year>2020</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2020</year>
            </pub-date>
            <volume>4</volume>
            <elocation-id>128</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>3</day>
                    <month>9</month>
                    <year>2020</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2020 Liljenberg P</copyright-statement>
                <copyright-year>2020</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://gatesopenresearch.org/articles/4-128/pdf"/>
            <abstract>
                <p>
                    <bold>Background</bold>: For diseases like Covid-19, where it has been difficult to identify the true number of infected people, or where the number of known cases is heavily influenced by the number of tests performed, hospitalizations and deaths play a significant role in understanding the epidemic and in determining the appropriate response. However, the Covid-19 deaths data reported by some countries display a significant weekly variability, which can make the interpretation and use of the death data in analysis and modeling difficult.</p>
                <p>
                    <bold>Methods</bold>: We derive the mathematical relationship between the series of new daily deaths by reporting date and the series of deaths by death date. We then apply this formalism to the corresponding time-series reported by Sweden during the Covid-19 pandemic.</p>
                <p>
                    <bold>Results</bold>: The practice of reporting new deaths daily, as is standard procedure during an outbreak in most countries and regions, should be viewed as a time-dependent filter, modulating the underlying true death curve. After having characterized the Swedish reporting process, we show how smoothing of the Swedish reported daily deaths series results in a curve distinctly different from the true death curve. We also comment on the use of nowcasting methods.</p>
                <p>
                    <bold>Conclusions</bold>: Modelers and analysts using the series of new daily deaths by reporting date should take extra care when it is highly variable and when there is a significant reporting delay. It might be appropriate to instead use the series of deaths by death date combined with a nowcasting algorithm as basis for their analysis.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>mortality data</kwd>
                <kwd>reporting delay</kwd>
                <kwd>real-time</kwd>
                <kwd>nowcasting</kwd>
                <kwd>epidemiological modeling</kwd>
                <kwd>smoothing</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1" xlink:href="http://dx.doi.org/10.13039/100000865">
                    <funding-source>Gates Foundation</funding-source>
                </award-group>
                <funding-statement>The author is an employee of the Gates Foundation.</funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>What policy makers and analysts are interested in during an outbreak is the times series 
                <italic toggle="yes">M</italic> of number of deaths 
                <italic toggle="yes">m
                    <sub>t</sub>
                </italic> that happened on each day 
                <italic toggle="yes">t</italic>,</p>
            <p>
                <disp-formula id="e1">
                    <mml:math display="block" id="math1">
                        <mml:mrow>
                            <mml:mi>M</mml:mi>
                            <mml:mo>=</mml:mo>
                            <mml:mo>{</mml:mo>
                            <mml:msub>
                                <mml:mi>m</mml:mi>
                                <mml:mi>t</mml:mi>
                            </mml:msub>
                            <mml:mo>,</mml:mo>
                            <mml:mspace width="0.3em"/>
                            <mml:mi>t</mml:mi>
                            <mml:mo>&#x2208;</mml:mo>
                            <mml:mo stretchy="false">[</mml:mo>
                            <mml:mn>0</mml:mn>
                            <mml:mo>,</mml:mo>
                            <mml:mspace width="0.3em"/>
                            <mml:mi>T</mml:mi>
                            <mml:mo stretchy="false">]</mml:mo>
                            <mml:mo>}</mml:mo>
                        </mml:mrow>
                        <mml:mspace width="28em"/>
                        <mml:mo stretchy="false">(</mml:mo>
                        <mml:mn>1</mml:mn>
                        <mml:mo stretchy="false">)</mml:mo>
                    </mml:math> </disp-formula>
            </p>
            <p>During the outbreak, this is typically not what is available in real time. Instead, we shall have a closer look at two commonly used times series. The first time series is what has been reported by countries and regions to the WHO
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>, the US CDC
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>
                </sup>, the European CDC
                <sup>
                    <xref ref-type="bibr" rid="ref-3">3</xref>
                </sup> and that eventually end up in dashboards like Johns Hopkins University
                <sup>
                    <xref ref-type="bibr" rid="ref-4">4</xref>,
                    <xref ref-type="bibr" rid="ref-5">5</xref>
                </sup> or Our World in Data
                <sup>
                    <xref ref-type="bibr" rid="ref-6">6</xref>
                </sup>. This is the number of new deaths 
                <italic toggle="yes">r
                    <sub>t</sub>
                </italic> that has been reported to competent agencies in countries and regions during the last 24 hours, and which the countries and regions in turn report to the WHO and the European CDC on the reporting date 
                <italic toggle="yes">t</italic>. We shall call this the reporting series, 
                <italic toggle="yes">R</italic>,</p>
            <p>
                <disp-formula id="e2">
                    <mml:math display="block" id="math2">
                        <mml:mrow>
                            <mml:mi>R</mml:mi>
                            <mml:mo>=</mml:mo>
                            <mml:mo>{</mml:mo>
                            <mml:msub>
                                <mml:mi>r</mml:mi>
                                <mml:mi>t</mml:mi>
                            </mml:msub>
                            <mml:mo>,</mml:mo>
                            <mml:mspace width="0.3em"/>
                            <mml:mi>t</mml:mi>
                            <mml:mo>&#x2208;</mml:mo>
                            <mml:mo stretchy="false">[</mml:mo>
                            <mml:mn>0</mml:mn>
                            <mml:mo>,</mml:mo>
                            <mml:mspace width="0.3em"/>
                            <mml:mi>T</mml:mi>
                            <mml:mo stretchy="false">]</mml:mo>
                            <mml:mo>}</mml:mo>
                        </mml:mrow>
                        <mml:mspace width="28.5em"/>
                        <mml:mo stretchy="false">(</mml:mo>
                        <mml:mn>2</mml:mn>
                        <mml:mo stretchy="false">)</mml:mo>
                    </mml:math> </disp-formula>
            </p>
            <p>Note that this number is only on very rare occasions updated retrospectively. Normally, a new number is added to the time series every day. Also note that we know nothing about when the deaths happened, the event date, we only know the number of deaths reported on each day.</p>
            <p>The second time series is a time series that some countries, e.g. Sweden
                <sup>
                    <xref ref-type="bibr" rid="ref-7">7</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-9">9</xref>
                </sup>, Belgium
                <sup>
                    <xref ref-type="bibr" rid="ref-10">10</xref>
                </sup> and the UK
                <sup>
                    <xref ref-type="bibr" rid="ref-11">11</xref>,
                    <xref ref-type="bibr" rid="ref-12">12</xref>
                </sup>, make available on the internet. This data is normally a de-identified extract from the national surveillance and reporting system. This series has information about when the deaths actually happened. We will call this data set the event-based series 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>,</p>
            <p>
                <disp-formula id="e3">
                    <mml:math display="block" id="math3">
                        <mml:mrow>
                            <mml:msup>
                                <mml:mi>D</mml:mi>
                                <mml:mi>T</mml:mi>
                            </mml:msup>
                            <mml:mo>=</mml:mo>
                            <mml:mo>{</mml:mo>
                            <mml:msubsup>
                                <mml:mi>d</mml:mi>
                                <mml:mi>t</mml:mi>
                                <mml:mi>T</mml:mi>
                            </mml:msubsup>
                            <mml:mo>,</mml:mo>
                            <mml:mspace width="0.3em"/>
                            <mml:mi>t</mml:mi>
                            <mml:mo>&#x2208;</mml:mo>
                            <mml:mo stretchy="false">[</mml:mo>
                            <mml:mn>0</mml:mn>
                            <mml:mo>,</mml:mo>
                            <mml:mspace width="0.3em"/>
                            <mml:mi>T</mml:mi>
                            <mml:mo stretchy="false">]</mml:mo>
                            <mml:mo>}</mml:mo>
                        </mml:mrow>
                        <mml:mspace width="28em"/>
                        <mml:mo stretchy="false">(</mml:mo>
                        <mml:mn>3</mml:mn>
                        <mml:mo stretchy="false">)</mml:mo>
                    </mml:math> </disp-formula>
            </p>
            <p>This data set is a time series reported on day 
                <italic toggle="yes">T</italic>, where each number in the series, 
                <inline-formula>
                    <mml:math display="inline" id="M1">
                        <mml:mrow>
                            <mml:msubsup>
                                <mml:mi>d</mml:mi>
                                <mml:mi>t</mml:mi>
                                <mml:mi>T</mml:mi>
                            </mml:msubsup>
                            <mml:mo>,</mml:mo>
                        </mml:mrow>
                    </mml:math>
                </inline-formula> is the cumulative number of deaths on each event date 
                <italic toggle="yes">t</italic> known to the country agency on date 
                <italic toggle="yes">T</italic>. Note that typically a whole new time series is reported every day. As time goes by, more deaths find their way through the reporting process, the number of deaths for every event date in the past is updated.</p>
            <p>The two time-series as reported by Sweden on 2020-05-01 have been plotted in 
                <xref ref-type="fig" rid="f1">Figure 1</xref>. A few general comments about these curves. The 
                <italic toggle="yes">R</italic>-series varies dramatically with a weekly pattern, whereas the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series varies less but descends sharply towards zero as time approaches the reporting time 
                <italic toggle="yes">T</italic>. From inspecting the cumulative versions in 
                <xref ref-type="fig" rid="f2">Figure 2</xref> one can see the 
                <italic toggle="yes">R</italic>-series for the most part represents a delayed situation compared to the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series, but that they coincide on the reporting day. Notably, these times series do look quite different and in what follows we will discuss how to interpret these time series and where care must be taken in using these data sets as input in modelling or for decision making.</p>
            <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                <label>Figure 1. </label>
                <caption>
                    <title>The daily deaths series.</title>
                    <p>he R-series and the 
                        <italic toggle="yes">D
                            <sup>T</sup>
                        </italic>-series as reported by Sweden on 2020-05-01 and on 2020-07-09. It is enough to plot just one R-series as it does not change retrospectively unlike the 
                        <italic toggle="yes">D
                            <sup>T</sup>
                        </italic>-series, which is updated retrospectively every time it is reported.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14380/e7b416d3-92bf-45da-a893-bc3b14493dd0_figure1.gif"/>
            </fig>
            <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                <label>Figure 2. </label>
                <caption>
                    <title>The cumulative deaths series.</title>
                    <p>The cumulative R-series and the cumulative 
                        <italic toggle="yes">D
                            <sup>T</sup>
                        </italic>-series as reported by Sweden on 2020-05-01 and on 2020-07-09. Clearly, there is a delay in the R-series compared to the D
                        <sup>T</sup>-series. The cumulative number of deaths coincide for both series on the reporting day.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14380/e7b416d3-92bf-45da-a893-bc3b14493dd0_figure2.gif"/>
            </fig>
            <p>In addition to comparing the 
                <italic toggle="yes">R</italic>-series and the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series, we will say a few words about nowcasting, and introduce a naive method based on the characteristics of the reporting process.</p>
        </sec>
        <sec>
            <title>Materials and methods</title>
            <p>It turns out that there is a rather elegant mathematical framework for the two series in terms of matrix algebra. For the interested reader we have put all of that in the Appendix (see 
                <italic toggle="yes">Extended data</italic>
                <italic toggle="yes"/>
                <sup>
                    <xref ref-type="bibr" rid="ref-13">13</xref>
                </sup>). One of the key findings is equation (A16) which is the mathematical relationship between the 
                <italic toggle="yes">R</italic>-series and the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series which is worth repeating here. The relationship between 
                <italic toggle="yes">r
                    <sub>t</sub>
                </italic> and 
                <inline-formula>
                    <mml:math display="inline" id="M2">
                        <mml:mrow>
                            <mml:msubsup>
                                <mml:mi>d</mml:mi>
                                <mml:mi>e</mml:mi>
                                <mml:mi>T</mml:mi>
                            </mml:msubsup>
                        </mml:mrow>
                    </mml:math>
                </inline-formula> is </p>
            <p>
                <disp-formula id="e4">
                    <mml:math display="block" id="math4">
                        <mml:mrow>
                            <mml:msub>
                                <mml:mi>r</mml:mi>
                                <mml:mi>t</mml:mi>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:mstyle displaystyle="true">
                                <mml:munderover>
                                    <mml:mo>&#x2211;</mml:mo>
                                    <mml:mrow>
                                        <mml:mi>e</mml:mi>
                                        <mml:mo>=</mml:mo>
                                        <mml:mn>0</mml:mn>
                                    </mml:mrow>
                                    <mml:mi>T</mml:mi>
                                </mml:munderover>
                                <mml:mrow>
                                    <mml:msubsup>
                                        <mml:mi>&#x03c1;</mml:mi>
                                        <mml:mi>e</mml:mi>
                                        <mml:mi>T</mml:mi>
                                    </mml:msubsup>
                                </mml:mrow>
                            </mml:mstyle>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:mi>t</mml:mi>
                            <mml:mo>&#x2212;</mml:mo>
                            <mml:mi>e</mml:mi>
                            <mml:mo stretchy="false">)</mml:mo>
                            <mml:mspace width="0.3em"/>
                            <mml:msubsup>
                                <mml:mi>d</mml:mi>
                                <mml:mi>e</mml:mi>
                                <mml:mi>T</mml:mi>
                            </mml:msubsup>
                        </mml:mrow>
                        <mml:mspace width="28em"/>
                        <mml:mo stretchy="false">(</mml:mo>
                        <mml:mn>4</mml:mn>
                        <mml:mo stretchy="false">)</mml:mo>
                    </mml:math> </disp-formula>
            </p>
            <p>where 
                <inline-formula>
                    <mml:math display="inline" id="M3">
                        <mml:mrow>
                            <mml:msubsup>
                                <mml:mi>&#x03c1;</mml:mi>
                                <mml:mi>e</mml:mi>
                                <mml:mi>T</mml:mi>
                            </mml:msubsup>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:mtext>&#x0394;</mml:mtext>
                            <mml:mo stretchy="false">)</mml:mo>
                        </mml:mrow>
                    </mml:math>
                </inline-formula> is the so-called forward-looking probability mass function (pmf) defined as the fraction of the deaths that happened on event date 
                <italic toggle="yes">e</italic> and were reported 
                <italic toggle="yes">exactly</italic> &#x0394; days later. If there were underlying stochastic variables for when deaths happened and were reported, 
                <inline-formula>
                    <mml:math display="inline" id="M4">
                        <mml:mrow>
                            <mml:msubsup>
                                <mml:mi>&#x03c1;</mml:mi>
                                <mml:mi>e</mml:mi>
                                <mml:mi>T</mml:mi>
                            </mml:msubsup>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:mtext>&#x0394;</mml:mtext>
                            <mml:mo stretchy="false">)</mml:mo>
                        </mml:mrow>
                    </mml:math>
                </inline-formula>  would have the interpretation as an estimate of the conditional probability 
                <italic toggle="yes">P</italic>(reporting date = 
                <italic toggle="yes">e</italic>+&#x0394;  |  event date = 
                <italic toggle="yes">e</italic>). There is also a corresponding forward-looking cumulative distribution function (cdf) 
                <inline-formula>
                    <mml:math display="inline" id="M5">
                        <mml:mrow>
                            <mml:msubsup>
                                <mml:mi>&#x03d5;</mml:mi>
                                <mml:mi>e</mml:mi>
                                <mml:mi>T</mml:mi>
                            </mml:msubsup>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:mtext>&#x0394;</mml:mtext>
                            <mml:mo stretchy="false">)</mml:mo>
                        </mml:mrow>
                    </mml:math>
                </inline-formula> defined as the the fraction of the deaths that happened on event date 
                <italic toggle="yes">e</italic> and were reported 
                <italic toggle="yes">within</italic> &#x0394; days.</p>
            <p>The expression (4) makes intuitive sense. It says that the number of deaths reported on day 
                <italic toggle="yes">t</italic>, is the weighted sum of deaths happening on previous event days. The contribution from each event day 
                <italic toggle="yes">e</italic> is the total number of deaths that day 
                <inline-formula>
                    <mml:math display="inline" id="M6">
                        <mml:mrow>
                            <mml:msubsup>
                                <mml:mi>d</mml:mi>
                                <mml:mi>e</mml:mi>
                                <mml:mi>T</mml:mi>
                            </mml:msubsup>
                        </mml:mrow>
                    </mml:math>
                </inline-formula> times the probability that those deaths will be reported exactly 
                <italic toggle="yes">t</italic>&#x2013;
                <italic toggle="yes">e</italic> days later.</p>
            <p>Now, typically the reporting process does not change that much over time, in which case one may want to find the average cumulative distribution function and the average probability mass function. See expression (A15) in the Appendix (see 
                <italic toggle="yes">Extended data</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref-13">13</xref>
                </sup>) for how this is done. Also see the equation (A20) in the Appendix (see 
                <italic toggle="yes">Extended data</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref-13">13</xref>
                </sup>) for the inverse of (4), i.e. 
                <inline-formula>
                    <mml:math display="inline" id="M">
                        <mml:mrow>
                            <mml:msubsup>
                                <mml:mi>d</mml:mi>
                                <mml:mi>e</mml:mi>
                                <mml:mi>T</mml:mi>
                            </mml:msubsup>
                        </mml:mrow>
                    </mml:math>
                </inline-formula> as a function of  
                <italic toggle="yes">r
                    <sub>t</sub>
                </italic>.</p>
            <p>
                <xref ref-type="other" rid="e1">Equation (4)</xref> is interesting from another perspective. It is the defining expression of a linear time-dependent filter, where in-signal 
                <italic toggle="yes">x</italic>(
                <italic toggle="yes">t</italic>) is modulated by a time dependent filter function 
                <italic toggle="yes">f</italic> (&#x03c4;, 
                <italic toggle="yes">t</italic>, &#x2013;&#x03c4;) resulting in an out-signal 
                <italic toggle="yes">y</italic>(
                <italic toggle="yes">t</italic>)</p>
            <p>
                <disp-formula id="e5">
                    <mml:math display="block" id="math5">
                        <mml:mrow>
                            <mml:mi>y</mml:mi>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:mi>t</mml:mi>
                            <mml:mo stretchy="false">)</mml:mo>
                            <mml:mo>=</mml:mo>
                            <mml:mstyle displaystyle="true">
                                <mml:mrow>
                                    <mml:munderover>
                                        <mml:mo>&#x222b;</mml:mo>
                                        <mml:mn>0</mml:mn>
                                        <mml:mi>&#x221e;</mml:mi>
                                    </mml:munderover>
                                    <mml:mrow>
                                        <mml:mi>f</mml:mi>
                                        <mml:mo stretchy="false">(</mml:mo>
                                        <mml:mi>&#x03c4;</mml:mi>
                                        <mml:mo>,</mml:mo>
                                        <mml:mi>t</mml:mi>
                                        <mml:mo>&#x2212;</mml:mo>
                                        <mml:mi>&#x03c4;</mml:mi>
                                        <mml:mo stretchy="false">)</mml:mo>
                                        <mml:mspace width="0.3em"/>
                                        <mml:mi>x</mml:mi>
                                        <mml:mo stretchy="false">(</mml:mo>
                                        <mml:mi>&#x03c4;</mml:mi>
                                        <mml:mo stretchy="false">)</mml:mo>
                                        <mml:mspace width="0.3em"/>
                                        <mml:mi>d</mml:mi>
                                        <mml:mi>&#x03c4;</mml:mi>
                                    </mml:mrow>
                                </mml:mrow>
                            </mml:mstyle>
                        </mml:mrow>
                        <mml:mspace width="26.5em"/>
                        <mml:mo stretchy="false">(</mml:mo>
                        <mml:mn>5</mml:mn>
                        <mml:mo stretchy="false">)</mml:mo>
                    </mml:math> </disp-formula>
            </p>
            <p>In mathematical terms, the expression (5) is a so-called convolution of the true signal and the time-dependent filter. Filters are used in many areas including epidemiology, although its main application is in signal processing. Interestingly, studies of time-dependent filters relevant to this paper can be found in geology, e.g. in
                <sup>
                    <xref ref-type="bibr" rid="ref-14">14</xref>,
                    <xref ref-type="bibr" rid="ref-15">15</xref>
                </sup>. The main insight here is that the practice of reporting daily deaths, the 
                <italic toggle="yes">R</italic>-series, as is standard procedure, should be viewed as a time-dependent filter modulating the event-based curve. The reporting series 
                <italic toggle="yes">R</italic> is distinct from the true death curve, even if it in most practical applications comes very close. Below we will highlight some circumstances where this difference can be important.</p>
            <sec>
                <title>Nowcasting of the D
                    <sup>T</sup>-series</title>
                <p>As can be seen in 
                    <xref ref-type="fig" rid="f1">Figure 1</xref>, the 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series typically descends to a value close to zero on the reporting day 
                    <italic toggle="yes">T</italic>. This is because few deaths will be reported on the same day they actually happened. Many deaths have happened but are still being processed in the reporting system and will be reported at a later date. This means that for a time period prior to the reporting time, the 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series will underestimate the actual number of deaths on those days. Ideally, one would like to know the final number of deaths on any particular event date 
                    <italic toggle="yes">e</italic> before the reporting date 
                    <italic toggle="yes">T</italic>, i.e. finding the 
                    <italic toggle="yes">M</italic> series for 
                    <italic toggle="yes">t</italic> &lt; 
                    <italic toggle="yes">T</italic>. This is the problem of nowcasting. Several groups have devised algorithms to estimate the 
                    <italic toggle="yes">M</italic>-series for 
                    <italic toggle="yes">t</italic> &lt; 
                    <italic toggle="yes">T</italic>, see
                    <sup>
                        <xref ref-type="bibr" rid="ref-16">16</xref>&#x2013;
                        <xref ref-type="bibr" rid="ref-22">22</xref>
                    </sup>. In the Appendix (see 
                    <italic toggle="yes">Extended data</italic>
                    <sup>
                        <xref ref-type="bibr" rid="ref-13">13</xref>
                    </sup>) we derive a very simple expression for the 
                    <italic toggle="yes">M</italic>-series, i.e. for 
                    <italic toggle="yes">m
                        <sub>e</sub>
                    </italic> = 
                    <inline-formula>
                        <mml:math display="inline" id="M7">
                            <mml:mrow>
                                <mml:msubsup>
                                    <mml:mi>d</mml:mi>
                                    <mml:mi>e</mml:mi>
                                    <mml:mi>&#x221e;</mml:mi>
                                </mml:msubsup>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula>, which is</p>
                <p>
                    <disp-formula id="e6">
                        <mml:math display="block" id="math6">
                            <mml:mrow>
                                <mml:msub>
                                    <mml:mi>m</mml:mi>
                                    <mml:mi>e</mml:mi>
                                </mml:msub>
                                <mml:mo>=</mml:mo>
                                <mml:msubsup>
                                    <mml:mi>d</mml:mi>
                                    <mml:mi>e</mml:mi>
                                    <mml:mi>&#x221e;</mml:mi>
                                </mml:msubsup>
                                <mml:mo>=</mml:mo>
                                <mml:mrow>
                                    <mml:mrow>
                                        <mml:msubsup>
                                            <mml:mi>d</mml:mi>
                                            <mml:mi>e</mml:mi>
                                            <mml:mi>T</mml:mi>
                                        </mml:msubsup>
                                    </mml:mrow>
                                    <mml:mo>/</mml:mo>
                                    <mml:mrow>
                                        <mml:msubsup>
                                            <mml:mi>&#x03d5;</mml:mi>
                                            <mml:mi>e</mml:mi>
                                            <mml:mi>&#x221e;</mml:mi>
                                        </mml:msubsup>
                                    </mml:mrow>
                                </mml:mrow>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:mi>T</mml:mi>
                                <mml:mo>&#x2212;</mml:mo>
                                <mml:mi>e</mml:mi>
                                <mml:mo stretchy="false">)</mml:mo>
                            </mml:mrow>
                            <mml:mspace width="26.5em"/>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:mn>6</mml:mn>
                            <mml:mo stretchy="false">)</mml:mo>
                        </mml:math> </disp-formula>
                </p>
                <p>Intuitively this expression makes sense. It says that one could get the final number of deaths on an event day 
                    <italic toggle="yes">e</italic> by dividing the number of deaths on that day that have been reported so far (i.e. 
                    <inline-formula>
                        <mml:math display="inline" id="M8">
                            <mml:mrow>
                                <mml:msubsup>
                                    <mml:mi>d</mml:mi>
                                    <mml:mi>e</mml:mi>
                                    <mml:mi>T</mml:mi>
                                </mml:msubsup>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula>) by 
                    <inline-formula>
                        <mml:math display="inline" id="M9">
                            <mml:mrow>
                                <mml:msubsup>
                                    <mml:mi>&#x03d5;</mml:mi>
                                    <mml:mi>e</mml:mi>
                                    <mml:mi>&#x221e;</mml:mi>
                                </mml:msubsup>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:mi>T</mml:mi>
                                <mml:mo>&#x2212;</mml:mo>
                                <mml:mi>e</mml:mi>
                                <mml:mo stretchy="false">)</mml:mo>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula>, which the fraction of the final number of deaths on day 
                    <italic toggle="yes">e</italic> reported within 
                    <italic toggle="yes">T</italic>&#x2013;
                    <italic toggle="yes">e</italic> days since day 
                    <italic toggle="yes">e</italic>. Of course, at time 
                    <italic toggle="yes">T</italic> we don&#x2019;t know what the final cdf 
                    <inline-formula>
                        <mml:math display="inline" id="M10">
                            <mml:mrow>
                                <mml:msubsup>
                                    <mml:mi>&#x03d5;</mml:mi>
                                    <mml:mi>e</mml:mi>
                                    <mml:mi>&#x221e;</mml:mi>
                                </mml:msubsup>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula> will look like but by making the assumption that the reporting process will behave as it has done in the recent past, we will be able to compute simple nowcasts below. The drawback of this method is that it does not work when 
                    <inline-formula>
                        <mml:math display="inline" id="M11">
                            <mml:mrow>
                                <mml:msubsup>
                                    <mml:mi>d</mml:mi>
                                    <mml:mi>e</mml:mi>
                                    <mml:mi>T</mml:mi>
                                </mml:msubsup>
                                <mml:mo>=</mml:mo>
                                <mml:mn>0</mml:mn>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula>, but as we will see it can still be quite useful.</p>
            </sec>
            <sec>
                <title>Source data and data processing</title>
                <p>The computations in this paper have been based on the fundamental data set 
                    <inline-formula>
                        <mml:math display="inline" id="M12">
                            <mml:mrow>
                                <mml:msubsup>
                                    <mml:mi>d</mml:mi>
                                    <mml:mrow>
                                        <mml:mi>e</mml:mi>
                                        <mml:mi>t</mml:mi>
                                    </mml:mrow>
                                    <mml:mi>T</mml:mi>
                                </mml:msubsup>
                                <mml:mo>,</mml:mo>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula> which was constructed from downloading the 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series daily from 2020-04-02 to 2020-07-09. Data processing was done in R-studio (v1.2.1335) using R version 3.6.0 (2019-04-26). An R-script including relevant R-packages has been made available. Further information can be found in the 
                    <italic toggle="yes">Data availability</italic> section
                    <sup>
                        <xref ref-type="bibr" rid="ref-13">13</xref>
                    </sup>
                    <xref ref-type="bibr" rid="ref-23">23</xref>.</p>
            </sec>
        </sec>
        <sec>
            <title>Results</title>
            <p>We will now apply the methods described above and, in the Appendix (see 
                <italic toggle="yes">Extended data</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref-13">13</xref>
                </sup>), to analyze the relationship between the reporting series and the event-based series as reported by Sweden.</p>
            <sec>
                <title>The Swedish reporting process for Covid-19 deaths</title>
                <p>As we have argued, the reporting process is characterized by the two time-dependent distribution functions 
                    <inline-formula>
                        <mml:math display="inline" id="M13">
                            <mml:mrow>
                                <mml:msubsup>
                                    <mml:mi>&#x03d5;</mml:mi>
                                    <mml:mi>e</mml:mi>
                                    <mml:mi>T</mml:mi>
                                </mml:msubsup>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:mtext>&#x0394;</mml:mtext>
                                <mml:mo stretchy="false">)</mml:mo>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula> and 
                    <inline-formula>
                        <mml:math display="inline" id="M14">
                            <mml:mrow>
                                <mml:msubsup>
                                    <mml:mi>&#x03c1;</mml:mi>
                                    <mml:mi>e</mml:mi>
                                    <mml:mi>T</mml:mi>
                                </mml:msubsup>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:mtext>&#x0394;</mml:mtext>
                                <mml:mo stretchy="false">)</mml:mo>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula>. In 
                    <xref ref-type="fig" rid="f3">Figure 3</xref> we have plotted the cumulative distribution functions for all event dates between 2020-04-02 and 2020-06-01, as well as the resulting mean cdf.</p>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>Figure 3. </label>
                    <caption>
                        <title>The cdfs for the Swedish reporting process of deaths.</title>
                        <p>Plot of all cdfs 
                            <inline-formula>
                                <mml:math display="inline" id="M15">
                                    <mml:mrow>
                                        <mml:msubsup>
                                            <mml:mi>&#x03d5;</mml:mi>
                                            <mml:mi>e</mml:mi>
                                            <mml:mi>T</mml:mi>
                                        </mml:msubsup>
                                        <mml:mo stretchy="false">(</mml:mo>
                                        <mml:mtext>&#x0394;</mml:mtext>
                                        <mml:mo stretchy="false">)</mml:mo>
                                    </mml:mrow>
                                </mml:math>
                            </inline-formula> for event dates between 2020-04-02 and 2020-06-01. The bars represent the mean cdf which is time independent. The cdf 
                            <inline-formula>
                                <mml:math display="inline" id="M16">
                                    <mml:mrow>
                                        <mml:msubsup>
                                            <mml:mi>&#x03d5;</mml:mi>
                                            <mml:mi>e</mml:mi>
                                            <mml:mi>T</mml:mi>
                                        </mml:msubsup>
                                        <mml:mo stretchy="false">(</mml:mo>
                                        <mml:mtext>&#x0394;</mml:mtext>
                                        <mml:mo stretchy="false">)</mml:mo>
                                    </mml:mrow>
                                </mml:math>
                            </inline-formula> is defined as the fraction of deaths on a particular event date e that will be reported within &#x0394; days.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14380/e7b416d3-92bf-45da-a893-bc3b14493dd0_figure3.gif"/>
                </fig>
                <p>In 
                    <xref ref-type="fig" rid="f4">Figure 4</xref> we show the corresponding mean pmf 
                    <inline-formula>
                        <mml:math display="inline" id="M17">
                            <mml:mrow>
                                <mml:msubsup>
                                    <mml:mi>&#x03c1;</mml:mi>
                                    <mml:mi>e</mml:mi>
                                    <mml:mi>T</mml:mi>
                                </mml:msubsup>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:mtext>&#x0394;</mml:mtext>
                                <mml:mo stretchy="false">)</mml:mo>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula>. The first observation is that there is a significant delay in the reporting process. The average delay can be computed using equation (A14) and is 5.2 days. By inspection of the cdf, we see that it takes on average 7 days to capture 75% of the deaths on a particular day, and about 10 days to capture 90% of the deaths.</p>
                <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                    <label>Figure 4. </label>
                    <caption>
                        <title>The mean pmf for the Swedish reporting process of deaths.</title>
                        <p>Plot of the mean probability mass function corresponding to the mean cdf across all event days between 2020-04-02 and 2020-06-01.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14380/e7b416d3-92bf-45da-a893-bc3b14493dd0_figure4.gif"/>
                </fig>
                <p>Furthermore, from inspection of the pmf in 
                    <xref ref-type="fig" rid="f4">Figure 4</xref>, there seems to be a substructure to the reporting process. One can identify three sub-processes with different delays. Allow us to speculate that we have one reporting process for deaths in hospitals with an average delay of 1&#x2013;2 days, one process for deaths in nursing homes with an average delay of 6&#x2013;7 days and a third process that for some reason takes even longer, with an average delay of 11&#x2013;12 days. One can imagine the pmf resulting from superposition of three distributions where the respective areas under the curve correspond to the fraction of deaths in those three settings respectively. Again, by inspection of the pmf, we estimate that roughly 3/8, 1/2 and 1/8 of the deaths happened in those three settings respectively. This is roughly in line with what has been reported by Sweden
                    <sup>
                        <xref ref-type="bibr" rid="ref-24">24</xref>
                    </sup>.</p>
                <p>In order to better understand the weekly periodic pattern in the 
                    <italic toggle="yes">R</italic>-series, we have computed the average distributions for each weekday. The average cumulative distribution functions for each weekday is plotted in 
                    <xref ref-type="fig" rid="f5">Figure 5</xref>. There is a flat portion of each curve representing the fact that deaths were essentially not reported on Sundays. For Saturdays the flat portion is between day 0 and day 1; for Fridays between day 1 and 2 etc. Sweden stopped reporting both the 
                    <italic toggle="yes">R</italic> and the 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series on weekends from 2020-06-20 onwards.</p>
                <fig fig-type="figure" id="f5" orientation="portrait" position="float">
                    <label>Figure 5. </label>
                    <caption>
                        <title>The average cdfs by weekday for the Swedish reporting process of deaths.</title>
                        <p>Plot of the average cdfs by weekday, for all event days between 2020-04-02 and 2020-06-01.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14380/e7b416d3-92bf-45da-a893-bc3b14493dd0_figure5.gif"/>
                </fig>
                <p>We also plot the pmfs in 
                    <xref ref-type="fig" rid="f6">Figure 6</xref>. Given the simplicity of the mean distribution, the differences by weekday are a little surprising. It is a complex interplay between the three sub-processes mentioned previously and a significantly reduced reporting on Saturdays and Sundays resulting in the "valley" on day 6 on Mondays moving one day to the left every day.</p>
                <fig fig-type="figure" id="f6" orientation="portrait" position="float">
                    <label>Figure 6. </label>
                    <caption>
                        <title>The average pmfs by weekday for the Swedish reporting process of deaths.</title>
                        <p>Plot of the average pmfs by weekday, for all event days between 2020-04-02 and 2020-06-01.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14380/e7b416d3-92bf-45da-a893-bc3b14493dd0_figure6.gif"/>
                </fig>
                <p>Next, in order to isolate the effect of the reporting process, we compute the effect of the Swedish reporting process on a hypothetical smooth bell shaped death curve, much like we have done for two simple examples in Figure A11 and Figure A12 in the Appendix (see 
                    <italic toggle="yes">Extended data</italic>
                    <sup>
                        <xref ref-type="bibr" rid="ref-13">13</xref>
                    </sup>). The result can be seen in 
                    <xref ref-type="fig" rid="f7">Figure 7</xref>.</p>
                <fig fig-type="figure" id="f7" orientation="portrait" position="float">
                    <label>Figure 7. </label>
                    <caption>
                        <title>Application of the Swedish reporting process to a hypothetical death curve.</title>
                        <p>Plot of the Swedish reporting process applied to a hypothetical smooth bell-shaped actual deaths curve. The 
                            <italic toggle="yes">R</italic>-series resulting from application of the full time-dependent set of pmfs shows a response very similar to reported 
                            <italic toggle="yes">R</italic>-series. The 
                            <italic toggle="yes">R</italic>-series resulting from the application of the time-independent mean pmf produces a modified bell-shaped 
                            <italic toggle="yes">R</italic>-series which is flatter and wider with a slightly different slope.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14380/e7b416d3-92bf-45da-a893-bc3b14493dd0_figure7.gif"/>
                </fig>
                <p>Again, the daily, time-dependent pmfs produce a highly variable output just like the observed reporting series. Contrasting this output to the output one gets using the mean time-independent pmf, we conclude that the periodic pattern in the reporting series has its origin in characteristics of the reporting process rather than in the characteristics of the actual deaths curve. Although this is perhaps not very surprising, the amplitude of the periodic pattern is surprising. Relatively small differences in the reporting processes from one weekday to the next results in these wild swings in the reported daily deaths. Is there an opportunity to give guidance to the countries for how to reduce these resulting swings?  We also note that the Swedish reporting process modifies the shape of the hypothetical death curve in three ways, just like the hypothetical cases in the Appendix (see 
                    <italic toggle="yes">Extended data</italic>
                    <sup>
                        <xref ref-type="bibr" rid="ref-13">13</xref>
                    </sup>). First, there is a clear time shift. Second, the peak is lower. Third, the slope is flatter, both as deaths increase and decrease.</p>
            </sec>
            <sec>
                <title>Smoothing of the Swedish 
                    <italic toggle="yes">R</italic>-series</title>
                <p>As we have seen, the Swedish 
                    <italic toggle="yes">R</italic>-series is very variable. Hence, when using the 
                    <italic toggle="yes">R</italic>-series as model input one may have to do some pre-processing. If used in a time dependent model together with case incidence or prevalence (e.g. in an SIR model) one may have to adjust for the reporting delay, since the reporting of cases normally is quicker than the reporting of deaths. The average reporting delay for cases in Sweden is approximately 1&#x2013;2 days.</p>
                <p>Additionally, if initial conditions need to be specified for the magnitude as well as the slope of the death curve, some kind of smoothing of the 
                    <italic toggle="yes">R</italic>-series is appropriate. The question then arises, what is a good smoothing of the 
                    <italic toggle="yes">R</italic>-series?  Since the 
                    <italic toggle="yes">R</italic>-series is not the same as the 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series, what should a good smoothing of the 
                    <italic toggle="yes">R</italic>-series look like? The example in the previous section gives us a clue. Based on the graphs plotted in 
                    <xref ref-type="fig" rid="f7">Figure 7</xref>, a measure of how good a smoothing is to see how closely it matches the "transformed" 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series, i.e. the series one would obtain by applying a "smoothed reporting process" to the event based series, i.e. using the time-independent mean pmf rather than the time-dependent pmfs. In addition, a good smoothing should have the same area under the curve as the 
                    <italic toggle="yes">R</italic>-series. Unfortunately, neither the 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series, nor the mean pmf are available to the modelers.</p>
                <p>One model that relies on the shape of the death curve is the model developed by the Institute of Health Metrics and Evaluations (IHME) at University of Washington
                    <sup>
                        <xref ref-type="bibr" rid="ref-25">25</xref>
                    </sup>. In 
                    <xref ref-type="fig" rid="f8">Figure 8</xref> we have plotted the Swedish 
                    <italic toggle="yes">R</italic>-series as reported on 2020-06-03 well as the smoothing of the 
                    <italic toggle="yes">R</italic>-series by IHME modelers in the 2020-06-05 update of their model
                    <sup>
                        <xref ref-type="bibr" rid="ref-26">26</xref>
                    </sup>. We have also plotted the transformed 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series one obtains by filtering the 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic> series using the mean pmf.</p>
                <fig fig-type="figure" id="f8" orientation="portrait" position="float">
                    <label>Figure 8. </label>
                    <caption>
                        <title>Smoothing of the Swedish R-series.</title>
                        <p>Plot showing the IHME smoothing of the R-series on 2020-06-03 in comparison with the Transformed 
                            <italic toggle="yes">D
                                <sup>T</sup>
                            </italic>-series using the mean pmf. Note there is an uptick in the R-series just before the reporting date, which may have influenced the smoothing. Also note that the Transformed 
                            <italic toggle="yes">D
                                <sup>T</sup>
                            </italic>-series has a slightly lower peak, a significant delay and a lesser slope than the 
                            <italic toggle="yes">D
                                <sup>T</sup>
                            </italic>-series.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14380/e7b416d3-92bf-45da-a893-bc3b14493dd0_figure8.gif"/>
                </fig>
                <p>At this particular instance the IHME modelers were unfortunate to produce a smoothing that drastically changed the outcome of the model and we can see that it deviates significantly from the transformed 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series. This IHME smoothing preserved the number of deaths. For reference, the 2020-06-05 update of the model estimated 8357 (7046, 10386) number of deaths by 2020-08-01. A previous update on 2020-05-29, with a different smoothing, estimated 5254 (4688, 6420) number of deaths. In their next update of the model on 2020-06-25, the smoothing is very similar to the transformed 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series.</p>
                <p>Please note that by only using the smoothed 
                    <italic toggle="yes">R</italic>-series as input, one does not compensate for the delay in the reporting process, nor for the change in shape of the curve. This means for the total number of deaths, as well as the rise (or decline) in number of deaths is underestimated.</p>
            </sec>
            <sec>
                <title>Nowcasting of the Swedish 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series</title>
                <p>Turning our attention to the 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series, we were curious to see how a state of the art nowcasting algorithm would perform in the presence of the strong weekly patterns in the Swedish time-dependent reporting process. We have therefore compared a "naive" approximation of the nowcasting formula (6) with the nowcasting algorithm developed by a group at Harvard
                    <sup>
                        <xref ref-type="bibr" rid="ref-17">17</xref>
                    </sup>. We wanted to use both a time independent and a time-dependent approximation and used the following two naive approximations</p>
                <p>
                    <disp-formula id="e7">
                        <mml:math display="block" id="math7">
                            <mml:mrow>
                                <mml:msubsup>
                                    <mml:mi>&#x03d5;</mml:mi>
                                    <mml:mi>e</mml:mi>
                                    <mml:mi>&#x221e;</mml:mi>
                                </mml:msubsup>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:mtext>&#x0394;</mml:mtext>
                                <mml:mo stretchy="false">)</mml:mo>
                                <mml:mspace width="0.4em"/>
                                <mml:mo>&#x2248;</mml:mo>
                                <mml:mspace width="0.3em"/>
                                <mml:msubsup>
                                    <mml:mi>&#x03d5;</mml:mi>
                                    <mml:mrow>
                                        <mml:mn>21</mml:mn>
                                        <mml:mo>&#x2212;</mml:mo>
                                        <mml:mn>34</mml:mn>
                                    </mml:mrow>
                                    <mml:mi>&#x221e;</mml:mi>
                                </mml:msubsup>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:mtext>&#x0394;</mml:mtext>
                                <mml:mo stretchy="false">)</mml:mo>
                            </mml:mrow>
                            <mml:mspace width="18em"/>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:mo>Mean</mml:mo>
                            <mml:mspace width="0.5em"/>
                            <mml:mo>nowcast</mml:mo>
                            <mml:mo stretchy="false">)</mml:mo>
                            <mml:mspace width="2em"/>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:mn>7</mml:mn>
                            <mml:mo stretchy="false">)</mml:mo>
                        </mml:math> </disp-formula>
                </p>
                <p>
                    <disp-formula id="e8">
                        <mml:math display="block" id="math8">
                            <mml:mrow>
                                <mml:msubsup>
                                    <mml:mi>&#x03d5;</mml:mi>
                                    <mml:mi>e</mml:mi>
                                    <mml:mi>&#x221e;</mml:mi>
                                </mml:msubsup>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:mtext>&#x0394;</mml:mtext>
                                <mml:mo stretchy="false">)</mml:mo>
                                <mml:mspace width="0.4em"/>
                                <mml:mo>&#x2248;</mml:mo>
                                <mml:mspace width="0.3em"/>
                                <mml:mfrac>
                                    <mml:mn>1</mml:mn>
                                    <mml:mn>2</mml:mn>
                                </mml:mfrac>
                                <mml:mo stretchy="false">[</mml:mo>
                                <mml:msubsup>
                                    <mml:mi>&#x03d5;</mml:mi>
                                    <mml:mrow>
                                        <mml:mi>e</mml:mi>
                                        <mml:mo>&#x2212;</mml:mo>
                                        <mml:mn>21</mml:mn>
                                    </mml:mrow>
                                    <mml:mi>&#x221e;</mml:mi>
                                </mml:msubsup>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:mtext>&#x0394;</mml:mtext>
                                <mml:mo stretchy="false">)</mml:mo>
                                <mml:mo>+</mml:mo>
                                <mml:msubsup>
                                    <mml:mi>&#x03d5;</mml:mi>
                                    <mml:mrow>
                                        <mml:mi>e</mml:mi>
                                        <mml:mo>&#x2212;</mml:mo>
                                        <mml:mn>28</mml:mn>
                                    </mml:mrow>
                                    <mml:mi>&#x221e;</mml:mi>
                                </mml:msubsup>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:mtext>&#x0394;</mml:mtext>
                                <mml:mo stretchy="false">)</mml:mo>
                                <mml:mo stretchy="false">]</mml:mo>
                                <mml:mspace width="10em"/>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:mo>Weekday</mml:mo>
                                <mml:mspace width="0.5em"/>
                                <mml:mo>forecast</mml:mo>
                                <mml:mo stretchy="false">)</mml:mo>
                                <mml:mspace width="2em"/>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:mn>8</mml:mn>
                                <mml:mo stretchy="false">)</mml:mo>
                            </mml:mrow>
                        </mml:math> </disp-formula>
                </p>
                <p>where 
                    <inline-formula>
                        <mml:math display="inline" id="M18">
                            <mml:mrow>
                                <mml:msubsup>
                                    <mml:mi>&#x03d5;</mml:mi>
                                    <mml:mrow>
                                        <mml:mn>21</mml:mn>
                                        <mml:mo>&#x2212;</mml:mo>
                                        <mml:mn>34</mml:mn>
                                    </mml:mrow>
                                    <mml:mi>T</mml:mi>
                                </mml:msubsup>
                                <mml:mo stretchy="false">(</mml:mo>
                                <mml:mtext>&#x0394;</mml:mtext>
                                <mml:mo stretchy="false">)</mml:mo>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula> is the mean cdf computed as an average of the 14 cdfs for the event days 21&#x2013;34 days before the reporting time 
                    <italic toggle="yes">T</italic>. Note that we have to go 21 days back in order to have the cdf defined for &#x0394; between 0 and 21 days. We deliberately chose a multiple of 7 to match with the correct weekday. Starting on Sunday 2020-05-10, we have plotted 7 nowcasts of the 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series. The first graphs are shown in 
                    <xref ref-type="fig" rid="f9">Figure 9</xref> and the following six in 
                    <xref ref-type="fig" rid="f10">Figure 10</xref>.</p>
                <fig fig-type="figure" id="f9" orientation="portrait" position="float">
                    <label>Figure 9. </label>
                    <caption>
                        <title>Nowcasts of the Swedish 
                            <italic toggle="yes">D
                                <sup>T</sup>
                            </italic>-series, initial day.  </title>
                        <p>Graph showing the 
                            <italic toggle="yes">D
                                <sup>T</sup>
                            </italic>-series, the Mean nowcast, the Weekday nowcast and the Harvard nowcast of the 
                            <italic toggle="yes">D
                                <sup>T</sup>
                            </italic>-series on the reporting day 2020-05-10. The 
                            <italic toggle="yes">D
                                <sup>T</sup>
                            </italic>-series as of 2020-07-09 serves as the true M-series. On this reporting day, the three nowcasts are quite different for the two days prior to the reporting date.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14380/e7b416d3-92bf-45da-a893-bc3b14493dd0_figure9.gif"/>
                </fig>
                <fig fig-type="figure" id="f10" orientation="portrait" position="float">
                    <label>Figure 10. </label>
                    <caption>
                        <title>Nowcasts of the Swedish 
                            <italic toggle="yes">D
                                <sup>T</sup>
                            </italic>-series, subsequent days. </title>
                        <p>Graph showing the zoomed-in 
                            <italic toggle="yes">D
                                <sup>T</sup>
                            </italic>-series, the Mean nowcast, the Weekday nowcast and the Harvard nowcast of the 
                            <italic toggle="yes">D
                                <sup>T</sup>
                            </italic>-series on reporting days between 2020-05-11 and 2020-05-16. The 
                            <italic toggle="yes">D
                                <sup>T</sup>
                            </italic>-series as of 2020-07-09 serves as the true M-series.  Although the Harvard nowcast is better, the naive nowcasts can be useful.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://gatesopenresearch-files.f1000.com/manuscripts/14380/e7b416d3-92bf-45da-a893-bc3b14493dd0_figure10.gif"/>
                </fig>
                <p>Generally, the Harvard nowcast algorithm performs better, but since also the Harvard nowcast fluctuates, it would be interesting to see if there are improvements that can be made by taking the weekly pattern better into account. It should be noted that we have used default settings for the Harvard nowcast. Furthermore, one can likely improve upon the approximations (7) and (8) but nowcasting is not the focus of this paper. The takeaway message here is that there are good nowcast methods available to analysts if they have access to the fundamental data set 
                    <inline-formula>
                        <mml:math display="inline" id="M19">
                            <mml:mrow>
                                <mml:msubsup>
                                    <mml:mi>d</mml:mi>
                                    <mml:mrow>
                                        <mml:mi>e</mml:mi>
                                        <mml:mi>t</mml:mi>
                                    </mml:mrow>
                                    <mml:mi>T</mml:mi>
                                </mml:msubsup>
                            </mml:mrow>
                        </mml:math>
                    </inline-formula> and if they are interested in using the best possible input in their analyses.</p>
            </sec>
        </sec>
        <sec>
            <title>Conclusion</title>
            <p>By viewing the 
                <italic toggle="yes">R</italic>-series and the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series as two aspects of the same underlying data set, we conclude that the 
                <italic toggle="yes">R</italic>-series is the result of a reporting process that "filters" the event based series, the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series, see 
                <xref ref-type="other" rid="e1">Equation (4)</xref>. In many cases this does not result in a significant difference between the two series. However, it turns out that the Swedish reporting process for Covid-19 deaths happens to generate a 
                <italic toggle="yes">R</italic>-series that looks quite different from the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series. The 
                <italic toggle="yes">R</italic>-series is wildly varying in time, which makes it hard to see trends and use as model input.</p>
            <p>To remedy this for the Swedish 
                <italic toggle="yes">R</italic>-series, smoothing of the curve is appropriate. Unfortunately, it is hard to know what constitutes a good smoothing, unless you have access to the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series. However, three factors are worth paying close attention to. First, most reporting processes have a built-in delay, leading to a corresponding shift in time between the 
                <italic toggle="yes">R</italic>-series and the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series. Second, if the delay is significant, the shape of the curve will also be affected. The peak will be lower, and the slope of the smooth 
                <italic toggle="yes">R</italic>-series will be less than the slope of the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series. This may result in an estimate of number of deaths that is an under-estimate during times of increasing daily deaths counts, and an over-estimate during times of decreasing death counts. Finally, if the slope of the deaths curve just before the reporting time is of importance, one should note that the perfectly smooth 
                <italic toggle="yes">R</italic>-series will to some degree reflect the slope of the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series and drop off just before the reporting time. Getting the slope right based on the 
                <italic toggle="yes">R</italic>-series can therefore be a very hard problem, as we have seen in a real world case shown in 
                <xref ref-type="fig" rid="f8">Figure 8</xref>. It is worth noting that smoothing of the 
                <italic toggle="yes">R</italic>-series by applying a 7- or 14-day rolling average, which has been very common when reporting Covid-19 deaths, might be a good idea but it suffers from the same short comings as mentioned above. It adds another four (alt. eight) days of delay, dampens the peaks, and additionally flattens the slope of the death curve.</p>
            <p>So, what is the best method? In cases where the there is a significant difference between the 
                <italic toggle="yes">R</italic>-series and the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series, it is probably not very controversial to be of the opinion that one should use the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series. However, one then faces the problem of a 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series that drops off to zero close to the reporting date, see 
                <xref ref-type="fig" rid="f1">Figure 1</xref>, and one is forced to use a nowcasting method to compensate for this drop-off. As we have seen in 
                <xref ref-type="fig" rid="f9">Figure 9</xref> and 
                <xref ref-type="fig" rid="f10">Figure 10</xref>, the nowcasting series are not perfect and do fluctuate when applied to the Swedish data set. Nevertheless, whereas the change in shape of the 
                <italic toggle="yes">R</italic>-series from one day to another can be significant, the shape of 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series never changes that much since it distributes the new deaths over many event dates. This is the main reason why using the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series with nowcasting is preferable to using the 
                <italic toggle="yes">R</italic>-series with smoothing, shifting and potentially compensating for a difference in shape.</p>
            <p>Coming back to the Swedish situation, back in May it was not a trivial task to interpret the death curve and seeing the trend if you only had access to the 
                <italic toggle="yes">R</italic>-series. However, having access to the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series and a nowcasting algorithm did give you confidence that the number of deaths were indeed declining. Of course, access to data for hospitalizations, ICU occupancy and case incidence gives a more complete picture and help interpret the situation. Nevertheless, it would be welcome if more countries made the 
                <italic toggle="yes">D
                    <sup>T</sup>
                </italic>-series available.</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <sec>
                <title>Underlying data</title>
                <p>The primary data set analyzed in this paper was constructed by downloading the 
                    <italic toggle="yes">D
                        <sup>T</sup>
                    </italic>-series daily from the Public Health Agency of Sweden
                    <sup>
                        <xref ref-type="bibr" rid="ref-7">7</xref>
                    </sup> between 2020-04-02 and 2020-07-09.  The data is publicly available and is considered public domain data.</p>
                <p>Zenodo: On the use of real-time mortality data in modelling and analysis during an epidemic outbreak &#x2013; underlying data</p>
                <p>
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.3992345">https://doi.org/10.5281/zenodo.3992345</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-23">23</xref>
                    </sup>. </p>
                <p>This project contains the following underlying data:</p>
                <list list-type="bullet">
                    <list-item>
                        <p>
                            <monospace>FHM_Covid_Download.zip</monospace>. (Zip-archive of raw downloaded files with Swedish deaths data.)</p>
                    </list-item>
                    <list-item>
                        <p>
                            <monospace>swedish_covid_deaths_data.csv</monospace>. (Swedish deaths data collated from the raw data files in a .csv format.)</p>
                    </list-item>
                    <list-item>
                        <p>
                            <monospace>swedish_covid_deaths_data.xlsx</monospace>. (Swedish deaths data collated from the raw data files in a .xlsx format.)</p>
                    </list-item>
                </list>
                <p>Data are available under the terms of the 
                    <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/publicdomain/zero/1.0/">Creative Commons Zero "No rights reserved" data waiver</ext-link> (CC0 1.0 Public domain dedication).</p>
                <p>In the article we also discuss and plot data generated by 
                    <ext-link ext-link-type="uri" xlink:href="http://www.healthdata.org/covid/data-downloads">Institute of Health Metrics and Evaluation</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-26">26</xref>
                    </sup> in the 2020-06-05 update of their model. The terms and conditions can be found or their website and states for non-commercial users: 
                    <italic toggle="yes">&#x201c;Data made available for download on IHME Websites can be used, shared, modified or built upon by non-commercial users via the Creative Commons Attribution-NonCommercial 4.0 International License (
                        <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by-nc/4.0/">https://creativecommons.org/licenses/by-nc/4.0/)&#x201d;</ext-link>
                    </italic>.</p>
            </sec>
            <sec>
                <title>Extended data</title>
                <p>The Appendix to this article as well as the R-code to generate the graphs and the nowcasts are available as extended data.</p>
                <p>Zenodo: On the use of real-time mortality data in modelling and analysis during an epidemic outbreak &#x2013; extended data.</p>
                <p>
                    <ext-link ext-link-type="uri" xlink:href="http://doi.org/10.5281/zenodo.4005244">http://doi.org/10.5281/zenodo.4005244</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-13">13</xref>
                    </sup>.</p>
                <p>This project contains the following extended data:</p>
                <list list-type="bullet">
                    <list-item>
                        <p>
                            <monospace>Liljenberg2020_OGR_Appendix.pdf</monospace> (Appendix to the main article)</p>
                    </list-item>
                    <list-item>
                        <p>
                            <monospace>swedish_covid_deaths_OGR.R</monospace>. (R-script to generate graphs and nowcasts in the paper.)</p>
                    </list-item>
                    <list-item>
                        <p>
                            <monospace>MDAR author checklist.pdf</monospace> (Completed MDAR reporting checklist)</p>
                    </list-item>
                </list>
                <p>Data are available under the terms of the 
                    <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/publicdomain/zero/1.0/">Creative Commons Zero "No rights reserved" data waiver</ext-link> (CC0 1.0 Public domain dedication).</p>
            </sec>
        </sec>
    </body>
    <back>
        <ack>
            <title>Acknowledgements</title>
            <p>We would like to acknowledge Dr. Sarah McGough for assistance with the Harvard nowcasting r-package.</p>
        </ack>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <collab>World Health Organization (WHO)</collab>:
                    <article-title>Coronavirus Disease (COVID-19) Situation Reports</article-title>. [Accessed: 01-Aug-2020].
                    <ext-link ext-link-type="uri" xlink:href="https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <collab>Centers for Disease Control and Prevention (U.S.)</collab>:
                    <article-title>CDC COVID Data Tracker</article-title>. [Accessed: 01-Aug-2020].</mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <collab>European Centre for Disease Control</collab>:
                    <article-title>How ECDC collects and processes COVID-19 data</article-title>.<year>2020</year>. [Accessed: 05-Jul-2020].
                    <ext-link ext-link-type="uri" xlink:href="https://www.ecdc.europa.eu/en/covid-19/data-collection">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <collab>Johns Hopkins University</collab>:
                    <article-title>COVID-19 Map - Coronavirus Resource Center</article-title>.<year>2020</year>. [Accessed: 01-Aug-2020].
                    <ext-link ext-link-type="uri" xlink:href="https://coronavirus.jhu.edu/map.html">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Dong</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Du</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gardner</surname>
                            <given-names>L</given-names>
                        </name>
</person-group>:
                    <article-title>An interactive web-based dashboard to track COVID-19 in real time.</article-title>
                    <source>

                        <italic toggle="yes">Lancet Infect Dis.</italic>
</source>Lancet Publishing Group.<year>2020</year>;<volume>20</volume>(<issue>5</issue>):<fpage>533</fpage>&#x2013;<lpage>534</lpage>.
                    <pub-id pub-id-type="pmid">32087114</pub-id>
                    <pub-id pub-id-type="doi">10.1016/S1473-3099(20)30120-1 </pub-id>
                    <pub-id pub-id-type="pmcid">7159018 </pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <collab>Our World in Data</collab>:
                    <article-title>Coronavirus Pandemic (COVID-19) - Statistics and Research</article-title>. [Accessed: 01-Aug-2020].
                    <ext-link ext-link-type="uri" xlink:href="https://ourworldindata.org/coronavirus">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <collab>Folkh&#x00e4;losmyndigheten (Public health agency of Sweden)</collab>:
                    <article-title>Bekr&#x00e4;ftade fall i Sverige &#x2014; Folkh&#x00e4;lsomyndigheten</article-title>. [Accessed: 10-Jul-2020].
                    <ext-link ext-link-type="uri" xlink:href="https://www.folkhalsomyndigheten.se/smittskydd-beredskap/utbrott/aktuella-utbrott/covid-19/statistik-och-analyser/bekraftade-fall-i-sverige/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <collab>Folkh&#x00e4;lsomyndigheten (Public health agency of Sweden)</collab>:
                    <article-title>Bekr&#x00e4;ftade fall i Sverige - daglig uppdatering</article-title>. [Accessed: 05-Jul-2020].
                    <ext-link ext-link-type="uri" xlink:href="https://experience.arcgis.com/experience/09f821667ce64bf7be6f9f87457ed9aa">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <collab>Sveriges Television</collab>:
                    <article-title>The spread of the new coronavirus - SVT Nyheter</article-title>. [Accessed: 05-Jul-2020].
                    <ext-link ext-link-type="uri" xlink:href="https://www.svt.se/datajournalistik/the-spread-of-the-coronavirus/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-10">
                <label>10</label>
                <mixed-citation publication-type="journal">
                    <collab>Sciensano (Belgian Institute for Health)</collab>:
                    <article-title>Epistat - COVID-19 Monitoring</article-title>.<year>2020</year>. [Accessed: 18-Aug-2020].
                    <ext-link ext-link-type="uri" xlink:href="https://epistat.wiv-isp.be/covid/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <collab>National Health Service</collab>:
                    <article-title>Statistics&#x202f;&#x00bb; COVID-19 Daily Deaths</article-title>. [Accessed: 10-Jul-2020]].
                    <ext-link ext-link-type="uri" xlink:href="https://www.england.nhs.uk/statistics/statistical-work-areas/covid-19-daily-deaths/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <collab>GOV.UK</collab>:
                    <article-title>Coronavirus (COVID-19) in the UK</article-title>. [Accessed: 05-Jul-2020].
                    <ext-link ext-link-type="uri" xlink:href="https://coronavirus.data.gov.uk/#category=nations&amp;map=rate">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-13">
                <label>13</label>
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Liljenberg</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>On the use of real-time mortality data in modelling and analysis during an epidemic outbreak - extended data [Data set].</article-title>
                    <source>

                        <italic toggle="yes">Zenodo.</italic>
</source>
                    <year>2020</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.doi.org/10.5281/zenodo.4005244">http://www.doi.org/10.5281/zenodo.4005244</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-14">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Margrave</surname>
                            <given-names>GF</given-names>
                        </name>
</person-group>:
                    <article-title>Theory of nonstationary linear filtering in the Fourier domain with application to time&#x2010;variant filtering.</article-title>
                    <source>

                        <italic toggle="yes">GEOPHYSICS.</italic>
</source>
                    <year>1998</year>;<volume>63</volume>(<issue>1</issue>):<fpage>244</fpage>&#x2013;<lpage>259</lpage>.
                    <pub-id pub-id-type="doi">10.1190/1.1444318</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-15">
                <label>15</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Clarke</surname>
                            <given-names>GKC</given-names>
                        </name>
</person-group>:
                    <article-title>Time-varying deconvolution filters.</article-title>
                    <source>

                        <italic toggle="yes">GEOPHYSICS.</italic>
</source>
                    <year>1968</year>;<volume>33</volume>(<issue>6</issue>):<fpage>936</fpage>&#x2013;<lpage>944</lpage>.
                    <pub-id pub-id-type="doi">10.1190/1.1439987
</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-16">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Salmon</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Schumacher</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Stark</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Bayesian outbreak detection in the presence of reporting delays.</article-title>
                    <source>

                        <italic toggle="yes">Biom J.</italic>
</source>
                    <year>2015</year>;<volume>57</volume>(<issue>6</issue>):<fpage>1051</fpage>&#x2013;<lpage>1067</lpage>.
                    <pub-id pub-id-type="pmid">26250543</pub-id>
                    <pub-id pub-id-type="doi">10.1002/bimj.201400159 </pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-17">
                <label>17</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>McGough</surname>
                            <given-names>SF</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Johansson</surname>
                            <given-names>MA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lipsitch</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Nowcasting by Bayesian Smoothing: A flexible, generalizable model for real-time epidemic tracking.</article-title>
                    <source>

                        <italic toggle="yes">PLOS Comput Biol.</italic>
</source>
                    <year>2020</year>;<volume>16</volume>(<issue>4</issue>):<fpage>e1007735</fpage>.
                    <pub-id pub-id-type="pmid">32251464</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pcbi.1007735 </pub-id>
                    <pub-id pub-id-type="pmcid">7162546</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-18">
                <label>18</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rotejanaprasert</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ekapirat</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Areechokchai</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Bayesian spatiotemporal modeling with sliding windows to correct reporting delays for real-time dengue surveillance in Thailand.</article-title>
                    <source>

                        <italic toggle="yes">Int J Health Geogr.</italic>
</source>
                    <year>2020</year>;<volume>19</volume>(<issue>1</issue>):<fpage>4</fpage>.
                    <pub-id pub-id-type="pmid">32126997</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s12942-020-00199-0</pub-id>
                    <pub-id pub-id-type="pmcid">7055098</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-19">
                <label>19</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wu</surname>
                            <given-names>JT</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Leung</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Leung</surname>
                            <given-names>GM</given-names>
                        </name>
</person-group>:
                    <article-title>Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study.</article-title>
                    <source>

                        <italic toggle="yes">Lancet.</italic>
</source>
                    <year>2020</year>;<volume>395</volume>(<issue>10225</issue>):<fpage>689</fpage>&#x2013;<lpage>697</lpage>.
                    <pub-id pub-id-type="pmid">32014114</pub-id>
                    <pub-id pub-id-type="doi">10.1016/S0140-6736(20)30260-9 </pub-id>
                    <pub-id pub-id-type="pmcid">7159271 </pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-20">
                <label>20</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Arslan</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ozdemir</surname>
                            <given-names>MY</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ucar</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>Nowcasting and Forecasting the Spread of COVID-19 and Healthcare Demand In Turkey, A Modelling Study.</article-title>
                    <source>

                        <italic toggle="yes">medRxiv.</italic>
</source>
                    <year>2020</year>.
                    <pub-id pub-id-type="doi">10.1101/2020.04.13.20063305</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-21">
                <label>21</label>
                <mixed-citation publication-type="journal">
                    <collab>Science Media Centre</collab>:
                    <article-title>Nowcasting and forecasting COVID-19 data</article-title>. [Accessed: 01-Aug-2020].
                    <ext-link ext-link-type="uri" xlink:href="https://www.sciencemediacentre.org/nowcasting-and-forecasting-covid-19-data/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bird</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nielsen</surname>
                            <given-names>B</given-names>
                        </name>
</person-group>:
                    <article-title>Now-casting of Covid-19 deaths in English hospitals</article-title>. [Accessed: 21-Jun-2020].
                    <ext-link ext-link-type="uri" xlink:href="http://users.ox.ac.uk/~nuff0078/Covid/index.htm">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Liljenberg</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>On the use of real-time mortality data in modelling and analysis during an epidemic outbreak - underlying data [Data set].</article-title>
                    <source>

                        <italic toggle="yes">Zenodo.</italic>
</source>
                    <year>2020</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.doi.org/10.5281/zenodo.3992345">http://www.doi.org/10.5281/zenodo.3992345</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <collab>Socialstyrelsen</collab>:
                    <article-title>Statistik om covid-19 bland &#x00e4;ldre efter boendeform - Socialstyrelsen</article-title>. [Accessed: 11-Jul-2020].
                    <ext-link ext-link-type="uri" xlink:href="https://www.socialstyrelsen.se/statistik-och-data/statistik/statistik-om-covid-19/statistik-om-covid-19-bland-aldre-efter-boendeform/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>IHME COVID-19 health service utilization forecasting team, Murray</surname>
                            <given-names>CJL</given-names>
                        </name>
</person-group>:
                    <article-title>Forecasting the impact of the first wave of the COVID-19 pandemic on hospital demand and deaths for the USA and European Economic Area countries.</article-title>
                    <source>

                        <italic toggle="yes">medRxiv.</italic>
</source>
                    <year>2020</year>.
                    <pub-id pub-id-type="doi">10.1101/2020.04.21.20074732</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-26">
                <label>26</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>IHME COVID-19 health service utilization forecasting team, Murray</surname>
                            <given-names>CJL</given-names>
                        </name>
</person-group>:
                    <article-title>COVID-19 estimate downloads | Institute for Health Metrics and Evaluation</article-title>. [Accessed: 10-Jul-2020].
                    <ext-link ext-link-type="uri" xlink:href="http://www.healthdata.org/covid/data-downloads">Reference Source</ext-link>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report30074">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/gatesopenres.14380.r30074</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Truelove</surname>
                        <given-names>Shaun</given-names>
                    </name>
                    <xref ref-type="aff" rid="r30074a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-0538-0607</uri>
                </contrib>
                <aff id="r30074a1">
                    <label>1</label>Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>29</day>
                <month>1</month>
                <year>2021</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2021 Truelove S</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport30074" related-article-type="peer-reviewed-article" xlink:href="10.12688/gatesopenres.13180.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This paper presents an interesting examination of how aggregate death reporting can misrepresent the true mortality curve during the COVID-19 pandemic. Using case data, which captures the true dates of when deaths happened, not just when they were reported, the author developed a mathematical framework for compare the sources of deaths. Through this framework, the author goes on to compare the now-casting with their framework with those of IHME and Harvard.</p>
            <p> </p>
            <p> While the methods are sound, the structure of this manuscript was unconventional, making it somewhat challenging to follow. The author provided only limited background information, and motivation for this work was not overtly clear to the reader. I would like to see more background provided about why we care about these differences in deaths reporting.&#x00a0;</p>
            <p> </p>
            <p> The methods and results were similarly lacking in structure. I was not completely clear what the authors did in terms of methods, particularly with respect to the now-casting, and much of the results are presented as a conversational commentary, rather than scientific findings. It would have been good to seen a quantified impact of using the R-series versus the D-series.</p>
            <p> </p>
            <p> Finally, while I believe this work provides novel findings, I believe these findings could be more clearly presented and emphasized, particularly in terms of how the public should be interpreting model results, and how researchers should be accounting for these. Additionally, the author did not discuss limitations of this work or reliability of the data being used. While the "D-series" is ideal, it is often only a subset of individuals across a country.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Partly</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Yes</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>infectious disease epidemiology and modeling, statistics, COVID-19, vaccination</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report29961">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/gatesopenres.14380.r29961</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Khajanchi</surname>
                        <given-names>Subhas</given-names>
                    </name>
                    <xref ref-type="aff" rid="r29961a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-6533-1121</uri>
                </contrib>
                <aff id="r29961a1">
                    <label>1</label>Department of Mathematics, Presidency University, Kolkata, West Bengal, India</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>18</day>
                <month>11</month>
                <year>2020</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2020 Khajanchi S</copyright-statement>
                <copyright-year>2020</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport29961" related-article-type="peer-reviewed-article" xlink:href="10.12688/gatesopenres.13180.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The COVID-19 pandemic has already spread throughout the world and the people are aware about the diseases and they are using precautions about the pandemic. Still, COVID-19 is spreading very quickly. Some countries like Spain, Australia, Serbia, China, etc. have started a second wave of COVID-19. To stop the spread of the disease, a vaccine is needed. In absence of the vaccine people must have maintained social distancing. In order to maintain the social distancing, one must obey the modeling rule.</p>
            <p> </p>
            <p> The introduction needs to be improved by incorporating some recent references of COVID-19 pandemic. To do so, I suggest some modeling work to be included in the references such as: Sakar 
                <italic>et al</italic>. (2020)
                <sup>
                    <xref ref-type="bibr" rid="rep-ref-29961-1">1</xref>
                </sup>, Khajanchi &amp; Sakar (2020)
                <sup>
                    <xref ref-type="bibr" rid="rep-ref-29961-2">2</xref>
                </sup>, and&#x00a0;Samui
                <italic> et al.</italic> (2020)
                <sup>
                    <xref ref-type="bibr" rid="rep-ref-29961-3">3</xref>
                </sup>.&#x00a0;</p>
            <p> </p>
            <p> In this context, an important factor must be included in this study, that is, the impact of the effect of media. How the COVID-19 dynamics been changed due to incorporation of the media related awareness like use of face masks, non-pharmaceutical interventions, hand sanitization, etc. The author must include the manuscript Khajanchi 
                <italic>et al. </italic>(2020)
                <sup>
                    <xref ref-type="bibr" rid="rep-ref-29961-4">4</xref>
                </sup>&#x00a0;to study the effect of media.</p>
            <p> </p>
            <p> Is there any experimental data to validate the mathematical model? The authors at least describe the basic reproduction number R
                <sub>0</sub> and its impact on COVID-19 pandemic in India. The basic reproduction number R
                <sub>0</sub> is one of the most crucial quantities in infectious diseases, as R
                <sub>0</sub> measures how contagious a disease is. For R
                <sub>0</sub> &lt; 1, the disease is expected to stop spreading, but for R
                <sub>0</sub> = 1 an infected individual can infect on an average 1 person, that is, the spread of the disease is stable. The disease can spread and become epidemic if R
                <sub>0</sub> must be greater than 1
                <sup>
                    <xref ref-type="bibr" rid="rep-ref-29961-5">5</xref>
                </sup>.</p>
            <p> </p>
            <p> Some references contain errors and inconsistent formatting. It is difficult to give credit to research if even elementary aspects of the work are not error free. This should be corrected with care and love to detail.</p>
            <p> </p>
            <p> The manuscript is comprehensive, and I have enjoyed learning about the presented results. I find that the manuscript is written with very poor language and the presentation is not good, and I am in principal in favor of indexing, although the following comments should nevertheless be accommodated in one major revision.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Partly</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Partly</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Partly</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Partly</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Infectious diseases, Ecological systems, Tumor-immune interactions.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-29961-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Modeling and forecasting the COVID-19 pandemic in India.</article-title>
                        <source>
                            <italic>Chaos Solitons Fractals</italic>
                        </source>.<year>2020</year>;<volume>139</volume>:
                        <elocation-id>10.1016/j.chaos.2020.110049</elocation-id>
                        <fpage>110049</fpage>
                        <pub-id pub-id-type="pmid">32834603</pub-id>
                        <pub-id pub-id-type="doi">10.1016/j.chaos.2020.110049</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-29961-2">
                    <label>2</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Forecasting the daily and cumulative number of cases for the COVID-19 pandemic in India.</article-title>
                        <source>
                            <italic>Chaos</italic>
                        </source>.<year>2020</year>;<volume>30</volume>(<issue>7</issue>) :
                        <elocation-id>10.1063/5.0016240</elocation-id>
                        <fpage>071101</fpage>
                        <pub-id pub-id-type="pmid">32752627</pub-id>
                        <pub-id pub-id-type="doi">10.1063/5.0016240</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-29961-3">
                    <label>3</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>A mathematical model for COVID-19 transmission dynamics with a case study of India.</article-title>
                        <source>
                            <italic>Chaos Solitons Fractals</italic>
                        </source>.<year>2020</year>;<volume>140</volume>:
                        <elocation-id>10.1016/j.chaos.2020.110173</elocation-id>
                        <fpage>110173</fpage>
                        <pub-id pub-id-type="pmid">32834653</pub-id>
                        <pub-id pub-id-type="doi">10.1016/j.chaos.2020.110173</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-29961-4">
                    <label>4</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Dynamics of the COVID-19 pandemic in India</article-title>.<year>2020</year>;
                        <elocation-id>10.21203/rs.3.rs-27112/v1</elocation-id>
                        <pub-id pub-id-type="doi">10.21203/rs.3.rs-27112/v1</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-29961-5">
                    <label>5</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Mathematical analysis of the global dynamics of a HTLV-I infection model, considering the role of cytotoxic T-lymphocytes</article-title>.
                        <source>
                            <italic>Mathematics and Computers in Simulation</italic>
                        </source>.<year>2021</year>;<volume>180</volume>:
                        <elocation-id>10.1016/j.matcom.2020.09.009</elocation-id>
                        <fpage>354</fpage>-<lpage>378</lpage>
                        <pub-id pub-id-type="doi">10.1016/j.matcom.2020.09.009</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
    </sub-article>
</article>
