**************************** Why UndersmoothedUnfolding? **************************** --------------------------------------------------------- Coverage as a function of regularization strength --------------------------------------------------------- ``UndersmoothedUnfolding`` an extension for the unfolding software `TUnfoldV17.8 `_ [2]_ written by Stefan Schmitt. `TUnfold `_ implements Tikhonov regularization for unfolding smeared data from particle detectors. However, as demonstrated by Kuusela (2016) [1]_, the resulting confidence intervals may seriously underestimate the true uncertainty in the unfolded space, unless care is taken in the choice of the regularization strength :math:`\tau`. .. figure:: plots/coverage_peak_bin_curvature.png :width: 80% :align: center Coverage is empirically obtained with 10,000 independent realizations of the data. The plot above illustrates the empirical coverage at the peak bin (with highest :math:`y`-value) for various values of the regularization strength in the two Gaussian peaks example described below. Coverage starts at the nominal level (68 %) when there is no regularization. However, as the regularization strength increases, coverage gradually decreases to zero. The first vertical line indicates the average regularization strength chosen by ``UndersmoothedUnfolding::UndersmoothTau``, and the corresponding coverage is near the nominal level (68 %). On the other hand, the second vertical line, corresponding to the average regularization strength chosen by ``TUnfold::ScanLcurve``, shows serious undercoverage at around 20%. This result shows that ``TUnfold::ScanLcurve`` regularizes too strongly from the perspective of uncertainty quantification, while ``UndersmoothedUnfolding::UndersmoothTau`` has chosen a regularization strength that results in better-calibrated uncertainties. .. _demo-two-peaks: ------------------------------------------- Coverage demonstration I: Two Gaussian peaks ------------------------------------------- The first test case we consider has two Gaussian peaks on top of a uniform background, resembling situations where invariant mass peaks are observed on top of some background events. The observations are smeared with Gaussian noise, resulting in a Gaussian deconvolution problem. .. figure:: plots/undersmooth_demo_twopeaks.png :width: 100% :align: center The coverage is estimated by repeating the unfolding 1,000 times with independent realizations of the data. The average confidence interval length for each method is indicated above the top plots. The top two plots compare the performance of the unfolded confidence intervals when :math:`\tau` is chosen using ``TUnfold::ScanLcurve`` and the algorithm ``UndersmoothedUnfolding::UndersmoothTau`` provided by this software. The tolerance :math:`\epsilon` is set to 0.01, so the intervals from ``UndersmoothTau`` should have 67% coverage. The bottom plots show one realization of the unfolded confidence intervals for each method. The confidence intervals provided by ``TUnfold::ScanLcurve`` are too short and suffer from drastic undercoverage, while the intervals provided by ``UndersmoothedUnfolding::UndersmoothTau`` have nearly nominal coverage without being excessively long. .. _demo-steeply-falling: -------------------------------------------------- Coverage demonstration II: Steeply falling spectrum -------------------------------------------------- The second test case we consider has a steeply falling particle spectrum, and we use the inclusive jet transverse momentum spectrum [3]_ as an example. Observing a jet, a collimated stream of energetic particles, in an LHC particle detector implies the creation of a quark or a gluon, and analyzing the properties of these jets is an important test of the Standard Model of particle physics. In this example, the smearing kernel is Gaussian with a variance that changes along the :math:`x`-axis. See Kuusela (2016) [1]_ for a more detailed description of the experiment setup. .. figure:: plots/undersmooth_demo_incjets.png :width: 100% :align: center The coverage is estimated by repeating the unfolding 1,000 times with independent realizations of the data. The average confidence interval length for each method is indicated above the top plots. Again, we see that the confidence intervals provided by ``TUnfold::ScanLcurve`` suffer from drastic undercoverage especially on the left side of the spectrum, while the intervals provided by ``UndersmoothedUnfolding::UndersmoothTau`` have nearly nominal coverage. .. [1] M. Kuusela, “Uncertainty quantification in unfolding elementary particle spectra at the Large Hadron Collider”, PhD thesis, EPFL (2016) .. [2] S. Schmitt, “TUnfold, an algorithm for correcting migration effects in high energy physics”, Journal of Instrumentation **7** (2012) .. [3] CMS Collaboration, "Measurement of the Inclusive Jet Cross Section in :math:`pp` Collisions at :math:`\sqrt{s} = 7` TeV", Physical Review D **107** (2011)