OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Machine learning techniques in searches for $$t\bar{t}h$$ in the $$h \to b\bar{b}$$ decay channel

Abstract

Study of the production of pairs of top quarks in association with a Higgs boson is one of the primary goals of the Large Hadron Collider over the next decade, as measurements of this process may help us to understand whether the uniquely large mass of the top quark plays a special role in electroweak symmetry breaking. Higgs bosons decay predominantly to $$b\bar{b}$$, yielding signatures for the signal that are similar to $$t\bar{t}$$ + jets with heavy flavor. Though particularly challenging to study due to the similar kinematics between signal and background events, such final states ($$t\bar{t}b\bar{b}$$) are an important channel for studying the top quark Yukawa coupling. This paper presents a systematic study of machine learning (ML) methods for detecting $$t\bar{t}h$$ in the $$h \to b\bar{b}$$ decay channel. Among the seven ML methods tested, we show that neural network models outperform alternative methods. In addition, two neural models used in this paper outperform NeuroBayes, one of the standard algorithms used in current particle physics experiments. We further study the effectiveness of ML algorithms by investigating the impact of feature set and data size, as well as the depth of the networks for neural models. We demonstrate that an extended feature set leads to improvement of performance over basic features. Furthermore, the availability of large samples for training is found to be important for improving the performance of the techniques. For the features and the data set studied here, neural networks of more layers deliver comparable performance to their simpler counterparts.

Authors:
Santos, Robert [1]; Nguyen, M. [1]; Webster, Jordan [2]; Ryu, Soo [2]; Adelman, Jahred [1]; Chekanov, Sergei [2]; Zhou, Jie [1]
  1. Northern Illinois Univ., DeKalb, IL (United States)
  2. Argonne National Lab. (ANL), Lemont, IL (United States)
Publication Date:
2017-04-10
Research Org.:
Argonne National Lab. (ANL), Argonne, IL (United States)
Sponsoring Org.:
National Science Foundation (NSF); USDOE Office of Science (SC)
OSTI Identifier:
1373727
Grant/Contract Number:
AC02-06CH11357
Resource Type:
Journal Article: Accepted Manuscript
Journal Name:
Journal of Instrumentation
Additional Journal Information:
Journal Volume: 12; Journal Issue: 04; Journal ID: ISSN 1748-0221
Publisher:
Institute of Physics (IOP)
Country of Publication:
United States
Language:
English
Subject:
72 PHYSICS OF ELEMENTARY PARTICLES AND FIELDS; 46 INSTRUMENTATION RELATED TO NUCLEAR SCIENCE AND TECHNOLOGY; Analysis and statistical methods; Data processing methods

Citation Formats

Santos, Robert, Nguyen, M., Webster, Jordan, Ryu, Soo, Adelman, Jahred, Chekanov, Sergei, and Zhou, Jie. Machine learning techniques in searches for $t\bar{t}h$ in the $h \to b\bar{b}$ decay channel. United States: N. p., 2017. Web. doi:10.1088/1748-0221/12/04/P04014.
Santos, Robert, Nguyen, M., Webster, Jordan, Ryu, Soo, Adelman, Jahred, Chekanov, Sergei, & Zhou, Jie. Machine learning techniques in searches for $t\bar{t}h$ in the $h \to b\bar{b}$ decay channel. United States. doi:10.1088/1748-0221/12/04/P04014.
Santos, Robert, Nguyen, M., Webster, Jordan, Ryu, Soo, Adelman, Jahred, Chekanov, Sergei, and Zhou, Jie. 2017. "Machine learning techniques in searches for $t\bar{t}h$ in the $h \to b\bar{b}$ decay channel". United States. doi:10.1088/1748-0221/12/04/P04014. https://www.osti.gov/servlets/purl/1373727.
@article{osti_1373727,
title = {Machine learning techniques in searches for $t\bar{t}h$ in the $h \to b\bar{b}$ decay channel},
author = {Santos, Robert and Nguyen, M. and Webster, Jordan and Ryu, Soo and Adelman, Jahred and Chekanov, Sergei and Zhou, Jie},
abstractNote = {Study of the production of pairs of top quarks in association with a Higgs boson is one of the primary goals of the Large Hadron Collider over the next decade, as measurements of this process may help us to understand whether the uniquely large mass of the top quark plays a special role in electroweak symmetry breaking. Higgs bosons decay predominantly to $b\bar{b}$, yielding signatures for the signal that are similar to $t\bar{t}$ + jets with heavy flavor. Though particularly challenging to study due to the similar kinematics between signal and background events, such final states ($t\bar{t}b\bar{b}$) are an important channel for studying the top quark Yukawa coupling. This paper presents a systematic study of machine learning (ML) methods for detecting $t\bar{t}h$ in the $h \to b\bar{b}$ decay channel. Among the seven ML methods tested, we show that neural network models outperform alternative methods. In addition, two neural models used in this paper outperform NeuroBayes, one of the standard algorithms used in current particle physics experiments. We further study the effectiveness of ML algorithms by investigating the impact of feature set and data size, as well as the depth of the networks for neural models. We demonstrate that an extended feature set leads to improvement of performance over basic features. Furthermore, the availability of large samples for training is found to be important for improving the performance of the techniques. For the features and the data set studied here, neural networks of more layers deliver comparable performance to their simpler counterparts.},
doi = {10.1088/1748-0221/12/04/P04014},
journal = {Journal of Instrumentation},
number = 04,
volume = 12,
place = {United States},
year = {2017},
month = apr
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Similar Records:
  • A search for a charged Higgs boson ($$H^+$$) in $$t\bar{t}$$ decays is presented, where one of the top quarks decays via $$t \to H^+ b$$, followed by $$H^+ \to$$ two jets ($$c\bar{s}$$). The other top quark decays to $$Wb$$, where the $$W$$ boson then decays into a lepton ($$e/\mu$$) and a neutrino. The data were recorded in $$pp$$ collisions at $$\sqrt{s} = 7$$ TeV by the ATLAS detector at the LHC in 2011, and correspond to an integrated luminosity of 4.7 fb$$^{-1}$$. Finally, with no observation of a signal, 95% confidence level (CL) upper limits are set on the decay branching ratio of top quarks to charged Higgs bosons varying between 5% and 1% for $$H^+$$ masses between 90 GeV and 150 GeV, assuming $$B(H^+ \to c\bar{s}) = 100\%$$.
  • Jet multiplicity distributions in top quark pair ($$t\bar{t}$$) events are measured in $$pp$$ collisions at a centre-of-mass energy of 8 TeV with the CMS detector at the LHC using a data set corresponding to an integrated luminosity of 19.7 fb$$^{-1}$$. The measurement is performed in the dilepton decay channels ($$e^+e^-$$, $$\mu^+\mu^-$$, and $$e^\pm\mu^\mp$$). Furthermore, the absolute and normalized differential cross sections for $$t\bar{t}$$ production are measured as a function of the jet multiplicity in the event for different jet transverse momentum thresholds and the kinematic properties of the leading additional jets. The differential $$t\bar{t}b$$ and $$t\bar{t}b\bar{b}$$ cross sections are presented for the first time as a function of the kinematic properties of the leading additional $$b$$ jets. Furthermore, the fraction of events without additional jets above a threshold is measured as a function of the transverse momenta of the leading additional jets and the scalar sum of the transverse momenta of all additional jets. Finally, the data are compared and found to be consistent with predictions from several perturbative quantum chromodynamics event generators and a next-to-leading-order calculation.
  • CP-nonconserving effects in the reaction $$p\bar{p} \to t\bar{b}+X \to W^+ b\bar{b}+X$$, driven by the supersymmetric CP-odd phase of the top squark trilinear soft breaking term $$\arg(A_t)$$, are studied. We discuss the CP-nonconserving effects in both production and the associated decay amplitudes of the top quark. We find that, within a plausible low energy scenario of the MSSM and keeping the neutron electric dipole moment below its current limit, a CP-violating cross-section asymmetry as large as 2–3% can arise if some of the parameters lie in a favorable range. A partial rate asymmetry originating only in the top quark decay $$t \to W^+ b$$ is found to be, in general, below the 0.1% level, which is somewhat smaller than previous claims. For a low $$\tan\beta$$ of order one the decay asymmetry can reach at the most $$\sim 0.3\%$$. This (few) percent level overall CP-violating signal in $$p\bar{p} \to t\bar{b}+X \to W^+ b\bar{b}+X$$ might be within the reach of the future 2(4) TeV $$p\bar{p}$$ Fermilab Tevatron collider that may be able to produce $$\sim 10000$$ ($$\sim 30000$$) such $$t\bar{b}$$ events with an integrated luminosity of 30 fb$$^{-1}$$. In particular, it may be used to place an upper bound on $$\arg(A_t)$$ if indeed $$\arg(\mu) \to 0$$, as implied from the present experimental limit on the neutron electric dipole moment. The partial rate asymmetry in the top quark decay ($$\sim \text{few} \times 10^{-3}$$) may also be within the reach of the CERN LHC with $$\sim 10^{7}$$ pairs of $$t\bar{t}$$ produced, provided detector systematics are sufficiently small. We also show that if the GUT scale universality of the soft breaking trilinear $$A$$ terms is relaxed, then the phases associated with $$\arg(A_u)$$ and $$\arg(A_d)$$ can take values up to $$\sim \text{few} \times 10^{-1}$$ even with squark and gluino masses of several hundred GeV without contradicting the experimental limit on the neutron electric dipole moment. © 1997 The American Physical Society
  • We demonstrate that expected efficiencies and purities for $$b$$ tagging at the detectors at the Superconducting Super Collider and the CERN Large Hadron Collider may allow detection of the standard model Higgs boson in $$t\bar{t}H$$ production, with $$H \to b\bar{b}$$ decay, for $$80 \lesssim m_H \lesssim 130$$ GeV, provided $$m_t \gtrsim 130$$ GeV.
  • Phenomenological analysis of the decay modes $$\psi\,(\psi') \to B_8\bar{B}_8$$ and $$B_{10}\bar{B}_{10}$$ has been given and compared to the presently available experimental data. The interference effects between the electromagnetic and nonelectromagnetic currents responsible for the decays are very important. So far, the data prefer the choice of the constructive- rather than the destructive-interference pattern. Many other predictions on these decay widths are also given. Especially, some information on electromagnetic form factors of the nucleon at $$t = 9.58$$ GeV$$^2$$ has been extracted from the data. (AIP)