Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Decision tree insights analytics (DTIA) tool: an analytic framework to identify insights from large data records across fields of science
KTH, Skolan för teknikvetenskap (SCI), Fysik, Kärnvetenskap och kärnteknik.
Univ New South Wales, Sch Engn & Informat Technol, Sydney, Australia..
Karolinska Inst, Dept Cell & Mol Biol CMB, Stockholm, Sweden..ORCID-id: 0000-0001-9407-7540
Karolinska Inst, Dept Cell & Mol Biol CMB, Stockholm, Sweden.;Norwegian Univ Sci & Technol, Dept Biomed Lab Sci, Trondheim, Norway..
Vise andre og tillknytning
2024 (engelsk)Inngår i: Machine Learning: Science and Technology, E-ISSN 2632-2153, Vol. 5, nr 4, artikkel-id 045004Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

Supervised machine learning (SML) techniques have been developed since the 1960s. Most of their applications were oriented towards developing models capable of predicting numerical values or categorical output based on a set of input variables (input features). Recently, SML models' interpretability and explainability were extensively studied to have confidence in the models' decisions. In this work, we propose a new deployment method named Decision Tree Insights Analytics (DTIA) that shifts the purpose of using decision tree classification from having a model capable of differentiating the different categorical outputs based on the input features to systematically finding the associations between inputs and outputs. DTIA can reveal interesting areas in the feature space, leading to the development of research questions and the discovery of new associations that might have been overlooked earlier. We applied the method to three case studies: (1) nuclear reactor accident propagation, (2) single-cell RNA sequencing of Niemann-Pick disease type C1 in mice, and (3) bulk RNA sequencing for breast cancer staging in humans. The developed method provided insights into the first two. On the other hand, it showed some of the method's limitations in the third case study. Finally, we presented how the DTIA's insights are more agreeable with the abstract information gain calculations and provide more in-depth information that can help derive more profound physical meaning compared to the random forest's feature importance attribute and K-means clustering for feature ranking.

sted, utgiver, år, opplag, sider
IOP Publishing , 2024. Vol. 5, nr 4, artikkel-id 045004
Emneord [en]
decision trees, machine learning interpretability and explainability, nuclear reactor safety, RNA sequencing, Niemann-Pick type C1 (NPC), random forest feature importance, exploratory data analysis
HSV kategori
Identifikatorer
URN: urn:nbn:se:kth:diva-355198DOI: 10.1088/2632-2153/ad7f23ISI: 001327892500001Scopus ID: 2-s2.0-85209642339OAI: oai:DiVA.org:kth-355198DiVA, id: diva2:1907685
Merknad

QC 20241023

Tilgjengelig fra: 2024-10-23 Laget: 2024-10-23 Sist oppdatert: 2025-12-12bibliografisk kontrollert
Inngår i avhandling
1. Decision Tree Insights Analytics (DTIA): An Explainable AI Framework for Severe Accident Analysis
Åpne denne publikasjonen i ny fane eller vindu >>Decision Tree Insights Analytics (DTIA): An Explainable AI Framework for Severe Accident Analysis
2025 (engelsk)Doktoravhandling, med artikler (Annet vitenskapelig)
Abstract [en]

In nuclear reactor safety analysis, we label an accident as severe once a partial core meltdown and material relocation begin. Researchers use simulation tools such as ANSYS and MELCOR to study these events safely, producing vast and complex datasets. In this work, we applied machine learning explainability and interpretability to extract insights from severe accident simulations for the Nordic boiling water reactor (BWR) through five iterative studies. First, we examined the explainability of the decision tree classification algorithm to distinguish between accident types using time-wise pressure vessel external temperature. Second, we generalised the model to create a more statistically robust and generic framework, introducing the open-source Decision Tree Insights Analytics (DTIA) framework (https://github.com/KHossny/DTIA), which combines explainability, interpretability, and statistical robustness. Third, we applied DTIA to high-dimensional MELCOR COR package data for a station blackout combined with a loss-of-coolant accident (SBO + LOCA) in a Nordic BWR, revealing new findings. Fourth, we used DTIA to compare structural variables of the reactor pressure vessel lower head under SBO and SBO + LOCA conditions. Finally, we coupled DTIA with K-Means clustering to address its need for labelled data, uncovering previously overlooked events such as canister melting. We concluded that the patterns identified by machine learning in mapping inputs to outputs can uncover insights that were previously overlooked, particularly in high-dimensional and complex datasets.

Abstract [sv]

Inom analysen av kärnreaktorsäkerhet betecknar vi en olycka som allvarlig när en partiell härdsmälta och materialomplacering inleds. Forskare använder simuleringsverktyg som ANSYS och MELCOR för att studera dessa händelser på ett säkert sätt, vilket genererar stora och komplexa datamängder. I detta arbete tillämpade vi maskininlärningens förklarbarhet och tolkbarhet för att utvinna insikter från simuleringar av allvarliga olyckor för den nordiska kokvattenreaktorn (BWR) genom fem iterativa studier. För det första undersökte vi förklarbarheten hos beslutsträdsklassificeringsalgoritmen för att skilja mellan olyckstyper baserat på tidsberoende yttre temperatur på reaktortanken. För det andra generaliserade vi modellen för att skapa ett mer statistiskt robust och generiskt ramverk och introducerade det öppna ramverket Decision Tree Insights (https://github.com/KHossny/DTIA), som Analytics (DTIA) kombinerar förklarbarhet, tolkbarhet och statistisk robusthet. För det tredje tillämpade vi DTIA på högdimensionella data från MELCOR:s COR-paket för ett totalt strömavbrott kombinerat med ett kylvätskeförlustscenario (SBO + LOCA) i en nordisk BWR, vilket avslöjade nya fynd. För det fjärde använde vi DTIA för att jämföra strukturella variabler i reaktortankens nedre del under SBO- och SBO + LOCAförhållanden. Slutligen kopplade vi DTIA till K-Means-klustring för att hantera behovet av märkta data, vilket avslöjade tidigare förbisedda händelser som smältning av kapseln. Vi drog slutsatsen att de mönster som identifieras av maskininlärning vid koppling mellan indata och utdata kan avslöja insikter som tidigare förbises, särskilt i högdimensionella och komplexa datamängder.

sted, utgiver, år, opplag, sider
Stockholm: KTH Royal Institute of Technology, 2025. s. 111
Serie
TRITA-SCI-FOU ; 2025:78
Emneord
Explainable AI, Decision Tree Insights Analytics (DTIA), Nuclear Severe Accidents Analysis, Decision Tree Classification, Reactor Pressure Vessel Failure., Förklarbar AI, Decision Tree Insights Analytics (DTIA), Analys av allvarliga kärnkraftsolyckor, Beslutsträdsklassificering, Haveri av reaktortryckkärl.
HSV kategori
Forskningsprogram
Energiteknik; Fysik, Kärnenergiteknik
Identifikatorer
urn:nbn:se:kth:diva-374074 (URN)978-91-8106-509-1 (ISBN)
Disputas
2026-01-16, FB51, Roslagstullsbacken 21, AlbaNova University Centrum, 114 21, Stockholm, Sweden., Stockholm, 10:00 (engelsk)
Opponent
Veileder
Forskningsfinansiär
Swedish Radiation Safety Authority
Merknad

QC 2025-12-12

Tilgjengelig fra: 2025-12-12 Laget: 2025-12-12 Sist oppdatert: 2025-12-12bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Andre lenker

Forlagets fulltekstScopus

Person

Hossny, KarimVillanueva, Walter

Søk i DiVA

Av forfatter/redaktør
Hossny, KarimCougnoux, AntonyVillanueva, Walter
Av organisasjonen
I samme tidsskrift
Machine Learning: Science and Technology

Søk utenfor DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric

doi
urn-nbn
Totalt: 119 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf