kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Cooperative System Identification via Correctional Learning
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Decision and Control Systems (Automatic Control).
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Decision and Control Systems (Automatic Control).ORCID iD: 0000-0003-4533-4971
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Decision and Control Systems (Automatic Control).ORCID iD: 0000-0003-0355-2663
KTH, School of Engineering Sciences (SCI), Mathematics (Dept.), Optimization and Systems Theory. KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Decision and Control Systems (Automatic Control).ORCID iD: 0000-0002-1927-1690
2021 (English)In: IFAC PAPERSONLINE, Elsevier BV , 2021, Vol. 54, no 7, p. 19-24Conference paper, Published paper (Refereed)
Abstract [en]

We consider a cooperative system identification scenario in which an expert agent (teacher) knows a correct, or at least a good, model of the system and aims to assist a learner-agent (student), but cannot directly transfer its knowledge to the student. For example, the teacher's knowledge of the system might be abstract or the teacher and student might be employing different model classes, which renders the teacher's parameters uninformative to the student. In this paper, we propose correctional learning as an approach to the above problem: suppose that in order to assist the student, the teacher can intercept the observations collected from the system and modify them to maximize the amount of information the student receives about the system. We formulate a general solution as an optimization problem, which for a multinomial system instantiates itself as an integer program. Furthermore, we obtain finite-sample results on the improvement that the assistance from the teacher results in (as measured by the reduction in the variance of the estimator) for a binomial system. In numerical experiments, we illustrate the proposed algorithms and verify the theoretical results that have been derived in the paper.

Place, publisher, year, edition, pages
Elsevier BV , 2021. Vol. 54, no 7, p. 19-24
Keywords [en]
Cooperative system identification, assisted learning, correctional learning, student-expert, transfer learning
National Category
Pedagogy
Identifiers
URN: urn:nbn:se:kth:diva-303777DOI: 10.1016/j.ifacol.2021.08.328ISI: 000696396200005Scopus ID: 2-s2.0-85118103651OAI: oai:DiVA.org:kth-303777DiVA, id: diva2:1605339
Conference
19th IFAC Symposium on System Identification (SYSID), JUL 13-16, 2021, Padova, ITALY
Note

QC 20211022

Available from: 2021-10-22 Created: 2021-10-22 Last updated: 2022-06-25Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Lourenço, InêsMattila, RobertRojas, Cristian R.Wahlberg, Bo

Search in DiVA

By author/editor
Lourenço, InêsMattila, RobertRojas, Cristian R.Wahlberg, Bo
By organisation
Decision and Control Systems (Automatic Control)Optimization and Systems Theory
Pedagogy

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 63 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf