Ensuring safe behaviors, i.e., minimizing the probability that a control strategy yields undesirable effects, becomes crucial when robots interact with humans in semi-structured environments through adaptive control strategies. In previous papers, we contributed to propose an approach that (i) computes control policies through reinforcement learning, (ii) verifies them against safety requirements with probabilistic model checking, and (iii) repairs them with greedy local methods until requirements are met. Such learn-verify-repair work-flow was shown effective in some — relatively simple and confined — test cases. In this paper, we frame human-robot interaction in light of such previous contributions, and we test the effectiveness of the learn-verify-repair approach in a more realistic factory-to-home deployment scenario. The purpose of our test is to assess whether we can verify that interaction patterns are carried out with negligible human-to-robot collision probability and whether, in the presence of user tuning, strategies which determine offending behaviors can be effectively repaired.
Testing a learn-verify-repair approach for safe human-robot interaction / Pathak, Shashank; Pulina, Luca; Tacchella, Armando. - 9336:(2015), pp. 260-273. (Intervento presentato al convegno 14th International Conference of the Italian Association for Artificial Intelligence, 2015 tenutosi a ita nel 2015) [10.1007/978-3-319-24309-2_20].
Testing a learn-verify-repair approach for safe human-robot interaction
PULINA, Luca;
2015-01-01
Abstract
Ensuring safe behaviors, i.e., minimizing the probability that a control strategy yields undesirable effects, becomes crucial when robots interact with humans in semi-structured environments through adaptive control strategies. In previous papers, we contributed to propose an approach that (i) computes control policies through reinforcement learning, (ii) verifies them against safety requirements with probabilistic model checking, and (iii) repairs them with greedy local methods until requirements are met. Such learn-verify-repair work-flow was shown effective in some — relatively simple and confined — test cases. In this paper, we frame human-robot interaction in light of such previous contributions, and we test the effectiveness of the learn-verify-repair approach in a more realistic factory-to-home deployment scenario. The purpose of our test is to assess whether we can verify that interaction patterns are carried out with negligible human-to-robot collision probability and whether, in the presence of user tuning, strategies which determine offending behaviors can be effectively repaired.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.