Combination of learning from non-optimal demonstrations and feedbacks using inverse reinforcement learning and Bayesian policy improvement

Ali Ezzeddine, Nafee Mourad, Babak Nadjar Araabi, Majid Nili Ahmadabadi. Combination of learning from non-optimal demonstrations and feedbacks using inverse reinforcement learning and Bayesian policy improvement. Expert Syst. Appl., 112:331-341, 2018.
