Combination of learning from non-optimal demonstrations and feedbacks using inverse reinforcement learning and Bayesian policy improvement

Ali Ezzeddine, Nafee Mourad, Babak Nadjar Araabi, Majid Nili Ahmadabadi. Combination of learning from non-optimal demonstrations and feedbacks using inverse reinforcement learning and Bayesian policy improvement. Expert Syst. Appl., 112:331-341, 2018.

Authors

Ali Ezzeddine

Nafee Mourad

Babak Nadjar Araabi

Majid Nili Ahmadabadi
