Simone Parisi

Quick Info

Research Interests

Reinforcement learning, policy search, multiobjective optimization, state representation, feature selection, robotics

More Information

Curriculum Vitae Publications Google Citations DBLP

Contact Information

Mail. Simone Parisi
TU Darmstadt, FG IAS,
Hochschulstr. 10, 64289 Darmstadt
Office. Room E226, Building S2|02
work+49-6151-16-20073

Simone Parisi joined the Intelligent Autonomous System lab on October, 1st, 2014 as a PhD student. His research interests include, amongst others, reinforcement learning, robotics and multi-objective optimization. During his PhD, Simone is working on Scalable Autonomous Reinforcement Learning (ScARL), developing and evaluating new methods in the field of robotics to guarantee both high degree of autonomy and the ability to solve complex task.

Before his PhD, Simone completed his MSc in Computer Science Engineering at the Politecnico di Milano, Italy, and at the University of Queensland, Australia. His thesis, entitled “Study and analysis of policy gradient approaches for multi-objective decision problems", was written under the supervision of Prof. Marcello Restelli and PhD Matteo Pirotta.

Research Interests

Over the last decade, reinforcement learning has established as a framework for solving a large variety of tasks in robotics. A lot of effort has been directed towards scaling reinforcement learning to control high-dimensional systems and tasks (such as skills with many degrees of freedom). These advances, however, generally depend on hand-crafted state description as well as pre-structured parametrized policies. Furthermore, reward shaping using expert knowledge is frequently needed to scale reinforcement learning to high dimensional tasks. This large amount of required pre-structuring is in stark contrast to the goal of developing autonomous learning. It is therefore necessary to develop systematic methods to increase the autonomy of the learning system while keeping their scalability, by going beyond traditional approaches.

Software

MiPS: A minimal toolbox for Matlab with some of the most famous policy search algorithms, as well as some recent multi-objective methods and benchmark problems in reinforcement learning. It was developed with the support of Matteo Pirotta.

Key References

  1. Tangkaratt, V.; van Hoof, H.; Parisi, S.; Neumann, G.; Peters, J.; Sugiyama, M. (2017). Policy Search with High-Dimensional Context Variables, Proceedings of the AAAI Conference on Artificial Intelligence (AAAI).   See Details [Details]   Download Article [PDF]   BibTeX Reference [BibTex]
  2. Parisi, S.; Abdulsamad, H.; Paraschos, A.; Daniel, C.; Peters, J. (2015). Reinforcement Learning vs Human Programming in Tetherball Robot Games, Proceedings of the IEEE/RSJ Conference on Intelligent Robots and Systems (IROS).   See Details [Details]   Download Article [PDF]   BibTeX Reference [BibTex]
  3. Parisi, S.; Pirotta, M.; Peters, J. (2017). Manifold-based Multi-objective Policy Search with Sample Reuse, Neurocomputing, Special Issue on Multi-Objective Reinforcement Learning, 263, pp.3-14.   See Details [Details]   Download Article [PDF]   BibTeX Reference [BibTex]
  4. Parisi, S.; Pirotta, M.; Restelli, M. (2016). Multi-objective Reinforcement Learning through Continuous Pareto Manifold Approximation, Journal of Artificial Intelligence Research (JAIR), 57, pp.187-227.   See Details [Details]   Download Article [PDF]   BibTeX Reference [BibTex]

  

zum Seitenanfang