Matteo Gagliolo
CoMo - Office 10 G 711
Department of Computer Science
Faculty of Sciences (WE)
Vrije Universiteit Brussel
Pleinlaan 2
B-1050 Brussels
Belgium
Tel. +32 2 629 37 11
Fax +32 2 629 37 08
mgagliol@vub.ac.be
|
News
Research interests
- Single and multi-agent reinforcement learning, multi-armed bandit problems, online learning.
- Social learning, evolutionary game theory, complex dynamic networks.
- Recurrent artificial neural networks, neuroevolution and evolutionary computation in general.
- Static and dynamic algorithm portfolios, restart strategies, meta-learning, algorithm selection.
- Survival analysis and multilevel analysis methods for algorithm performance modeling.
Publications 
-
Gagliolo, M., Van Vaerenbergh, K., Rodriguez, A., Nowé, A., Goossens, S.,
Pinte, G., and Symens, W. (2011b).
Policy search reinforcement learning for automatic wet clutch
engagement.
In 15th International Conference on System Theory, Control and
Computing - ICSTCC 2011, pages 250-255. IEEE.
[ bib |
event ]
-
Pinte, G., Stoev, J., Symens, W., Dutta, A., Zhong, Y., Wyns, B., De Keyser,
R., Depraetere, B., Swevers, J., Gagliolo, M., and Nowé, A. (2011).
Learning strategies for wet clutch control.
In 15th International Conference on System Theory, Control and
Computing - ICSTCC 2011, pages 467-474. IEEE.
[ bib |
event ]
-
Rodriguez, A., Gagliolo, M., Vrancx, P., Grau, R., and Nowé, A. (2011).
Improving the performance of continuous action reinforcement learning
automata.
9th European Workshop on Reinforcement Learning, EWRL 2011.
[ bib |
event |
.pdf ]
-
Gagliolo, M., Van Vaerenbergh, K., Rodriguez, A., Nowé, A., Goossens, S.,
Pinte, G., and Symens, W. (2011a).
Policy gradient methods for controlling systems with discrete sensor
information.
20th Annual Belgian-Dutch Conference on Machine Learning -
BeNeLearn, online proceedings, p. 115-116.
[ bib |
event |
.pdf ]
-
Gagliolo, M. and Schmidhuber, J. (2011).
Algorithm portfolio selection as a bandit problem with unbounded
losses.
Annals of Mathematics and Artificial Intelligence,
61(2):49-86.
[ bib |
DOI |
.pdf |
Abstract ]
-
Goossens, S., Pinte, G., Symens, W., Gagliolo, M., Rodriguez, A., and Nowé,
A. (2011).
Reinforcement learning for repetitive systems with discrete sensors.
In 30th Benelux Meeting on Systems and Control - Book of
Abstracts, page 149, Gent, Belgium. Universiteit Gent.
[ bib |
event |
slides |
.pdf ]
-
Gagliolo, M. and Legrand, C. (2010).
Algorithm survival analysis.
In Bartz-Beielstein, T., Chiarandini, M., Paquete, L., and Preuss,
M., editors, Experimental Methods for the Analysis of Optimization
Algorithms, pages 161-184. Springer, Berlin, Heidelberg.
[ bib |
DOI |
.pdf |
Abstract ]
-
Gagliolo, M. and Schmidhuber, J. (2010).
Algorithm selection as a bandit problem with unbounded losses.
In Blum, C. and Battiti, R., editors, Learning and Intelligent
Optimization. 4th International Conference, LION 4, Venice, Italy, January
18-22, 2010. Selected Papers, volume 6073 of LNCS, pages 82-96,
Berlin, Heidelberg. Springer.
[ bib |
DOI |
arXiv |
event |
slides |
.pdf |
Abstract ]
-
Gagliolo, M. (2010).
Online Dynamic Algorithm Portfolios.
PhD thesis, IDSIA/University of Lugano, Lugano, Switzerland.
[ bib |
slides |
.pdf |
Abstract ]
-
Gagliolo, M., Legrand, C., and Birattari, M. (2009).
Mixed-effects modeling of optimisation algorithm performance.
In Stützle, T., Birattari, M., and Hoos, H. H., editors,
Engineering Stochastic Local Search Algorithms. Designing, Implementing and
Analyzing Effective Heuristics, Second International Workshop, SLS 2009,
Brussels, Belgium, September 3-4, 2009. Proceedings, volume 5752 of
LNCS, pages 150-154. Springer.
[ bib |
DOI |
event |
poster |
.pdf |
Abstract ]
-
Gagliolo, M. and Schmidhuber, J. (2009).
Towards distributed algorithm portfolios.
In Corchado, J. M., Rodríguez, S., Llinas, J., and Molina, J. M.,
editors, International Symposium on Distributed Computing and Artificial
Intelligence 2008 (DCAI 2008), volume 50 of Advances in Soft
Computing, pages 634-643. Springer.
[ bib |
DOI |
event |
slides |
.pdf |
Abstract ]
-
Gagliolo, M. and Schmidhuber, J. (2008).
Algorithm selection as a bandit problem with unbounded losses.
Technical Report IDSIA - 07 - 08, IDSIA.
Published in LION4 Proceedings, Springer LNCS, 2010.
[ bib |
arXiv |
Abstract ]
-
Schmidhuber, J., Wierstra, D., Gagliolo, M., and Gomez, F. (2007).
Training recurrent networks by evolino.
Neural computation, 19(3):757-779.
[ bib |
DOI |
.pdf |
Abstract ]
-
Gagliolo, M. (2007).
Universal search.
Scholarpedia, 2(11):2575.
[ bib |
DOI |
http ]
-
Gagliolo, M. and Schmidhuber, J. (2007).
Learning restart strategies.
In Veloso, M. M., editor, IJCAI 2007 - Twentieth International
Joint Conference on Artificial Intelligence, vol. 1, pages 792-797. AAAI
Press.
[ bib |
event |
slides |
.pdf |
Abstract ]
-
Gagliolo, M. and Schmidhuber, J. (2006d).
Learning dynamic algorithm portfolios.
Annals of Mathematics and Artificial Intelligence,
47(3):295-328.
AI&MATH 2006 Special Issue.
[ bib |
DOI |
slides |
techrep |
.pdf |
Abstract ]
-
Gagliolo, M. and Schmidhuber, J. (2006b).
Gambling in a computationally expensive casino: Algorithm selection
as a bandit problem.
Online Trading of Exploration and Exploitation - NIPS 2006
Workshop.
[ bib |
event |
.pdf ]
-
Gagliolo, M. and Schmidhuber, J. (2006c).
Impact of censored sampling on the performance of restart strategies.
In Benhamou, F., editor, Principle and Practice of Constraint
Programming - CP 2006, 12th International Conference, CP 2006, Nantes,
France, September 25-29, 2006, Proceedings, volume 4204 of LNCS, pages
167-181. Springer.
[ bib |
DOI |
event |
slides |
.pdf |
Abstract ]
-
Gagliolo, M. (2006).
Dynamic meta learning.
50th Anniversary Summit of Artificial Intelligence, Monte
Veritá, Switzerland.
[ bib |
event |
poster |
.pdf ]
-
Schmidhuber, J., Gagliolo, M., Wierstra, D., and Gomez, F. (2006).
Evolino for recurrent support vector machines.
In Verleysen, M., editor, ESANN '06 - 14 th European Symposium
on Artificial Neural Networks, pages 593-598, Evere, Belgium. d-side.
[ bib |
arXiv |
event |
techrep ]
-
Gagliolo, M. and Schmidhuber, J. (2006a).
Dynamic algorithm portfolios.
AI&MATH '06 - Ninth International Symposium on Artificial
Intelligence and Mathematics.
[ bib |
event |
slides |
.pdf ]
-
Schmidhuber, J., Gagliolo, M., Wierstra, D., and Gomez, F. (2005).
Evolino for recurrent support vector machines.
Technical Report IDSIA - 19 - 05, IDSIA.
Shorter version published in ESANN 2006 Proceedings.
[ bib |
arXiv |
.pdf |
Abstract ]
-
Gagliolo, M. and Schmidhuber, J. (2005b).
Towards life-long meta learning.
Inductive Transfer : 10 Years Later - NIPS 2005 workshop.
[ bib |
event |
.pdf ]
-
Gagliolo, M. and Schmidhuber, J. (2005a).
A neural network model for inter-problem adaptive online time
allocation.
In Duch, W., Kacprzyk, J., Oja, E., and Zadrożny, S., editors,
Artificial Neural Networks: Formal Models and Their Applications -
ICANN 2005, 15th International Conference, Warsaw, Poland, September 11-15,
2005, Proceedings, Part 2, volume 3697 of LNCS, pages 7-12. Springer.
[ bib |
DOI |
event |
slides |
.pdf |
Abstract ]
-
Gagliolo, M., Zhumatiy, V., and Schmidhuber, J. (2004).
Adaptive online time allocation to search algorithms.
In Boulicaut, J., F.Esposito, Giannotti, F., and Pedreschi, D.,
editors, Machine Learning: ECML 2004. Proceedings of the 15th European
Conference on Machine Learning, Pisa, Italy, September 20-24, 2004, volume
3201 of LNCS, pages 134-143. Springer.
[ bib |
DOI |
event |
slides |
techrep |
Abstract ]
-
Schmidhuber, J., Zhumatiy, V., and Gagliolo, M. (2004).
Bias-optimal incremental learning of control sequences for virtual
robots.
In et al., F. G., editor, Proc. 8th Conference on Intelligent
Autonomous Systems, IAS-8, pages 658-665, Amsterdam, NL. IOS Press.
[ bib ]
-
Anguita, D. and Gagliolo, M. (2002).
Mdl based model selection for relevance vector regression.
In Dorronsoro, J., editor, Artificial Neural Networks - ICANN
2002, volume 2415 of LNCS, pages 468-473. Springer.
[ bib |
DOI |
.pdf |
Abstract ]
|