Chair of Information Technology (150)
Organisational unit: Chair
Research output
- 2025
- Accepted/In press
Online Regret Bounds for Satisficing in MDPs
Hajiabolhassan, H. & Ortner, R., 2025, (Accepted/In press) In: Mathematics of Operations Research. Bibliographic details pending (as of 27 March 2025). Research output: Contribution to journal › Article › Research › peer-review
- 2024
- Published
Machine learning assisted calibration of PVT simulations for SiC crystal growth
Taucher, L., Ramadan, Z., Hammer, R., Obermüller, T., Auer, P. & Romaner, L., 10 Oct 2024, In: CrystEngComm. 44.2024, 26, p. 6322-6335 14 p.Research output: Contribution to journal › Article › Research › peer-review
- Published
Automatic and time-resolved determination of fracture characteristics from in situ experiments
Schmuck, K. S., Antenreiter, M., Alfreider, M. & Kiener, D., Jul 2024, In: Materials & design. 243.2024, July, 12 p., 113038.Research output: Contribution to journal › Article › Research › peer-review
- Published
A DSMS approach to support surveillance data based services in U-space
Pfisterer, D., 2024Research output: Thesis › Master's Thesis
- Published
Instrumentation and Signal Processing for the Verification of Directional Drilling
O'Leary, P., Terbuch, A., Ninevski, D., Mevec, D., Fruhmann, R., Khalilimotlaghkasmaei, N. & Habacher, M., 2024, 2024 IEEE International Instrumentation and Measurement Technology Conference (I2MTC) (I2MTC 2024).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Multi-Armed Bandits in IEEE 802.11ac: Efficient Algorithms and Testbed Experiments
Le, M., Bile, P., Pawar, S. P., Auer, P., Thapa, P. D., Hühn, T. & Jorswieck, E. A., 2024, 2024 IEEE International Workshop Technical Committee on Communications Quality and Reliability (CQR).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Understanding the Gaps in Satisficing Bandits
Rouyer, C., Ortner, R. & Auer, P., 2024.Research output: Contribution to conference › Poster › Research › peer-review
- 2023
- Published
Adaptive Algorithms for Meta-Induction
Ortner, R., 7 Oct 2023, In: Journal for general philosophy of science = Zeitschrift für allgemeine Wissenschaftstheorie. 54.2023, 3, p. 433–450 18 p.Research output: Contribution to journal › Article › Research › peer-review
- Published
Exam proctoring system V3.0.1
Antenreiter, M., 23 Aug 2023Research output: Non-textual form › Software › Education
- Published
Regret Bounds for Satisficing in Multi-Armed Bandit Problems
Michel, T., Hajiabolhassan, H. & Ortner, R., 7 Jun 2023, In: Transactions on machine learning research. 2023, August, 19 p.Research output: Contribution to journal › Article › Research › peer-review
- Published
FunQG: Molecular Representation Learning Via Quotient Graphs
Hajiabolhassan, H., Taheri, Z., Hojatnia, A. & Taheri Yeganeh, Y., 15 May 2023, In: Journal of chemical information and modeling. 63.2023, 11, p. 3275-3287 13 p.Research output: Contribution to journal › Article › Research › peer-review
- Published
A Reinforcement Learning Approach for Real-Time Autonomous Decision-Making in Well Construction
Keshavarz, S., Vita, P., Rückert, E., Ortner, R. & Thonhauser, G., 19 Jan 2023, SPE AI Symposium 2023: Leveraging Artificial Intelligence to Shape the Future of the Energy Industry. (Society of Petroleum Engineers - SPE Symposium: Leveraging Artificial Intelligence to Shape the Future of the Energy Industry, AIS 2023).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Detecting Anomalous Multivariate Time-Series via Hybrid Machine Learning
Terbuch, A., O'Leary, P., Khalilimotlaghkasmaei, N., Auer, P., Zöhrer, A. & Winter, V., 12 Jan 2023, In: IEEE transactions on instrumentation and measurement. 72.2023, 13 p., 2503711.Research output: Contribution to journal › Article › Research › peer-review
- Published
Autonomous Exploration for Navigating in MDPs Using Blackbox RL Algorithms
Gajane, P., Auer, P. & Ortner, R., 2023, Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI-23). p. 3714-3722Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
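Grundlagen- und Ansatzerhebung für eine gemeinsame Emulationsumgebung zur übergreifenden Anwendung auf der Materialfluss- und Steuerungsebene [Survey of fundamentals and approaches for a shared emulation environment for cross-level use at the material flow and control levels]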
Schalk, R., 2023Research output: Thesis › Master's Thesis
- Published
No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions
Jin, T., Liu, J., Rouyer, C., Chang, W., Wei, C.-Y. & Luo, H., 2023, 37th Conference on Neural Information Processing Systems (NeurIPS 2023).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Accepted/In press
Online Regret Bounds for Satisficing in MDPs
Hajiabolhassan, H. & Ortner, R., 2023, (Accepted/In press).Research output: Contribution to conference › Poster › Research › peer-review
- Published
When is Cartesian product a Cayley graph?
Dobson, E., Hujdurovic, A., Imrich, W. & Ortner, R., 2023, Proceedings of the 12th European Conference on Combinatorics, Graph Theory and Applications. p. 362-367Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- 2022
- Published
Decision Making Under Uncertainty and Reinforcement Learning
Dimitrakakis, C. & Ortner, R., Dec 2022, Springer. (Intelligent Systems Reference Library; vol. 223)Research output: Book/Report › Book › Education
- Published
Quantification of Transfer in Reinforcement Learning via Regret Bounds for Learning Agents
Tuynman, A. & Ortner, R., Sept 2022.Research output: Contribution to conference › Poster › Research › peer-review
- Published
Regret Bounds for Satisficing in Multi-Armed Bandit Problems
Michel, T., Hajiabolhassan, H. & Ortner, R., Sept 2022.Research output: Contribution to conference › Poster › Research › peer-review
- Published
Predicting Packaging Sizes Using Machine Learning
Heininger, M. & Ortner, R., 22 Aug 2022, In: Operations research forum. 43.2022, 3, 14 p., 43.Research output: Contribution to journal › Article › Research › peer-review
- E-pub ahead of print
The bin covering with delivery problem, extended investigations for the online case
Abraham, G., Auer, P., Dosa, G., Dulai, T., Tuza, Z. & Werner-Stark, Á., 30 Apr 2022, (E-pub ahead of print) In: Central European Journal of Operations Research. 31.2023, March, p. 21-47 27 p.Research output: Contribution to journal › Article › Research › peer-review
- Published
Annotation of screencasts: Distinguishing Between Relevant and Irrelevant Sections
Ulm, T., 2022Research output: Thesis › Master's Thesis
- Published
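Entscheidungsfindung unter Unsicherheit durch Reinforcement Learning am Beispiel einer Microgrid-Steuerung [Decision making under uncertainty through reinforcement learning, using microgrid control as an example]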
Zeilinger, K., 2022Research output: Thesis › Master's Thesis
- Published
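Evaluierung der Eignung von neuronalen Netzen zum Forecasting logistischer Zeitreihen: Ein Beispiel aus der österreichischen Lebensmittelindustrie [Evaluating the suitability of neural networks for forecasting logistics time series: an example from the Austrian food industry]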
Kotzbeck, J., 2022Research output: Thesis › Master's Thesis
- Published
Hybrid Machine Learning for Anomaly Detection in Industrial Time-Series Measurement Data
Terbuch, A., O'Leary, P. & Auer, P., 2022, I2MTC 2022 - IEEE International Instrumentation and Measurement Technology Conference: Instrumentation and Measurement under Pandemic Constraints, Proceedings. Institute of Electrical and Electronics Engineers, (Conference Record - IEEE Instrumentation and Measurement Technology Conference).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
LoopyDenseNet: Combining Skip Connections, Dense Connectivity and Loops within a Convolutional Neural Network
Niederl, P., 2022Research output: Thesis › Master's Thesis
- Published
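Parameteroptimierung einer Logistiksoftware: Anwendung eines evolutionären Verfahrens am Beispiel eines Lagerplatzsuchers [Parameter optimization of logistics software: applying an evolutionary method to a storage location finder]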
Perl, M., 2022Research output: Thesis › Master's Thesis
- 2021
- Published
A new heuristic and an exact approach for a production planning problem
Auer, P., Dósa, G., Dulai, T., Fügenschuh, A., Näser, P., Ortner, R. & Werner-Starkne, A., Sept 2021, In: Central European Journal of Operations Research. 29, 3, p. 1079-1113 35 p.Research output: Contribution to journal › Article › Research › peer-review
- Published
Regret Bounds for Reinforcement Learning via Markov Chain Concentration
Ortner, R., 26 Aug 2021.Research output: Contribution to conference › Poster › Research › peer-review
- Published
Particle Size Estimation in Mixed Commercial Waste Images Using Deep Learning
Kittiworapanya, P., Pasupa, K. & Auer, P., 29 Jun 2021, IAIT 2021 - 12th International Conference on Advances in Information Technology: Intelligence and Innovation for Digital Business and Society. Association for Computing Machinery (ACM), 3471273. (ACM International Conference Proceeding Series).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Learning to Drive with Deep Reinforcement Learning
Chukamphaeng, N., Pasupa, K., Antenreiter, M. & Auer, P., 21 Jan 2021, KST 2021 - 2021 13th International Conference Knowledge and Smart Technology. Institute of Electrical and Electronics Engineers, p. 147-152 6 p. 9415770. (KST 2021 - 2021 13th International Conference Knowledge and Smart Technology).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Grid Load Mitigation in EV Fast Charging Stations Through Integration of a High-Performance Flywheel Energy Storage System with CFRP Rotor
Buchroithner, A., Presmair, R., Haidl, P., Wegleiter, H., Thormann, B., Kienberger, T., Auer, P. & Domitner, J., 2021, 2021 IEEE Green Energy and Smart Systems Conference, IGESSC 2021. Institute of Electrical and Electronics Engineers, (2021 IEEE Green Energy and Smart Systems Conference, IGESSC 2021).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Online Anticipatory Algorithms for Scheduling Problems
Erler, S., 2021Research output: Thesis › Master's Thesis
- Published
Vorhersage der Verpackungsgröße einer Lieferung in einem E-Commerce-Unternehmen mittels Machine Learning [Predicting the packaging size of a shipment in an e-commerce company using machine learning]
Heininger, M., 2021Research output: Thesis › Master's Thesis
- 2020
- Published
Online exam proctoring system
Antenreiter, M., 16 Oct 2020Research output: Non-textual form › Software › Education
- Published
A fast video compression algorithm for online exams
Antenreiter, M., 11 Sept 2020Research output: Non-textual form › Software › Research
- Published
Regret Bounds for Reinforcement Learning via Markov Chain Concentration
Ortner, R., 23 Jan 2020, In: The journal of artificial intelligence research. 67.2020, 1, p. 115-128 14 p.Research output: Contribution to journal › Article › Research › peer-review
- Published
Clusteranalyse für die Bildung von Produktgruppen zur Unterstützung der Lagerdimensionierung [Cluster analysis for forming product groups to support warehouse dimensioning]
Kohlhofer, J. B., 2020Research output: Thesis › Master's Thesis
- Published
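Maschinelles Lernen in der Nachschubsteuerung - Verbesserung der Nachschubsteuerung durch den Einsatz von maschinellem Lernen in der Bedarfsprognose [Machine learning in replenishment control: improving replenishment control by using machine learning in demand forecasting]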
Labner, J., 2020Research output: Thesis › Master's Thesis
- 2019
- Published
A Reinforcement Learning Motivated Algorithm for Process Optimization
Ábrahám, Á., Auer, P., Dósa, G., Dulai, T. & Werner-Stark, Á., 18 Dec 2019, In: Periodica Polytechnica Civil Engineering. 63.2019, 4, p. 961-970 10 p.Research output: Contribution to journal › Article › Research › peer-review
- Published
Regret Bounds for Learning State Representations in Reinforcement Learning
Ortner, R., Pirotta, M., Lazaric, A., Fruit, R. & Maillard, O.-A., Dec 2019.Research output: Contribution to conference › Poster › Research › peer-review
- Published
Adaptively Tracking the Best Bandit Arm with an Unknown Number of Distribution Changes
Auer, P., Gajane, P. & Ortner, R., 27 Jun 2019.Research output: Contribution to conference › Poster › Research › peer-review
- Published
Achieving Optimal Dynamic Regret for Non-stationary Bandits without Prior Information
Auer, P., Chen, Y., Gajane, P., Lee, C.-W., Luo, H., Ortner, R. & Wei, C.-Y., 2019.Research output: Contribution to conference › Abstract › peer-review
- Published
Adaptively Tracking the Best Bandit Arm with an Unknown Number of Distribution Changes
Auer, P., Gajane, P. & Ortner, R., 2019, Proceedings of the 32nd Conference on Learning Theory, COLT 2019. p. 138-158Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- E-pub ahead of print
Regret Bounds for Learning State Representations in Reinforcement Learning
Ortner, R., Pirotta, M., Lazaric, A., Fruit, R. & Maillard, O.-A., 2019, (E-pub ahead of print) Advances in Neural Information Processing Systems. Vol. 32. p. 12717-12727 Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Variational Regret Bounds for Reinforcement Learning
Ortner, R., Gajane, P. & Auer, P., 2019.Research output: Contribution to conference › Paper › peer-review
- Published
Variational Regret Bounds for Reinforcement Learning
Ortner, R., Gajane, P. & Auer, P., 2019, Proceedings of The 35th Uncertainty in Artificial Intelligence Conference, UAI 2019. p. 81-90Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- 2018
- Published
Online learning with randomized feedback graphs for optimal PUE attacks in cognitive radio networks
Dabaghchian, M., Alipour-Fanid, A., Zeng, K., Wang, Q. & Auer, P., 1 Oct 2018, In: IEEE ACM transactions on networking. 26, 5, p. 2268-2281 14 p., 8466108.Research output: Contribution to journal › Article › Research › peer-review
- Published
Adaptively Tracking the Best Arm with an Unknown Number of Distribution Changes
Auer, P., Gajane, P. & Ortner, R., 2018.Research output: Contribution to conference › Paper › peer-review
- Published
Adaptively Tracking the Best Arm with an Unknown Number of Distribution Changes
Auer, P., Gajane, P. & Ortner, R., 2018.Research output: Contribution to conference › Poster › Research › peer-review
- Published
Application of Regression Analysis for Throughput Prediction in the Order Picking Process
Lahovnik, J., 2018Research output: Thesis › Master's Thesis
- Published
A Sliding-Window Approach for Reinforcement Learning in MDPs with Arbitrarily Changing Rewards and Transitions
Gajane, P., Ortner, R. & Auer, P., 2018.Research output: Contribution to conference › Paper › peer-review
- Published
Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning
Fruit, R., Pirotta, M., Lazaric, A. & Ortner, R., 2018, Proceedings of the 35th International Conference on Machine Learning, ICML 2018. Vol. PMLR 80. p. 1578-1586Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Guest Editors' Foreword
Ortner, R. & Simon, H. U., 2018, In: Theoretical Computer Science. 742 Research output: Contribution to journal › Article › Research
- 2017
- Published
PSE-AP V1.0: software for predicting the segregation energy based on atom positions
Antenreiter, M., 6 Oct 2017Research output: Non-textual form › Software › Research
- Published
Software for binary classification of uneven geochemical datasets V1.0
Antenreiter, M., 10 Jan 2017Research output: Non-textual form › Software › Research
- Published
Automatic Scene Interpretation with Totally Occluded Objects
Antenreiter, M., 2017Research output: Thesis › Doctoral Thesis
- Published
Machine learning concepts in predictive analytics - A case study on wind turbine data
Steiner, E., 2017Research output: Thesis › Master's Thesis
- Published
Monte Carlo Tree Search for Job Shop Scheduling Problems
Reichenhauser, C., 2017Research output: Thesis › Master's Thesis
- Published
Online Learning
Auer, P., 2017, Encyclopedia of Machine Learning and Data Mining.Research output: Chapter in Book/Report/Conference proceeding › Chapter › Research
- 2016
- Published
Algorithmic Learning Theory
Auer, P., Clark, A. & Zeugmann, T., 18 Oct 2016, In: Theoretical Computer Science. 650, p. 1-3Research output: Contribution to journal › Special issue › Research › peer-review
- Published
Guest editors' foreword
Auer, P., Clark, A. & Zeugmann, T., 18 Oct 2016, In: Theoretical Computer Science. 650.2016, 18 October, p. 1-3 3 p.Research output: Contribution to journal › Letter › peer-review
- Published
An algorithm with nearly optimal pseudo-regret for both stochastic and adversarial bandits
Auer, P. & Chiang, C.-K., 23 Jun 2016, Proceedings of the 29th Conference on Learning Theory, COLT 2016. p. 116-120 (JMLR Workshop and Conference Proceedings; vol. 49).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Improved Learning Complexity in Combinatorial Pure Exploration Bandits
Gabillon, V., Lazaric, A., Ghavamzadeh, M., Ortner, R. & Bartlett, P., 10 May 2016.Research output: Contribution to conference › Poster › Research › peer-review
- Published
Learning with Malicious Noise
Auer, P., 22 Apr 2016, Encyclopedia of Algorithms. Springer, p. 1086-1089Research output: Chapter in Book/Report/Conference proceeding › Entry for encyclopedia/dictionary › Research
- Published
Algorithmic Learning Theory: 27th International Conference, ALT 2016, Proceedings
Ortner, R. (Co-editor), Simon, H. U. (Co-editor) & Zilles, S., 2016, Springer. Research output: Book/Report › Anthology › Research
- Published
Improved Learning Complexity in Combinatorial Pure Exploration Bandits
Gabillon, V., Lazaric, A., Ghavamzadeh, M., Ortner, R. & Bartlett, P., 2016, Proceedings of the Nineteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2016. p. 1004-1012 (JMLR Workshop and Conference Proceedings).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Optimal Behavior is Easier to Learn than the Truth
Ortner, R., 2016, In: Minds and Machines. 26, 3, p. 243-252Research output: Contribution to journal › Article › Research › peer-review
- Published
Pareto Front Identification from Stochastic Bandit Feedback
Auer, P., Chiang, C.-K., Ortner, R. & Drugan, M., 2016, Proceedings of the Nineteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2016. p. 939-947 (JMLR Workshop and Conference Proceedings).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- 2015
- Published
Improved Regret Bounds for Undiscounted Continuous Reinforcement Learning
Kailasam, L., Ortner, R. & Ryabko, D., 7 Jul 2015.Research output: Contribution to conference › Poster › Research › peer-review
- Published
Forcing Subarrangements in Complete Arrangements of Pseudocircles
Ortner, R., 2015, In: Journal of Computational Geometry. 6, 1, p. 235-248Research output: Contribution to journal › Article › Research › peer-review
- Published
Improved Regret Bounds for Undiscounted Continuous Reinforcement Learning
Kailasam, L., Ortner, R. & Ryabko, D., 2015, Proceedings of The 32nd International Conference on Machine Learning.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Integration von Informationssystemen: Datenaustausch zwischen Teamcenter und Enterprise Applikationen [Integration of information systems: data exchange between Teamcenter and enterprise applications]
Wötzl, K., 2015Research output: Thesis › Master's Thesis
- 2014
- Published
Algorithmic Learning Theory: 25th International Conference, ALT 2014 Bled, Slovenia, October 8-10, 2014 Proceedings
Auer, P. (Co-editor), Clark, A. (Co-editor), Zeugmann, T. (Co-editor) & Zilles, S. (Co-editor), 1 Jan 2014, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer Berlin, Vol. 8776. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 8776).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Editors’ introduction
Auer, P., Clark, A., Zeugmann, T. & Zilles, S., 1 Jan 2014, In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8776, p. 1-7 7 p.Research output: Contribution to journal › Editorial › peer-review
- Published
Autonome Roboter in der Intralogistik: Möglichkeiten zur Optimierung der Auftragsverteilung [Autonomous robots in intralogistics: options for optimizing order allocation]
Himmelsbach, E. S., 2014Research output: Thesis › Master's Thesis
- Published
Pseudo-reward Algorithms for Contextual Bandits with Linear Payoff Functions
Chiang, C.-K., 2014, Asian Conference on Machine Learning. Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Regret Bounds for Restless Markov Bandits
Ortner, R., Ryabko, D., Auer, P. & Munos, R., 2014, In: Theoretical Computer Science. 558, p. 62-76Research output: Contribution to journal › Article › Research › peer-review
- Published
Selecting Near-Optimal Approximate State Representations in Reinforcement Learning
Ortner, R., Maillard, O.-A. & Ryabko, D., 2014, Algorithmic Learning Theory - 25th International Conference, ALT 2014, Bled, October 8-10, 2014. p. 140-154Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Trackingvarianten für Augmented Reality Kommissioniersysteme [Tracking variants for augmented reality order picking systems]
Heily, D., 2014Research output: Thesis › Master's Thesis
- 2013
- Published
Adaptive Aggregation for Reinforcement Learning in Average Reward Markov Decision Processes
Ortner, R., 2013, In: Annals of operations research. 208, p. 321-336Research output: Contribution to journal › Article › Research › peer-review
- Published
Beating Bandits in Gradually Evolving Worlds
Chiang, C.-K., 2013, Conference on Learning Theory. Shalev-Shwartz, S. & Steinwart, I. (eds.). p. 210-227 (JMLR Workshop and Conference Proceedings; vol. 30).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Competing with an Infinite Set of Models in Reinforcement Learning
Nguyen, P., Maillard, O.-A., Ryabko, D. & Ortner, R., 2013, JMLR Workshop and Conference Proceedings Volume 31 : Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics. p. 463-471Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Linear regression with random projections
Maillard, O.-A., 2013, In: Journal of machine learning research (JMLR). 13, p. 1-1Research output: Contribution to journal › Article › Research › peer-review
- Published
Optimal regret bounds for selecting the state representation in reinforcement learning
Maillard, O.-A., Nguyen, P., Ortner, R. & Ryabko, D., 2013, JMLR Workshop and Conference Proceedings Volume 28 : Proceedings of The 30th International Conference on Machine Learning. p. 543-551Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Reinforcement Learning (Dagstuhl Seminar 13321)
Auer, P., 2013, In: Dagstuhl Reports. 3, p. 1-26Research output: Contribution to journal › Article › Research › peer-review
- 2012
- Published
Autonomous Exploration For Navigating In MDPs
Lim, S. H. & Auer, P., 2012, Proceedings of the 25th Annual Conference on Learning Theory. p. 40.1-40.24Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Entwicklung einer Simulation für Kommissioniersysteme [Development of a simulation for order picking systems]
Salmutter, A., 2012Research output: Thesis › Master's Thesis
- Published
Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments
Seldin, Y., Szepesvári, C., Auer, P. & Abbasi-Yadkori, Y., 2012, Proceedings of the Tenth European Workshop on Reinforcement Learning, EWRL 2012. p. 103-116 (JMLR proceedings).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Online Optimization with Gradual Variations
Chiang, C.-K., 2012, COLT 2012: Proceedings of the 25th Annual Conference on Learning Theory June 25-27, 2012, Edinburgh, Scotland. Mannor, S., Srebro, N. & Williamson, R. C. (eds.). p. 6.1-6.20 (JMLR Workshop and Conference Proceedings; vol. 23). Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Ortner, R. & Ryabko, D., 2012, Advances in Neural Information Processing Systems 25. MIT Press, p. 1772-1780Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Ortner, R. & Ryabko, D., 2012.Research output: Contribution to conference › Poster › Research › peer-review
- Published
PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits
Seldin, Y., Cesa-Bianchi, N., Auer, P., Laviolette, F. & Shawe-Taylor, J., 2012, Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2. p. 98-111Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
PAC-Bayesian Analysis of Contextual Bandits
Seldin, Y., Auer, P., Laviolette, F., Shawe-Taylor, J. S. & Ortner, R., 2012, Advances in Neural Information Processing Systems 24. MIT Press, p. 1683-1691Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
PAC-Bayesian Inequalities for Martingales
Seldin, Y., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, UAI 2012.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
PAC-Bayesian Inequalities for Martingales
Seldin, Y., Laviolette, F., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, In: IEEE transactions on information theory. 58, p. 7086-7093Research output: Contribution to journal › Article › Research › peer-review
- Published
PAC Subset Selection in Stochastic Multi-armed Bandits
Kalyanakrishnan, S., Tewari, A., Auer, P. & Stone, P., 2012, Proceedings of the 29th International Conference on Machine Learning, ICML 2012.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution