Chair of Information Technology (150)

Organisational unit: Chair

1 - 100 out of 219

Research output

2025
Accepted/In press
Online Regret Bounds for Satisficing in MDPs
Hajiabolhassan, H. & Ortner, R., 2025, (Accepted/In press) In: Mathematics of Operations Research. Volume, issue and pages not yet assigned (as of 27 March 2025).
Research output: Contribution to journal › Article › Research › peer-review
2024
Published
Machine learning assisted calibration of PVT simulations for SiC crystal growth
Taucher, L., Ramadan, Z., Hammer, R., Obermüller, T., Auer, P. & Romaner, L., 10 Oct 2024, In: CrystEngComm. 44.2024, 26, p. 6322-6335 14 p.
Research output: Contribution to journal › Article › Research › peer-review
Published
Automatic and time-resolved determination of fracture characteristics from in situ experiments
Schmuck, K. S., Antenreiter, M., Alfreider, M. & Kiener, D., Jul 2024, In: Materials & design. 243.2024, July, 12 p., 113038.
Research output: Contribution to journal › Article › Research › peer-review
Published
A DSMS approach to support surveillance data based services in U-space
Pfisterer, D., 2024
Research output: Thesis › Master's Thesis
Published
Instrumentation and Signal Processing for the Verification of Directional Drilling
O'Leary, P., Terbuch, A., Ninevski, D., Mevec, D., Fruhmann, R., Khalilimotlaghkasmaei, N. & Habacher, M., 2024, 2024 IEEE International Instrumentation and Measurement Technology Conference (I2MTC) (I2MTC 2024).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Multi-Armed Bandits in IEEE 802.11ac: Efficient Algorithms and Testbed Experiments
Le, M., Bile, P., Pawar, S. P., Auer, P., Thapa, P. D., Hühn, T. & Jorswieck, E. A., 2024, 2024 IEEE International Workshop Technical Committee on Communications Quality and Reliability (CQR).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Understanding the Gaps in Satisficing Bandits
Rouyer, C., Ortner, R. & Auer, P., 2024.
Research output: Contribution to conference › Poster › Research › peer-review
2023
Published
Adaptive Algorithms for Meta-Induction
Ortner, R., 7 Oct 2023, In: Journal for general philosophy of science = Zeitschrift für allgemeine Wissenschaftstheorie. 54.2023, 3, p. 433–450 18 p.
Research output: Contribution to journal › Article › Research › peer-review
Published
Exam proctoring system V3.0.1
Antenreiter, M., 23 Aug 2023
Research output: Non-textual form › Software › Education
Published
Regret Bounds for Satisficing in Multi-Armed Bandit Problems
Michel, T., Hajiabolhassan, H. & Ortner, R., 7 Jun 2023, In: Transactions on machine learning research. 2023, August, 19 p.
Research output: Contribution to journal › Article › Research › peer-review
Published
FunQG: Molecular Representation Learning Via Quotient Graphs
Hajiabolhassan, H., Taheri, Z., Hojatnia, A. & Taheri Yeganeh, Y., 15 May 2023, In: Journal of chemical information and modeling. 63.2023, 11, p. 3275-3287 13 p.
Research output: Contribution to journal › Article › Research › peer-review
Published
A Reinforcement Learning Approach for Real-Time Autonomous Decision-Making in Well Construction
Keshavarz, S., Vita, P., Rückert, E., Ortner, R. & Thonhauser, G., 19 Jan 2023, SPE AI Symposium 2023: Leveraging Artificial Intelligence to Shape the Future of the Energy Industry. (Society of Petroleum Engineers - SPE Symposium: Leveraging Artificial Intelligence to Shape the Future of the Energy Industry, AIS 2023).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Detecting Anomalous Multivariate Time-Series via Hybrid Machine Learning
Terbuch, A., O'Leary, P., Khalilimotlaghkasmaei, N., Auer, P., Zöhrer, A. & Winter, V., 12 Jan 2023, In: IEEE transactions on instrumentation and measurement. 72.2023, 13 p., 2503711.
Research output: Contribution to journal › Article › Research › peer-review
Published
Autonomous Exploration for Navigating in MDPs Using Blackbox RL Algorithms
Gajane, P., Auer, P. & Ortner, R., 2023, Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI-23). p. 3714-3722
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Grundlagen- und Ansatzerhebung für eine gemeinsame Emulationsumgebung zur übergreifenden Anwendung auf der Materialfluss- und Steuerungsebene
Schalk, R., 2023
Research output: Thesis › Master's Thesis
Published
No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions
Jin, T., Liu, J., Rouyer, C., Chang, W., Wei, C.-Y. & Luo, H., 2023, 37th Conference on Neural Information Processing Systems (NeurIPS 2023).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Accepted/In press
Online Regret Bounds for Satisficing in MDPs
Hajiabolhassan, H. & Ortner, R., 2023, (Accepted/In press).
Research output: Contribution to conference › Poster › Research › peer-review
Published
When is Cartesian product a Cayley graph?
Dobson, E., Hujdurovic, A., Imrich, W. & Ortner, R., 2023, Proceedings of the 12th European Conference on Combinatorics, Graph Theory and Applications. p. 362-367
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
2022
Published
Decision Making Under Uncertainty and Reinforcement Learning
Dimitrakakis, C. & Ortner, R., Dec 2022, Springer. (Intelligent Systems Reference Library; vol. 223)
Research output: Book/Report › Book › Education
Published
Quantification of Transfer in Reinforcement Learning via Regret Bounds for Learning Agents
Tuynman, A. & Ortner, R., Sept 2022.
Research output: Contribution to conference › Poster › Research › peer-review
Published
Regret Bounds for Satisficing in Multi-Armed Bandit Problems
Michel, T., Hajiabolhassan, H. & Ortner, R., Sept 2022.
Research output: Contribution to conference › Poster › Research › peer-review
Published
Predicting Packaging Sizes Using Machine Learning
Heininger, M. & Ortner, R., 22 Aug 2022, In: Operations research forum. 43.2022, 3, 14 p., 43.
Research output: Contribution to journal › Article › Research › peer-review
E-pub ahead of print
The bin covering with delivery problem, extended investigations for the online case
Abraham, G., Auer, P., Dosa, G., Dulai, T., Tuza, Z. & Werner-Stark, Á., 30 Apr 2022, (E-pub ahead of print) In: Central European Journal of Operations Research. 31.2023, March, p. 21-47 27 p.
Research output: Contribution to journal › Article › Research › peer-review
Published
Annotation of screencasts: Distinguishing Between Relevant and Irrelevant Sections
Ulm, T., 2022
Research output: Thesis › Master's Thesis
Published
Entscheidungsfindung unter Unsicherheit durch Reinforcement Learning am Beispiel einer Microgrid-Steuerung
Zeilinger, K., 2022
Research output: Thesis › Master's Thesis
Published
Evaluierung der Eignung von neuronalen Netzen zum Forecasting logistischer Zeitreihen: Ein Beispiel aus der österreichischen Lebensmittelindustrie
Kotzbeck, J., 2022
Research output: Thesis › Master's Thesis
Published
Hybrid Machine Learning for Anomaly Detection in Industrial Time-Series Measurement Data
Terbuch, A., O'Leary, P. & Auer, P., 2022, I2MTC 2022 - IEEE International Instrumentation and Measurement Technology Conference: Instrumentation and Measurement under Pandemic Constraints, Proceedings. Institute of Electrical and Electronics Engineers, (Conference Record - IEEE Instrumentation and Measurement Technology Conference).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
LoopyDenseNet: Combining Skip Connections, Dense Connectivity and Loops within a Convolutional Neural Network
Niederl, P., 2022
Research output: Thesis › Master's Thesis
Published
Parameteroptimierung einer Logistiksoftware: Anwendung eines evolutionären Verfahrens am Beispiel eines Lagerplatzsuchers
Perl, M., 2022
Research output: Thesis › Master's Thesis
Published
Reinforcement Learning for Decision Support
Roth, M., 2022
Research output: Thesis › Master's Thesis
2021
Published
A new heuristic and an exact approach for a production planning problem
Auer, P., Dósa, G., Dulai, T., Fügenschuh, A., Näser, P., Ortner, R. & Werner-Starkne, A., Sept 2021, In: Central European Journal of Operations Research. 29, 3, p. 1079-1113 35 p.
Research output: Contribution to journal › Article › Research › peer-review
Published
Regret Bounds for Reinforcement Learning via Markov Chain Concentration
Ortner, R., 26 Aug 2021.
Research output: Contribution to conference › Poster › Research › peer-review
Published
Particle Size Estimation in Mixed Commercial Waste Images Using Deep Learning
Kittiworapanya, P., Pasupa, K. & Auer, P., 29 Jun 2021, IAIT 2021 - 12th International Conference on Advances in Information Technology: Intelligence and Innovation for Digital Business and Society. Association for Computing Machinery (ACM), 3471273. (ACM International Conference Proceeding Series).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Learning to Drive with Deep Reinforcement Learning
Chukamphaeng, N., Pasupa, K., Antenreiter, M. & Auer, P., 21 Jan 2021, KST 2021 - 2021 13th International Conference Knowledge and Smart Technology. Institute of Electrical and Electronics Engineers, p. 147-152 6 p. 9415770. (KST 2021 - 2021 13th International Conference Knowledge and Smart Technology).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Grid Load Mitigation in EV Fast Charging Stations Through Integration of a High-Performance Flywheel Energy Storage System with CFRP Rotor
Buchroithner, A., Presmair, R., Haidl, P., Wegleiter, H., Thormann, B., Kienberger, T., Auer, P. & Domitner, J., 2021, 2021 IEEE Green Energy and Smart Systems Conference, IGESSC 2021. Institute of Electrical and Electronics Engineers, (2021 IEEE Green Energy and Smart Systems Conference, IGESSC 2021).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Online Anticipatory Algorithms for Scheduling Problems
Erler, S., 2021
Research output: Thesis › Master's Thesis
Published
Vorhersage der Verpackungsgröße einer Lieferung in einem E-Commerce-Unternehmen mittels Machine Learning
Heininger, M., 2021
Research output: Thesis › Master's Thesis
2020
Published
Online exam proctoring system
Antenreiter, M., 16 Oct 2020
Research output: Non-textual form › Software › Education
Published
A fast video compression algorithm for online exams
Antenreiter, M., 11 Sept 2020
Research output: Non-textual form › Software › Research
Published
Regret Bounds for Reinforcement Learning via Markov Chain Concentration
Ortner, R., 23 Jan 2020, In: The journal of artificial intelligence research. 67.2020, 1, p. 115-128 14 p.
Research output: Contribution to journal › Article › Research › peer-review
Published
Clusteranalyse für die Bildung von Produktgruppen zur Unterstützung der Lagerdimensionierung
Kohlhofer, J. B., 2020
Research output: Thesis › Master's Thesis
Published
Maschinelles Lernen in der Nachschubsteuerung - Verbesserung der Nachschubsteuerung durch den Einsatz von maschinellem Lernen in der Bedarfsprognose
Labner, J., 2020
Research output: Thesis › Master's Thesis
2019
Published
A Reinforcement Learning Motivated Algorithm for Process Optimization
Ábrahám, Á., Auer, P., Dósa, G., Dulai, T. & Werner-Stark, Á., 18 Dec 2019, In: Periodica Polytechnica Civil Engineering. 63.2019, 4, p. 961-970 10 p.
Research output: Contribution to journal › Article › Research › peer-review
Published
Regret Bounds for Learning State Representations in Reinforcement Learning
Ortner, R., Pirotta, M., Lazaric, A., Fruit, R. & Maillard, O.-A., Dec 2019.
Research output: Contribution to conference › Poster › Research › peer-review
Published
Adaptively Tracking the Best Bandit Arm with an Unknown Number of Distribution Changes
Auer, P., Gajane, P. & Ortner, R., 27 Jun 2019.
Research output: Contribution to conference › Poster › Research › peer-review
Published
Achieving Optimal Dynamic Regret for Non-stationary Bandits without Prior Information
Auer, P., Chen, Y., Gajane, P., Lee, C.-W., Luo, H., Ortner, R. & Wei, C.-Y., 2019.
Research output: Contribution to conference › Abstract › peer-review
Published
Adaptively Tracking the Best Bandit Arm with an Unknown Number of Distribution Changes
Auer, P., Gajane, P. & Ortner, R., 2019, Proceedings of the 32nd Conference on Learning Theory, COLT 2019. p. 138-158
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
E-pub ahead of print
Regret Bounds for Learning State Representations in Reinforcement Learning
Ortner, R., Pirotta, M., Lazaric, A., Fruit, R. & Maillard, O.-A., 2019, (E-pub ahead of print) Advances in Neural Information Processing Systems. Vol. 32. p. 12717-12727
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Variational Regret Bounds for Reinforcement Learning
Ortner, R., Gajane, P. & Auer, P., 2019.
Research output: Contribution to conference › Paper › peer-review
Published
Variational Regret Bounds for Reinforcement Learning
Ortner, R., Gajane, P. & Auer, P., 2019, Proceedings of The 35th Uncertainty in Artificial Intelligence Conference, UAI 2019. p. 81-90
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
2018
Published
Online learning with randomized feedback graphs for optimal PUE attacks in cognitive radio networks
Dabaghchian, M., Alipour-Fanid, A., Zeng, K., Wang, Q. & Auer, P., 1 Oct 2018, In: IEEE ACM transactions on networking. 26, 5, p. 2268-2281 14 p., 8466108.
Research output: Contribution to journal › Article › Research › peer-review
Published
Adaptively Tracking the Best Arm with an Unknown Number of Distribution Changes
Auer, P., Gajane, P. & Ortner, R., 2018.
Research output: Contribution to conference › Paper › peer-review
Published
Adaptively Tracking the Best Arm with an Unknown Number of Distribution Changes
Auer, P., Gajane, P. & Ortner, R., 2018.
Research output: Contribution to conference › Poster › Research › peer-review
Published
Application of Regression Analysis for Throughput Prediction in the Order Picking Process
Lahovnik, J., 2018
Research output: Thesis › Master's Thesis
Published
A Sliding-Window Approach for Reinforcement Learning in MDPs with Arbitrarily Changing Rewards and Transitions.
Gajane, P., Ortner, R. & Auer, P., 2018.
Research output: Contribution to conference › Paper › peer-review
Published
Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning
Fruit, R., Pirotta, M., Lazaric, A. & Ortner, R., 2018, Proceedings of the 35th International Conference on Machine Learning, ICML 2018. Vol. PMLR 80. p. 1578-1586
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Guest Editors' Foreword
Ortner, R. & Simon, H. U., 2018, In: Theoretical Computer Science. 742
Research output: Contribution to journal › Article › Research
2017
Published
PSE-AP V1.0: software for predicting the segregation energy based on atom positions
Antenreiter, M., 6 Oct 2017
Research output: Non-textual form › Software › Research
Published
Software for binary classification of uneven geochemical datasets V1.0
Antenreiter, M., 10 Jan 2017
Research output: Non-textual form › Software › Research
Published
Automatic Scene Interpretation with Totally Occluded Objects
Antenreiter, M., 2017
Research output: Thesis › Doctoral Thesis
Published
Machine learning concepts in predictive analytics - A case study on wind turbine data
Steiner, E., 2017
Research output: Thesis › Master's Thesis
Published
Monte Carlo Tree Search for Job Shop Scheduling Problems
Reichenhauser, C., 2017
Research output: Thesis › Master's Thesis
Published
Online Learning
Auer, P., 2017, Encyclopedia of Machine Learning and Data Mining.
Research output: Chapter in Book/Report/Conference proceeding › Chapter › Research
2016
Published
Algorithmic Learning Theory
Auer, P., Clark, A. & Zeugmann, T., 18 Oct 2016, In: Theoretical Computer Science. 650, p. 1-3
Research output: Contribution to journal › Special issue › Research › peer-review
Published
Guest editors' foreword
Auer, P., Clark, A. & Zeugmann, T., 18 Oct 2016, In: Theoretical Computer Science. 650.2016, 18 October, p. 1-3 3 p.
Research output: Contribution to journal › Letter › peer-review
Published
An algorithm with nearly optimal pseudo-regret for both stochastic and adversarial bandits
Auer, P. & Chiang, C.-K., 23 Jun 2016, Proceedings of the 29th Conference on Learning Theory, COLT 2016. p. 116-120 (JMLR Workshop and Conference Proceedings; vol. 49).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Improved Learning Complexity in Combinatorial Pure Exploration Bandits
Gabillon, V., Lazaric, A., Ghavamzadeh, M., Ortner, R. & Bartlett, P., 10 May 2016.
Research output: Contribution to conference › Poster › Research › peer-review
Published
Learning with Malicious Noise
Auer, P., 22 Apr 2016, Encyclopedia of Algorithms. Springer, p. 1086-1089
Research output: Chapter in Book/Report/Conference proceeding › Entry for encyclopedia/dictionary › Research
Published
Algorithmic Learning Theory: 27th International Conference, ALT 2016, Proceedings
Ortner, R. (Co-editor), Simon, H. U. (Co-editor) & Zilles, S., 2016, Springer.
Research output: Book/Report › Anthology › Research
Published
Improved Learning Complexity in Combinatorial Pure Exploration Bandits
Gabillon, V., Lazaric, A., Ghavamzadeh, M., Ortner, R. & Bartlett, P., 2016, Proceedings of the Nineteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2016. p. 1004-1012 (JMLR Workshop and Conference Proceedings).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Optimal Behavior is Easier to Learn than the Truth
Ortner, R., 2016, In: Minds and Machines. 26, 3, p. 243-252
Research output: Contribution to journal › Article › Research › peer-review
Published
Pareto Front Identification from Stochastic Bandit Feedback
Auer, P., Chiang, C.-K., Ortner, R. & Drugan, M., 2016, Proceedings of the Nineteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2016. p. 939-947 (JMLR Workshop and Conference Proceedings).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
2015
Published
Improved Regret Bounds for Undiscounted Continuous Reinforcement Learning
Kailasam, L., Ortner, R. & Ryabko, D., 7 Jul 2015.
Research output: Contribution to conference › Poster › Research › peer-review
Published
Forcing Subarrangements in Complete Arrangements of Pseudocircles
Ortner, R., 2015, In: Journal of Computational Geometry. 6, 1, p. 235-248
Research output: Contribution to journal › Article › Research › peer-review
Published
Improved Regret Bounds for Undiscounted Continuous Reinforcement Learning
Kailasam, L., Ortner, R. & Ryabko, D., 2015, Proceedings of The 32nd International Conference on Machine Learning.
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Integration von Informationssystemen: Datenaustausch zwischen Teamcenter und Enterprise Applikationen
Wötzl, K., 2015
Research output: Thesis › Master's Thesis
2014
Published
Algorithmic Learning Theory: 25th International Conference, ALT 2014 Bled, Slovenia, October 8-10, 2014 Proceedings
Auer, P. (Co-editor), Clark, A. (Co-editor), Zeugmann, T. (Co-editor) & Zilles, S. (Co-editor), 1 Jan 2014, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer Berlin, Vol. 8776. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 8776).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Editors’ introduction
Auer, P., Clark, A., Zeugmann, T. & Zilles, S., 1 Jan 2014, In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8776, p. 1-7 7 p.
Research output: Contribution to journal › Editorial › peer-review
Published
Autonome Roboter in der Intralogistik: Möglichkeiten zur Optimierung der Auftragsverteilung
Himmelsbach, E. S., 2014
Research output: Thesis › Master's Thesis
Published
Pseudo-reward Algorithms for Contextual Bandits with Linear Payoff Functions
Chiang, C.-K., 2014, Asian Conference on Machine Learning. p. 0-0
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Regret Bounds for Restless Markov Bandits
Ortner, R., Ryabko, D., Auer, P. & Munos, R., 2014, In: Theoretical Computer Science. 558, p. 62-76
Research output: Contribution to journal › Article › Research › peer-review
Published
Selecting Near-Optimal Approximate State Representations in Reinforcement Learning
Ortner, R., Maillard, O.-A. & Ryabko, D., 2014, Algorithmic Learning Theory - 25th International Conference, ALT 2014, Bled, October 8-10, 2014. p. 140-154
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Trackingvarianten für Augmented Reality Kommissioniersysteme
Heily, D., 2014
Research output: Thesis › Master's Thesis
2013
Published
Adaptive Aggregation for Reinforcement Learning in Average Reward Markov Decision Processes
Ortner, R., 2013, In: Annals of operations research. 208, p. 321-336
Research output: Contribution to journal › Article › Research › peer-review
Published
Beating Bandits in Gradually Evolving Worlds
Chiang, C.-K., 2013, Conference on Learning Theory. Shalev-Shwartz, S. & Steinwart, I. (eds.). p. 210-227 (JMLR Workshop and Conference Proceedings; vol. 30).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Competing with an Infinite Set of Models in Reinforcement Learning
Nguyen, P., Maillard, O.-A., Ryabko, D. & Ortner, R., 2013, JMLR Workshop and Conference Proceedings Volume 31 : Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics. p. 463-471
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Linear regression with random projections.
Maillard, O.-A., 2013, In: Journal of machine learning research (JMLR). 13, p. 1-1
Research output: Contribution to journal › Article › Research › peer-review
Published
Optimal regret bounds for selecting the state representation in reinforcement learning.
Maillard, O.-A., Nguyen, P., Ortner, R. & Ryabko, D., 2013, JMLR Workshop and Conference Proceedings Volume 28 : Proceedings of The 30th International Conference on Machine Learning. p. 543-551
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Reinforcement Learning (Dagstuhl Seminar 13321)
Auer, P., 2013, In: Dagstuhl Reports. 3, p. 1-26
Research output: Contribution to journal › Article › Research › peer-review
2012
Published
Autonomous Exploration For Navigating In MDPs.
Lim, S. H. & Auer, P., 2012, Proceedings of the 25th Annual Conference on Learning Theory. p. 40.1-40.24
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Entwicklung einer Simulation für Kommissioniersysteme
Salmutter, A., 2012
Research output: Thesis › Master's Thesis
Published
Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments
Seldin, Y., Szepesvári, C., Auer, P. & Abbasi-Yadkori, Y., 2012, Proceedings of the Tenth European Workshop on Reinforcement Learning, EWRL 2012. p. 103-116 (JMLR proceedings).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Online Optimization with Gradual Variations
Chiang, C.-K., 2012, COLT 2012: Proceedings of the 25th Annual Conference on Learning Theory June 25-27, 2012, Edinburgh, Scotland. Mannor, S., Srebro, N. & Williamson, R. C. (eds.). p. 6.1-6.20 (JMLR Workshop and Conference Proceedings; vol. 23).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Ortner, R. & Ryabko, D., 2012, Advances in Neural Information Processing Systems 25. MIT Press, p. 1772-1780
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Ortner, R. & Ryabko, D., 2012.
Research output: Contribution to conference › Poster › Research › peer-review
Published
PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits.
Seldin, Y., Cesa-Bianchi, N., Auer, P., Laviolette, F. & Shawe-Taylor, J., 2012, Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2. p. 98-111
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
PAC-Bayesian Analysis of Contextual Bandits
Seldin, Y., Auer, P., Laviolette, F., Shawe-Taylor, J. S. & Ortner, R., 2012, Advances in Neural Information Processing Systems 24. MIT Press, p. 1683-1691
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
PAC-Bayesian Inequalities for Martingales
Seldin, Y., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, UAI 2012.
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
PAC-Bayesian Inequalities for Martingales.
Seldin, Y., Laviolette, F., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, In: IEEE transactions on information theory. 58, p. 7086-7093
Research output: Contribution to journal › Article › Research › peer-review
Published
PAC Subset Selection in Stochastic Multi-armed Bandits
Kalyanakrishnan, S., Tewari, A., Auer, P. & Stone, P., 2012, Proceedings of the 29th International Conference on Machine Learning, ICML 2012.
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
