Publicación: Constructing a language for testing Reinforcement Learning programs using NLP techniques
| authorProfile.id.code | 200215739 | |
| dc.contributor.advisor | Cardozo Álvarez, Nicolás | |
| dc.contributor.author | Medina Afanador, Luis Alejandro | |
| dc.contributor.jury | Dusparic, Ivana | |
| dc.contributor.jury | Manrique Piramanrique, Rubén Francisco | |
| dc.date.accessioned | 2025-07-30T16:25:06Z | |
| dc.date.available | 2025-07-30T16:25:06Z | |
| dc.date.issued | 2025-07-28 | |
| dc.description.abstract | Probar el buen funcionamiento de un Agente en Reinforcement Learning es un desafío, esta tesis plantea usar técnicas de NLP para generar casos de prueba en donde la posibilidad de fallos es más probable, interpretando que la interacción entre un agente y el ambiente es un lenguaje o una secuencia según convenga. | |
| dc.description.degreelevel | Maestría | |
| dc.format.extent | 66 páginas | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.instname | instname:Universidad de los Andes | |
| dc.identifier.reponame | reponame:Repositorio Institucional Séneca | |
| dc.identifier.repourl | repourl:https://repositorio.uniandes.edu.co/ | |
| dc.identifier.uri | https://hdl.handle.net/1992/76816 | |
| dc.language.iso | eng | |
| dc.publisher | Universidad de los Andes | |
| dc.publisher.department | Departamento de Ingeniería de Sistemas y Computación | |
| dc.publisher.faculty | Facultad de Ingeniería | |
| dc.publisher.program | Maestría en Ingeniería de Sistemas y Computación | |
| dc.relation.references | Lionel C. Briand Fellow IEEE Mojtaba Bagherzadeh Amirhossein Zolfagharian, Manel Abdellatif and Ramesh S. A Search-Based Testing Approach for Deep Reinforcement Learning Agents. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, vol. 49, no. 7 edition, 2023. ISBN 0-201-37921-X. | |
| dc.relation.references | Matteo Biagiola and Paolo Tonella. Testing the Plasticity of Reinforcement Learning Based Systems. Università della Svizzera italiana, Switzerland, revised edition, 2022. ISBN 0-201- 37921-X. | |
| dc.relation.references | Matteo Biagiola and Paolo Tonella. Testing of Deep Reinforcement Learning Agents with Surrogate Models. Università della Svizzera italiana, Switzerland, revised edition, 2023. ISBN 0-201-37921-X. | |
| dc.relation.references | Kapil Chauhan. CartPole_DQN: CartPole_v0 Jupyter Notebook. https://github. com/kapilnchauhan77/CartPole_DQN/blob/master/CartPole_v0.ipynb, 2019. Accessed: May 15, 2025. | |
| dc.relation.references | Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning. MIT Press, 2016. http://www.deeplearningbook.org. | |
| dc.relation.references | IBM. What is natural language processing? https://www.ibm.com/think/topics/ natural-language-processing, n.d. Accessed: May 23, 2025. | |
| dc.relation.references | Mahmood Khordoo. Deep-Reinforcement-Learning-with-PyTorch: nstep DQN for LunarLander-v2. https://github.com/khordoo/ Bibliography Deep-Reinforcement-Learning-with-PyTorch/blob/example/examples/DQN/ lunarlander_v2-dqn-n-step.py, 2020. Accessed: May 15,2025. | |
| dc.relation.references | Shakti Kumar. adaptiveSystems: RL_Benchmarks README. https://github. com/shaktikshri/adaptiveSystems/blob/master/RL_Benchmarks/README.md, 2019. Accessed: May 15, 2025. | |
| dc.relation.references | Emmanouil D. Oikonomou, Petros Karvelis, Nikolaos Giannakeas, Aristidis Vrachatis, Evripidis Glavas, and Alexandros T. Tzallas. How natural language processing derived techniques are used on biological data: a systematic review. Network Modeling Analysis in Health Informatics and Bioinformatics, 13(23), 2024. DOI 10.1007/s13721-024-00458-1. | |
| dc.relation.references | Alec Radford, Karthik Narasimhan, Tim Salimans, and Ilya Sutskever. Improving language understanding by generative pre-training. https://cdn.openai.com/research-covers/ language-unsupervised/language_understanding_paper.pdf, 2018. OpenAI Report. | |
| dc.relation.references | Nihal T. Rao. RL-Double-DQN: Double DQN Implementation for CartPole-v0. https: //github.com/nihal-rao/RL-Double-DQN, 2020. Accessed: May 15, 2025. | |
| dc.relation.references | Sigve Rokenes. learning-rl/gym/lunarlander-v2: DQN Example for LunarLander-v2. https: //github.com/evgiz/learning-rl/tree/master/gym/lunarlander-v2, 2019. Accessed: May 15, 2025. | |
| dc.relation.references | Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. The MIT Press Cambridge, Massachusetts,London, England, revised edition, 2015. ISBN 0-201- 37921-X. | |
| dc.relation.references | Sanket Thakur. LunarLander_DQN: DQN Implementation for LunarLanderv2. https://github.com/sanketsans/openAIenv/blob/master/DQN/LunarLander/ LunarLander_DQN.ipynb, 2020. Accessed: May 15, 2025. | |
| dc.relation.references | Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. Attention is all you need. In Advances in Neural Information Processing Systems, volume 30. Curran Asso- 64 Bibliography ciates, Inc., 2017. URL https://papers.nips.cc/paper_files/paper/2017/file/ 3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf. | |
| dc.rights | Attribution 4.0 International | en |
| dc.rights.accessrights | info:eu-repo/semantics/openAccess | |
| dc.rights.coar | http://purl.org/coar/access_right/c_abf2 | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject.keyword | Testing Reinforcement Learning | |
| dc.subject.keyword | Reinforcement Learning | |
| dc.subject.keyword | Testing Reinforcement Learning with NLP | |
| dc.subject.keyword | NLP testing | |
| dc.subject.themes | Ingeniería | spa |
| dc.title | Constructing a language for testing Reinforcement Learning programs using NLP techniques | |
| dc.title.alternative | Constructing a language for testing RL | |
| dc.type | Trabajo de grado - Maestría | |
| dc.type.coar | http://purl.org/coar/resource_type/c_bdcc | |
| dc.type.coarversion | http://purl.org/coar/version/c_ab4af688f83e57aa | |
| dc.type.content | Text | |
| dc.type.driver | info:eu-repo/semantics/masterThesis | |
| dc.type.redcol | https://purl.org/redcol/resource_type/TM | |
| dc.type.version | info:eu-repo/semantics/acceptedVersion | |
| dspace.entity.type | Publication | |
| person.identifier.gsid | https://scholar.google.es/citations?user=3iTzjQsAAAAJ | |
| person.identifier.orcid | 0000-0002-1094-9952 | |
| relation.isDirectorOfPublication | a77ff528-fc33-44d6-9022-814f81ef407a | |
| relation.isDirectorOfPublication.latestForDiscovery | a77ff528-fc33-44d6-9022-814f81ef407a |
Archivos
Bloque de licencias
1 - 1 de 1
No hay miniatura disponible
- Nombre:
- license.txt
- Tamaño:
- 2.48 KB
- Formato:
- Item-specific license agreed upon to submission
- Descripción: