Constructing a language for testing Reinforcement Learning programs using NLP techniques

Medina Afanador, Luis Alejandro

Publicación:
Constructing a language for testing Reinforcement Learning programs using NLP techniques

authorProfile.id.code	200215739
dc.contributor.advisor	Cardozo Álvarez, Nicolás
dc.contributor.author	Medina Afanador, Luis Alejandro
dc.contributor.jury	Dusparic, Ivana
dc.contributor.jury	Manrique Piramanrique, Rubén Francisco
dc.date.accessioned	2025-07-30T16:25:06Z
dc.date.available	2025-07-30T16:25:06Z
dc.date.issued	2025-07-28
dc.description.abstract	Probar el buen funcionamiento de un Agente en Reinforcement Learning es un desafío, esta tesis plantea usar técnicas de NLP para generar casos de prueba en donde la posibilidad de fallos es más probable, interpretando que la interacción entre un agente y el ambiente es un lenguaje o una secuencia según convenga.
dc.description.degreelevel	Maestría
dc.format.extent	66 páginas
dc.format.mimetype	application/pdf
dc.identifier.instname	instname:Universidad de los Andes
dc.identifier.reponame	reponame:Repositorio Institucional Séneca
dc.identifier.repourl	repourl:https://repositorio.uniandes.edu.co/
dc.identifier.uri	https://hdl.handle.net/1992/76816
dc.language.iso	eng
dc.publisher	Universidad de los Andes
dc.publisher.department	Departamento de Ingeniería de Sistemas y Computación
dc.publisher.faculty	Facultad de Ingeniería
dc.publisher.program	Maestría en Ingeniería de Sistemas y Computación
dc.relation.references	Lionel C. Briand Fellow IEEE Mojtaba Bagherzadeh Amirhossein Zolfagharian, Manel Abdellatif and Ramesh S. A Search-Based Testing Approach for Deep Reinforcement Learning Agents. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, vol. 49, no. 7 edition, 2023. ISBN 0-201-37921-X.
dc.relation.references	Matteo Biagiola and Paolo Tonella. Testing the Plasticity of Reinforcement Learning Based Systems. Università della Svizzera italiana, Switzerland, revised edition, 2022. ISBN 0-201- 37921-X.
dc.relation.references	Matteo Biagiola and Paolo Tonella. Testing of Deep Reinforcement Learning Agents with Surrogate Models. Università della Svizzera italiana, Switzerland, revised edition, 2023. ISBN 0-201-37921-X.
dc.relation.references	Kapil Chauhan. CartPole_DQN: CartPole_v0 Jupyter Notebook. https://github. com/kapilnchauhan77/CartPole_DQN/blob/master/CartPole_v0.ipynb, 2019. Accessed: May 15, 2025.
dc.relation.references	Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning. MIT Press, 2016. http://www.deeplearningbook.org.
dc.relation.references	IBM. What is natural language processing? https://www.ibm.com/think/topics/ natural-language-processing, n.d. Accessed: May 23, 2025.
dc.relation.references	Mahmood Khordoo. Deep-Reinforcement-Learning-with-PyTorch: nstep DQN for LunarLander-v2. https://github.com/khordoo/ Bibliography Deep-Reinforcement-Learning-with-PyTorch/blob/example/examples/DQN/ lunarlander_v2-dqn-n-step.py, 2020. Accessed: May 15,2025.
dc.relation.references	Shakti Kumar. adaptiveSystems: RL_Benchmarks README. https://github. com/shaktikshri/adaptiveSystems/blob/master/RL_Benchmarks/README.md, 2019. Accessed: May 15, 2025.
dc.relation.references	Emmanouil D. Oikonomou, Petros Karvelis, Nikolaos Giannakeas, Aristidis Vrachatis, Evripidis Glavas, and Alexandros T. Tzallas. How natural language processing derived techniques are used on biological data: a systematic review. Network Modeling Analysis in Health Informatics and Bioinformatics, 13(23), 2024. DOI 10.1007/s13721-024-00458-1.
dc.relation.references	Alec Radford, Karthik Narasimhan, Tim Salimans, and Ilya Sutskever. Improving language understanding by generative pre-training. https://cdn.openai.com/research-covers/ language-unsupervised/language_understanding_paper.pdf, 2018. OpenAI Report.
dc.relation.references	Nihal T. Rao. RL-Double-DQN: Double DQN Implementation for CartPole-v0. https: //github.com/nihal-rao/RL-Double-DQN, 2020. Accessed: May 15, 2025.
dc.relation.references	Sigve Rokenes. learning-rl/gym/lunarlander-v2: DQN Example for LunarLander-v2. https: //github.com/evgiz/learning-rl/tree/master/gym/lunarlander-v2, 2019. Accessed: May 15, 2025.
dc.relation.references	Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. The MIT Press Cambridge, Massachusetts,London, England, revised edition, 2015. ISBN 0-201- 37921-X.
dc.relation.references	Sanket Thakur. LunarLander_DQN: DQN Implementation for LunarLanderv2. https://github.com/sanketsans/openAIenv/blob/master/DQN/LunarLander/ LunarLander_DQN.ipynb, 2020. Accessed: May 15, 2025.
dc.relation.references	Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. Attention is all you need. In Advances in Neural Information Processing Systems, volume 30. Curran Asso- 64 Bibliography ciates, Inc., 2017. URL https://papers.nips.cc/paper_files/paper/2017/file/ 3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf.
dc.rights	Attribution 4.0 International	en
dc.rights.accessrights	info:eu-repo/semantics/openAccess
dc.rights.coar	http://purl.org/coar/access_right/c_abf2
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/
dc.subject.keyword	Testing Reinforcement Learning
dc.subject.keyword	Reinforcement Learning
dc.subject.keyword	Testing Reinforcement Learning with NLP
dc.subject.keyword	NLP testing
dc.subject.themes	Ingeniería	spa
dc.title	Constructing a language for testing Reinforcement Learning programs using NLP techniques
dc.title.alternative	Constructing a language for testing RL
dc.type	Trabajo de grado - Maestría
dc.type.coar	http://purl.org/coar/resource_type/c_bdcc
dc.type.coarversion	http://purl.org/coar/version/c_ab4af688f83e57aa
dc.type.content	Text
dc.type.driver	info:eu-repo/semantics/masterThesis
dc.type.redcol	https://purl.org/redcol/resource_type/TM
dc.type.version	info:eu-repo/semantics/acceptedVersion
dspace.entity.type	Publication
person.identifier.gsid	https://scholar.google.es/citations?user=3iTzjQsAAAAJ
person.identifier.orcid	0000-0002-1094-9952
relation.isDirectorOfPublication	a77ff528-fc33-44d6-9022-814f81ef407a
relation.isDirectorOfPublication.latestForDiscovery	a77ff528-fc33-44d6-9022-814f81ef407a