Alberto Barrón Cedeño

This section contains a list of publications and invited talks including. A bibtex with proper references to most publications is available at the bottom.



2015

Flores, Barrón-Cedeño, Moreno, Rosso. Cross-Language Source Code Re-Use Detection Using Latent Semantic Analysis Journal of Universal Computer Science 21(13), pages 1708-1725.


Formiga, Barrón-Cedeño, Màrquez, Henríquez and Mariño (2015) Leveraging Online User Feedback to Improve Statistical Machine Translation Journal of Artificial Intelligence Research 54, pages 159-192


Joty, Barrón-Cedeño, Da San Martino, Filice, Màrquez, Moschitti, Nakov. Global Thread-level Inference for Comment Classification in Community Question Answering In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP 2015), pp. 573--578, Lisbon, Portugal


Barrón-Cedeño, Filice, Da San Martino, Joty, Màrquez, Nakov, Moschitti. Thread-Level Information for Comment Classification in Community Question Answering. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP-2015), Beijing, China.


Barrón-Cedeño, España-Bonet, Boldoba, and Màrquez A Factory of Comparable Corpora from Wikipedia. In Proceedings of the 8th Workshop on Building and Using Comparable Corpora (BUCC 2015), Beijing, China.


Belinkov, Barrón-Cedeño, and Mubarak. Answer Selection in Arabic Community Question Answering: A Feature-Rich Approach. In Proceedings of the 2nd Arabic Natural Language Processing Workshop (WANLP 2015), Beijing, China.


Nicosia, Filice, Barrón-Cedeño, Saleh, Mubarak, Gao, Nakov, Da San Martino, Moschitti, Darwish, M\`arquez, Joty and Magdy QCRI: Answer Selection for Community Question Answering - Experiments for Arabic and English. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015). Association for Computational Linguistics. Denver, Colorado.



2014

Stamatatos, Daelemans, Verhoeven, Potthast,,Stein, Juola, Sanchez-Perez, and Barrón-Cedeño. Overview of the Author Identification Task at PAN 2014. In: CLEF 2014 Evaluation Labs and Workshop – Working Notes Papers. CEUR Workshop Proceedings. CLEF and CEUR-WS.org (September 2014),


Barrón-Cedeño Software para la detección de plagio académico. In: El plagio académico en Educación Secundaria: características del fenómeno y estrategias de intervención, pp. 85--100, Grupo de Investigación Educación y Ciudadanía de la Universidad de las Islas Baleares, 2014.


Flores, Barrón-Cedeño, Moreno, Rosso. Uncovering Source Code Reuse in Large-Scale Academic Environments Computer Applications in Engineering and Education 23(3), pp. 383–390. DOI: 10.1002/cae.21608


González, Barrón-Cedeño, Màrquez. IPA and STOUT: Leveraging Linguistic and Source-based Features for Machine Translation Evaluation. . In: Proc of the Ninth Workshop on Statistical Machine Translation, pp. 394--401, Baltimore, Maryland USA, June 26–27, 2014. Association for Computational Linguistics


Barrón-Cedeño, Lestari-Paramita, Clough, Rosso. A Comparison of Approaches for Measuring Cross-Lingual Similarity of Wikipedia Articles In: Proc. 36th European Conf. on Information Retrieval, ECIR-2014, Springer-Verlag, LNCS(8416), pp. 424-429


Flores, Barrón-Cedeño, Moreno, Rosso. Source Code Re-Use Detection. In: Proc. 3rd Spanish Conf. on Information Retrieval, CERI-2014, A Coruña, Spain, June 19-20


Boldoba, Barrón-Cedeño, España-Bonet. Wikicardi : hacia la extracción de oraciones paralelas de Wikipedia Technical Report LSI-14-3-R. Universitat Politècnica de Catalunya, 2014



2013

Barrón-Cedeño, Gupta, and Rosso. Methods for cross-language plagiarism detection. Knowledge-Based Systems Volume 50, September 2013, Pages 211-217, ISSN 0950-7051.


Barrón-Cedeño, Vila, Martí, and Rosso Plagiarism meets paraphrasing: Insights for the next generation in automatic plagiarism detection . Computational Linguistics 39(4): 917--947 (accepted November 2012)


Barrón-Cedeño, Màrquez, Henríquez, Formiga, Merino, May. Identifying Useful Human Correction Feedback from an On-line Machine Translation Service . In: Proc. of the 23rd International Joint Conference on Artificial Intelligence (IJCAI), 2013


Formiga, Ruiz Costa-Jussà, Mariño Rodríguez, Barr&ocaute;n-Cedeño, Màrquez. The TALP-UPC phrase-based translation systems for WMT13: system combination with morphology generation, domain adaptation and corpus filtering . In: Proceedings of the Eighth Workshop on Statistical Machine Translation. Sofia: 2013, p. 134-140.


Barrón-Cedeño, Màrquez, Fuentes, Rodríguez, Turmo. UPC-CORE: What Can Machine Translation Evaluation Metrics and Wikipedia Do for Estimating Semantic Textual Similarity?. In: Proc. of the Second Joint Conference on Lexical and Computational Semantics (*SEM), Atlanta, GA, 2013


Formiga, González, Barrón-Cedeño, Fonollosa, Màrquez. The TALP-UPC Approach to System Selection: ASIYA Features and Pairwise Classification using Random Forests. Eighth Workshop on Statistical Machine Translation. Quality Estimation Task, 2013.


Barrón-Cedeño, Rosso, Lalitha Devi, Clough, Stevenson. PAN@FIRE: Overview of the Cross-Language !ndian Text Re-Use Detection Competition. Multilingual Information Access in South Asian Languages. LNCS(7536), 2013, pp 59-70


Invited talk. Detection of (Cross-Language) Text Re-Use and Plagiarism. Norwegian University of Science and Technology, Trondheim, Norway, May 24th.

Invited talk. Uncovering Good Feedback Instances from an On-line Machine Translation System. Wimmics Seminar, INRIA-Sophia Antipolis, France, April 26th.


2012

Flores, Barrón-Cedeño, Rosso, Moreno DeSoCoRe: Detecting Source Code Re-Use across Programming Languages In: Proc. 12th Int. Conf. of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies,NAACL-2012, Montreal, Canada, June 3-8, pp. 1-4 2012


Gupta, Barrón-Cedeño, Rosso.Cross-language High Similarity Search using a Conceptual Thesaurus In 3rd Int. Conf. of CLEF on Information Access Evaluation meets Multilinguality, Multimodality, and Visual Analytics, CLEF 2012, Springer-Verlag, LNCS(7488), pp. 67-75 2012


Potthast, Gollub, Hagen, Grassegger, Kiesel, Michel, Oberländer, Tippmann, Barrón-Cedeño, Gupta, Rosso, Stein Overview of the 4th International Competition on Plagiarism Detection . In: Notebook Papers of CLEF 2012 LABs and Workshops, CLEF-2012, Rome, Italy, September 17-20 2012


.

Rodríguez-Torrejón, Barrón-Cedeño, Sidorov, Martín-Ramos, Rosso Influencia del diccionario en la traducción para la detección de plagio translingüe. In: Proc. 2nd Spanish Conf. on Information Retrieval, CERI-2012, Valencia, Spain, June 18-19, pp. 301-302


Alberto Barrón-Cedeño. On the Mono- and Cross-Language Detection of Text Re-Use and Plagiarism. Ph.D. dissertation. Universitat Politènica de València (Spain). 2012


Selected talk.Two Instances of Multilingual Natural Language Processing. ERCIM ABCDE Seminar. INRIA, Sophia Antipolis, France. October, 2012


Invited talk. On the Mono- and Cross-Language Detection of Plagiarism and Text Re-Use. NLP Seminar, Universitat Politecnica de Catalunya, Barcelona, Spain. July 11th (right before joining UPC!)


2011

Benno Stein, Martin Potthast, Paolo Rosso, Alberto Barrón-Cedeño, Efstathios Stamatatos,and Moshe Koppel. Fourth International Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse. ACM SIGIR Forum 45, no. 1 (May 2011): 45-48. DOI: 10.1145/1988852.1988860, 2011


Martin Potthast, Alberto Barrón-Cedeño, Benno Stein, Paolo Rosso. Cross-Language Plagiarism Detection. Language Resources and Evaluation, Special Issue on Plagiarism and Authorship Analysis, vol. 45, num. 1. DOI: 10.1007/s10579-009-9114-z, 2011


Enrique Flores, Alberto Barrón-Cedeño, Paolo Rosso, Lidia Moreno. Towards the Detection of Cross-Language Source Code Reuse. In: Proc. 16th Int. Conf. on Applications of Natural Language to Information Systems, NLDB-2011, Springer-Verlag, LNCS(6716), pp. 250-253


J.A. Silvestre-Cerdà , M. García-Martínez, Alberto Barrón-Cedeño, Jorge Civera, Paolo Rosso. Extracción de corpus paralelos de la Wikipedia basada en la obtención de alineamientos bilingües a nivel de frase. In: Proc. SEPLN Workshop on Iberian Cross-Language NLP tasks (ICL), CEUR-WS.org, vol. 824, pp. 14-21


Enrique Flores, Alberto Barrón-Cedeño, Paolo Rosso P, Lidia Moreno. Detecting Source Code Reuse across Programming Languages. Poster at Conf. of Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN), Huelva, Spain, 5-7 September


Enrique Flores, Alberto Barrón-Cedeño, Paolo Rosso, Lidia Moreno. Detección de reutilización de código fuente entre lenguajes de programación en base a la frecuencia de términos. In: Proc. IV Jornadas PLN-TIMM, Torres, Jaén, Spain, Abril 7-8, pp.21-26


Paolo Rosso, Alberto Barrón-Cedeño, Marta Vila, Jorge Civera, Anabela, I. Alegría (eds.) SEPLN Workshop on Iberian Cross-Language Natural Language Processing Tasks, ICL-2011. CEUR Workshop Proceedings. CEUR-WS.org, September 2011. http://ceur-ws.org/Vol-824


Invited talk. Automatic Detection of Plagiarism: Cut and Paste, Paraphrases and Translation. Computational Linguistics in Flanders (CLIF), Universiteit Antwerpen, Antwerp, Belgium, September 16th.


Seminar. Plagiarism Detection (including mono- and cross-Language text similarity measures, cross-language plagiarism detection, plagiarism detection competition: PAN 2009 and 2010). National Institute of Astrophysics, Optics and Electronics, Puebla, Mexico, July 7th and 8th.


Invited talk. Plagiarism Detection Overview. National Polytechnic Institute, Mexico City, Mexico, June.


2010

Alberto Barrón-Cedeño, Marta Vila, and Paolo Rosso Detección automática de plagio: De la copia exacta a la paráfrasis. Panorama actual de la lingüística forense en el ámbito legal y policial: Teoría y práctica. (Jornadas (in)formativas de lingüística forense)m pp. 76--96. Madrid. Euphonia Ediciones SL.2010


Martin Potthast, Alberto Barrón-Cedeño, Andreas Eiselt, Benno Stein, and Paolo Rosso. Overview of the 2nd International Competition on Plagiarism Detection. In Martin Braschler and Donna Harman, editors, Notebook Papers of CLEF 2010 LABs and Workshops. Padua, Italy, 22-23 September 2010


Alberto Barrón-Cedeño, Paolo Rosso, Eneko Agirre, Gorka Labaka. Plagiarism Detection across Distant Language Pairs. Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010). Beijing, China, 2010. Association for Computational Linguistics


Martin Potthast, Benno Stein, Alberto Barrón-Cedeño, and Paolo Rosso. An Evaluation Framework for Plagiarism Detection. Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010). Beijing, China, 2010. Association for Computational Linguistics


Alberto Barrón-Cedeño. On the Mono- and Cross-Language Detection of Text Reuse and Plagiarism. Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (Doctoral Consortium). Geneva, Switzerland, 2010


Alberto Barrón-Cedeño, Paolo Rosso. Towards the 2nd International Competition on Plagiarism Detection and Beyond. Proceedings of the 4th International Plagiarism Conference. Newcastle upon Tyne, UK, 2010


Alberto Barrón-Cedeño, Martin Potthast, Paolo Rosso, Benno Stein, Andreas Eiselt. Corpus and Evaluation Measures for Automatic Plagiarism Detection. Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10). La Valletta, Malta. European Language Resources Association (ELRA),2010


Grigori Sidorov, Alberto Barrón-Cedeño, and Paolo Rosso. English-Spanish Large Statistical Dictionary of Inflectional Forms. Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10). La Valletta, Malta. European Language Resources Association (ELRA),2010


Alberto Barrón-Cedeño, Chiara Basile, Mirko Degli Esposti, Paolo Rosso. Word Length n-grams for Text Re-Use Detection. In: Gelbukh A. (ed.) CICLing 2010, LNCS (6008), pp. 687-699, Springer-Verlag, 2010.


Tutorial. Detection of Plagiarism and Text Reuse (with Paolo Rosso). 8th International Conference on Natural Language Processing (ICON 2010), IIT Kharagpur, India, December 11th.


Invited talk. Detection of Plagiarism. MIT Campus of Anna University, Chennai, India, December 6th.


Invited talk. Detection of Plagiarism and Text Reuse. Tsinghua University, Beijing, China. August 30th.


Invited talk. The Quest of a Model for Cross-Language Plagiarism Detection. IR Talk, University of Sheffield, UK. July 16th.



2009

Alberto Barrón-Cedeño, Andreas Eiselt, and Paolo Rosso. Monolingual Text Similarity Measures: A comparison of Models over Wikipedia Articles Revisions. In: Proceedings of ICON-2009: 7th International Conference on Natural Language Processing, pp. 29-38, Hyderabad, India, 2009. Macmillan Publishers


Martin Potthast, Benno Stein, Andreas Eiselt, Alberto Barrón-Cedeño, and Paolo Rosso. Overview of the 1st International Competition on Plagiarism Detection. In: Benno Stein, Paolo Rosso, Efstathios Stamatatos, Moshe Koppel, and Eneko Agirre, editors, SEPLN 2009 Workshop on Uncovering Plagiarism, Authorship and Social Software Misuse (PAN 09), pp. 1-9, Donostia-San Sebastian, Spain, September 2009. CEUR-WS.org. ISSN 163-0073.


Alberto Barrón-Cedeño and Paolo Rosso. On the Relevance of Search Space Reduction in Automatic Plagiarism Detection. Procesamiento del Lenguaje Natural 43, pp. 141-149 (2009).


David Pinto, Jorge Civera, Alberto Barrón-Cedeño, Alfons Juan, and Paolo Rosso. A statistical approach to crosslingual natural language tasks. J. Algorithms 64(1), pp. 51-60 (2009) doi:10.1016/j.jalgor.2009.02.005


Alberto Barrón-Cedeño and Paolo Rosso. On Automatic Plagiarism Detection based on n-grams Comparison. In: Boughanem et al. (Eds.) ECIR 2009, LNCS 5478, pp. 696-700, Springer-Verlag Berlin Heidelberg (2009)


Alberto Barrón-Cedeño, Paolo Rosso, and José-Miguel Benedí. Reducing the Plagiarism Detection Search Space on the Basis of the Kullback-Leibler Distance. In: Gelbukh A. (ed.) CICLing 2009, LNCS 5449, pp. 523-534 Springer-Verlag (2009)


Alberto Barrón-Cedeño, Gerardo Sierra, Patrick Drouin, and Sophia Ananiadou. An Improved Automatic Term Recognition Method for Spanish. In: Gelbukh A. (Ed.) CICLing 2009, LNCS 5449, pp. 125-136, Springer Verlag (2009)


Invited talk. The Non-Trivial Problem of Automatic Plagiarism Detection. What we have been doing at the Technical University of Valencia. Mathematical Physics Seminars, Università di Bologna, Italy. November 30th and December 2nd.


Invited talk. Plagiarism Detection. University of Guanajuato, Guanajuato, Mexico. September.


2008

Alberto Barrón-Cedeño. Detección automática de plagio en texto. Master's thesis, Universidad Politécnica de Valencia (Spain), 2008. Winner of the MAVIR prize to the best MSc degree thesis on Language Technologies and Scientific Comunication through the Web. (press release).


David Pinto, Jorge Civera, Alfons Juan, Paolo Rosso, and Alberto Barrón-Cedeño. A statistical approach to crosslingual natural language tasks. In: Proceedings of the 4th Latin American Workshop on Non-Monotonic Reasoning, LANMR-2008, Puebla, Mexico, October 22-24


Alberto Barrón-Cedeño, Paolo Rosso, David Pinto, and Alfons Juan. On cross-lingual plagiarism analysis using a statistical model. In: Proceedings of the ECAI'08 PAN Workshop: Uncovering Plagiarism, Authorship and Social Software Misuse, pp. 9-13. Patras, Greece (2008). ISBN 978-960-6843-08-2. ISSN 1613-0073.


Alberto Barrón-Cedeño and Paolo Rosso. Towards the exploitation of statistical language models for plagiarism detection with reference. In: Proceedings of the ECAI'08 PAN Workshop: Uncovering Plagiarism, Authorship and Social Software Misuse, pp. 15-19. Patras, Greece (2008). ISBN 978-960-6843-08-2. ISSN 1613-0073


Alberto Barrón-Cedeño, Gerardo Sierra, and Nicolás Kemper. Can TF-IDF and Fuzzy Logic Improve Onomasiological Inference Ranking? Or Keywords Frequency is Good Enough?. In: Li, Chen, Xu, Li (eds.) Advances on Applied Computer and Applied Computational Science. Proceedings of the 7th WSEAS International Conference on Applied Computer and Applied Computational Science, pp. 358-364. Hangzhou, China, April 6-8 (2008) ISBN: 978-690-6766-49-7


2007

Alberto Barrón Cedeño Extracción automática de términos en contextos definitorios. Tesis de Maestría, Posgrado en Ciencia e Ingeniería en Computación. Universidad Nacional Autónoma de México (México), 2007.


Alberto Barrón-Cedeño. Métodos para la obtención automática de términos en un área de especialidad. In: 3er. Coloquio de Lingüística Computacional, México, Mexico.


2006

Alberto Barrón, Gerardo Sierra, and Elio Villaseñor. C-value aplicado a la extracción de términos multipalabra en documentos técnicos y científicos en español. 7th Mexican International Conference on Computer Science (ENC 2006), IEEE Computer Press, San Luis Potosí, México


Gerardo Sierra, Rodrigo Alarcón, César Aguilar, and Alberto Barrón. Towards the Elaboration of a Corpus on Definitional Contexts. In: Proceedings of the 12th EURALEX International Congress. ISBN 88-7694-918-6


Gerardo Sierra, Rodrigo Alarcón, César Aguilar, Alberto Barrón, Valeria Benítez, and Itzia Baca. Corpus de contextos definitorios: una herramienta para la lexicografía y la terminología. In: X Simposio Iberoamericano de Terminología (RITERM 2006). Montevideo, Uruguay