ISSN 2071-8594

Russian Academy of Sciences

Editor-in-Chief

Academician Stanislav Emelyanov

I.V. Smirnov, A.O. Shelmanov, E.S. Kuznecova, I.V. Khramoin. Semantic-syntactic analysis of natural languages. Part II. Method for semantic-syntactic analysis of texts

Abstract.

This paper addresses the problem of semantic-syntactic parsing of natural-language texts. We review approaches and methods for semantic-syntactic analysis. We describe a syntactic parser for Russian texts built with MaltParser, together with an experimental study of the features that influence the performance of syntactic parsing of Russian texts. We describe a semantic role labeling system based on the method of relational-situational analysis and present its evaluation results on Russian texts. We then consider a method for joint semantic-syntactic analysis and present its evaluation results on Russian texts. Finally, we compare the system based on joint analysis with a system that performs syntactic parsing and semantic role labeling separately, one after the other.

Keywords:

semantic-syntactic parsing, machine learning, syntactic parsing, semantic role labeling, parser.

PP. 11-24.

