{"id":655,"date":"2024-11-01T13:19:05","date_gmt":"2024-11-01T13:19:05","guid":{"rendered":"https:\/\/nlp.pef.mendelu.cz\/?page_id=655"},"modified":"2025-03-14T13:40:34","modified_gmt":"2025-03-14T13:40:34","slug":"en","status":"publish","type":"page","link":"https:\/\/nlp.pef.mendelu.cz\/index.php\/en\/","title":{"rendered":"Home"},"content":{"rendered":"\n<p class=\"has-text-align-left\"><a href=\"https:\/\/nlp.pef.mendelu.cz\/index.php\/\">Czech version<\/a><\/p>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-1 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-vertically-aligned-center is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:100%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<h2 class=\"wp-block-heading alignfull has-text-align-center has-grey-color has-text-color\" id=\"natural-language-processing-group\">Natural Language Processing Group<\/h2>\n<\/div><\/div>\n\n\n\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\"><div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"975\" height=\"543\" src=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/07\/img-13.png\" alt=\"\" class=\"wp-image-556\" srcset=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/07\/img-13.png 975w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/07\/img-13-300x167.png 300w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/07\/img-13-768x428.png 768w\" sizes=\"auto, (max-width: 975px) 100vw, 975px\" \/><\/figure><\/div><\/div><\/div>\n\n\n\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<div class=\"wp-block-group\"><div 
class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<p class=\"has-text-align-center subTitle\">Gathering and revealing the information<\/p>\n<\/div><\/div>\n<\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-3 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:15%\"><\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:70%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<h2 class=\"wp-block-heading\" id=\"kdo-jsme\">Who are we?<\/h2>\n<\/div><\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-2 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-vertically-aligned-center is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:50%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<p>We are a working group dedicated to natural language processing. We focus on semantic processing of electronic text data produced in natural languages with the goal of uncovering knowledge hidden in the data. In particular, we use machine learning methods. We develop web and mobile applications that use text mining algorithms to solve problems for everyday users and customers. 
Our solutions are also used by commercial companies.<\/p>\n<\/div><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-vertically-aligned-center is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:50%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\"><div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"628\" src=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/Work_chat-1024x628.png\" alt=\"\" class=\"wp-image-401\" srcset=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/Work_chat-1024x628.png 1024w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/Work_chat-300x184.png 300w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/Work_chat-768x471.png 768w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/Work_chat.png 1189w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure><\/div>\n\n\n<p><\/p>\n<\/div><\/div>\n<\/div>\n<\/div>\n<\/div><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:15%\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-5 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:15%\"><\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:70%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<h2 class=\"wp-block-heading\" id=\"projekty\">Outputs<\/h2>\n<\/div><\/div>\n\n\n\n<div class=\"wp-block-columns has-background is-layout-flex 
wp-container-core-columns-is-layout-4 wp-block-columns-is-layout-flex\" style=\"background:linear-gradient(135deg,rgb(238,238,238) 0%,rgb(245,245,245) 0%)\">\n<div class=\"wp-block-column is-vertically-aligned-top is-layout-flow wp-block-column-is-layout-flow\"><div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><a href=\"https:\/\/mta.pef.mendelu.cz\/trading\/#\/feed\"><img loading=\"lazy\" decoding=\"async\" width=\"184\" height=\"74\" src=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/sfa-1.png\" alt=\"\" class=\"wp-image-403\"\/><\/a><figcaption class=\"wp-element-caption\"><strong>Summary of Financial Articles<\/strong><br><br><\/figcaption><\/figure><\/div><\/div>\n\n\n\n<div class=\"wp-block-column is-vertically-aligned-top is-layout-flow wp-block-column-is-layout-flow\"><div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><a href=\"https:\/\/mta.pef.mendelu.cz\/apps\/#\/\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"428\" src=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/mta_black_large-1024x428-1.png\" alt=\"\" class=\"wp-image-404\" srcset=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/mta_black_large-1024x428-1.png 1024w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/mta_black_large-1024x428-1-300x125.png 300w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/mta_black_large-1024x428-1-768x321.png 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/a><figcaption class=\"wp-element-caption\"><strong>Tools and applications for natural language processing<\/strong><\/figcaption><\/figure><\/div><\/div>\n\n\n\n<div class=\"wp-block-column is-vertically-aligned-top is-layout-flow wp-block-column-is-layout-flow\"><div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><a href=\"https:\/\/mojilidi.cz\/cs\/search\"><img loading=\"lazy\" decoding=\"async\" width=\"366\" 
height=\"138\" src=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/logo-ml-1.png\" alt=\"\" class=\"wp-image-407\" srcset=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/logo-ml-1.png 366w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/logo-ml-1-300x113.png 300w\" sizes=\"auto, (max-width: 366px) 100vw, 366px\" \/><\/a><figcaption class=\"wp-element-caption\"><strong>Expert search<\/strong><br><\/figcaption><\/figure><\/div><\/div>\n\n\n\n<div class=\"wp-block-column is-vertically-aligned-top is-layout-flow wp-block-column-is-layout-flow\"><div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><a href=\"https:\/\/play.google.com\/store\/apps\/details?id=cz.product.reviews\"><img loading=\"lazy\" decoding=\"async\" width=\"180\" height=\"180\" src=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/opinio-1.png\" alt=\"\" class=\"wp-image-408\" style=\"width:135px;height:135px\" srcset=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/opinio-1.png 180w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/opinio-1-150x150.png 150w\" sizes=\"auto, (max-width: 180px) 100vw, 180px\" \/><\/a><figcaption class=\"wp-element-caption\"><strong>Opinio mobile application<\/strong><br><\/figcaption><\/figure><\/div><\/div>\n<\/div>\n<\/div><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:15px\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-6 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:15%\"><\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:70%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow 
wp-block-group-is-layout-flow\">\n<h2 class=\"wp-block-heading\" id=\"uzitecne\">Useful<\/h2>\n\n\n\n<p>In our work we use various tools for the preparation and subsequent analysis of text documents. These include lemmatization, stemming, stop word removal, sentiment identification, and more. You can also find our chatbot, which recommends mobile phones.<\/p>\n<\/div><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:15%\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-10 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:15%\"><\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:70%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<h2 class=\"wp-block-heading\" id=\"zpracovani-prirozeneho-jazyka\">Natural Language Processing<\/h2>\n\n\n\n<p>It is estimated that over 80% of data today is stored as text (newspaper articles, emails, blogs, Facebook posts, etc.) with little or no structure. The need for text data analysis is growing, and the field is becoming commercially very attractive. The goal of analytical tasks is to uncover previously unknown knowledge contained in this data using non-trivial methods. 
The results find applications in the fields of marketing, computer security, information services, literature search, human resource management, counter-terrorism, etc.<\/p>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-7 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-vertically-aligned-center is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:60%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<p>Working with text data is generally very difficult. The data is usually unstructured and has a completely different character than numerical data (complex grammar, different meanings of words, subjectivity, irony, etc.). Moreover, procedures that work satisfactorily for one domain may not work for another domain. We seek to deploy methods from the domain of Data Mining. This mature and well-developed discipline also focuses on finding hidden knowledge in data, but works with highly structured numerical data. It is therefore advantageous to prepare textual data in such a way that Data Mining methods are applicable to it. 
This requires the deployment of techniques from areas such as natural language processing, statistics, machine learning, linguistics and others.<\/p>\n<\/div><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-vertically-aligned-center is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:40%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\"><div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"809\" height=\"682\" src=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/online-article.png\" alt=\"\" class=\"wp-image-413\" srcset=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/online-article.png 809w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/online-article-300x253.png 300w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/online-article-768x647.png 768w\" sizes=\"auto, (max-width: 809px) 100vw, 809px\" \/><\/figure><\/div><\/div><\/div>\n<\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-8 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-vertically-aligned-center is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:40%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\"><div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"523\" height=\"383\" src=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/data-extraction.png\" alt=\"\" class=\"wp-image-415\" srcset=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/data-extraction.png 523w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/data-extraction-300x220.png 300w\" sizes=\"auto, (max-width: 523px) 100vw, 523px\" 
\/><\/figure><\/div><\/div><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-vertically-aligned-center is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:60%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<p>For text data analysis, the research mainly applies supervised machine learning (classification), unsupervised machine learning (clustering, attribute selection, association mining), semi-supervised learning, and combinations of these approaches. Research goals include categorizing text documents, retrieving documents based on similarity, discovering the semantics of groups of documents, finding attributes that convey meaning, sentiment analysis, and more. The specific characteristics and limitations of these tasks are taken into account, such as the small number of suitable examples, the huge volume and dimensionality of the data, the sparsity of the vectors representing the data, unbalanced classes, and multilinguality. These peculiarities are typical of datasets generated by users themselves on social networks, microblogs or discussion forums (as opposed to scientific papers or newspaper articles). 
Future research will also focus on analysing the relationships between textual data (news, economic summaries, social media posts and likes) and various economic phenomena such as stock price movements.<\/p>\n<\/div><\/div>\n<\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-9 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-vertically-aligned-center is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:50%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<p>In the course of the research, tools for data preprocessing are developed and deployed, including stemming, part-of-speech identification, spell checking, and more. A custom application with an optional graphical user interface is under continuous development to transform the data into a format suitable for machine learning algorithms and various software tools. 
The research also includes the application of professional commercial or open-source software (C5, Cluto, Weka, IBM SPSS Modeler and others).<\/p>\n<\/div><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-vertically-aligned-center is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:50%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\"><div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"856\" height=\"611\" src=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/data-processing.png\" alt=\"\" class=\"wp-image-416\" srcset=\"https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/data-processing.png 856w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/data-processing-300x214.png 300w, https:\/\/nlp.pef.mendelu.cz\/wp-content\/uploads\/2022\/06\/data-processing-768x548.png 768w\" sizes=\"auto, (max-width: 856px) 100vw, 856px\" \/><\/figure><\/div><\/div><\/div>\n<\/div>\n<\/div>\n<\/div><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:15%\"><\/div>\n<\/div>\n<\/div><\/div>\n\n\n\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-11 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:15%\"><\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:70%\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\"><\/div><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" 
style=\"flex-basis:15%\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-12 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:15%\"><\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:15px\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-13 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:15%\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-14 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:15%\"><\/div>\n<\/div>\n<\/div><\/div>\n","protected":false},"excerpt":{"rendered":"<p>Czech version Natural Language Processing Group Gathering and revealing the information Who we are? 
We are a working group dedicated [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-655","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/nlp.pef.mendelu.cz\/index.php\/wp-json\/wp\/v2\/pages\/655","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/nlp.pef.mendelu.cz\/index.php\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/nlp.pef.mendelu.cz\/index.php\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/nlp.pef.mendelu.cz\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/nlp.pef.mendelu.cz\/index.php\/wp-json\/wp\/v2\/comments?post=655"}],"version-history":[{"count":7,"href":"https:\/\/nlp.pef.mendelu.cz\/index.php\/wp-json\/wp\/v2\/pages\/655\/revisions"}],"predecessor-version":[{"id":669,"href":"https:\/\/nlp.pef.mendelu.cz\/index.php\/wp-json\/wp\/v2\/pages\/655\/revisions\/669"}],"wp:attachment":[{"href":"https:\/\/nlp.pef.mendelu.cz\/index.php\/wp-json\/wp\/v2\/media?parent=655"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}