Skip to main content

Showing 1–6 of 6 results for author: Gramacki, P

  1. SRAI: Towards Standardization of Geospatial AI

    Authors: Piotr Gramacki, Kacper Leśniara, Kamil Raczycki, Szymon Woźniak, Marcin Przymus, Piotr Szymański

    Abstract: Spatial Representations for Artificial Intelligence (srai) is a Python library for working with geospatial data. The library can download geospatial data, split a given area into micro-regions using multiple algorithms and train an embedding model using various architectures. It includes baseline models as well as more complex methods from published works. Those capabilities make it possible to us… ▽ More

    Submitted 23 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted for the 6th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery (GeoAI 2023)

  2. arXiv:2306.07902  [pdf, other

    cs.CL cs.AI

    Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted Sentiment Classification Benchmark

    Authors: Łukasz Augustyniak, Szymon Woźniak, Marcin Gruza, Piotr Gramacki, Krzysztof Rajda, Mikołaj Morzy, Tomasz Kajdanowicz

    Abstract: Despite impressive advancements in multilingual corpora collection and model training, developing large-scale deployments of multilingual models still presents a significant challenge. This is particularly true for language tasks that are culture-dependent. One such example is the area of multilingual sentiment analysis, where affective markers can be subtle and deeply ensconced in culture. This w… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: submitted to NeurIPS 2023 Datasets and Benchmarks track. Dataset: https://huggingface.co/datasets/Brand24/mms Code: https://github.com/Brand24-AI/mms_benchmark

  3. arXiv:2304.01922  [pdf, other

    cs.CL

    Resources and Few-shot Learners for In-context Learning in Slavic Languages

    Authors: Michal Štefánik, Marek Kadlčík, Piotr Gramacki, Petr Sojka

    Abstract: Despite the rapid recent progress in creating accurate and compact in-context learners, most recent work focuses on in-context learning (ICL) for tasks in English. However, the ability to interact with users of languages outside English presents a great potential for broadening the applicability of language technologies to non-English speakers. In this work, we collect the infrastructure necessa… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: EACL 2023 SlavicNLP Long Paper. New instructional templates and models are available on https://github.com/fewshot-goes-multilingual/slavic-incontext-learning

  4. arXiv:2204.04937  [pdf, other

    cs.CL cs.LG

    Assessment of Massively Multilingual Sentiment Classifiers

    Authors: Krzysztof Rajda, Łukasz Augustyniak, Piotr Gramacki, Marcin Gruza, Szymon Woźniak, Tomasz Kajdanowicz

    Abstract: Models are increasing in size and complexity in the hunt for SOTA. But what if those 2\% increase in performance does not make a difference in a production use case? Maybe benefits from a smaller, faster model outweigh those slight performance gains. Also, equally good performance across languages in multilingual tasks is more important than SOTA results on a single one. We present the biggest, un… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted for WASSA at ACL 2022

  5. arXiv:2111.02405  [pdf, other

    cs.LG

    Unsupervised embedding and similarity detection of microregions using public transport schedules

    Authors: Piotr Gramacki

    Abstract: The role of spatial data in tackling city-related tasks has been growing in recent years. To use them in machine learning models, it is often necessary to transform them into a vector representation, which has led to the development in the field of spatial data representation learning. There is also a growing variety of spatial data types for which representation learning methods are proposed. Pub… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: Master's thesis submitted at the Wrocław University of Science and Technology

  6. gtfs2vec -- Learning GTFS Embeddings for comparing Public Transport Offer in Microregions

    Authors: Piotr Gramacki, Szymon Woźniak, Piotr Szymański

    Abstract: We selected 48 European cities and gathered their public transport timetables in the GTFS format. We utilized Uber's H3 spatial index to divide each city into hexagonal micro-regions. Based on the timetables data we created certain features describing the quantity and variety of public transport availability in each region. Next, we trained an auto-associative deep neural network to embed each of… ▽ More

    Submitted 2 November, 2021; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: Accepted at 1st ACM SIGSPATIAL International Workshop on Searching and Mining Large Collections of Geospatial Data (GeoSearch 2021)