Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Piotr Bański is active.

Publication


Featured researches published by Piotr Bański.


linguistic annotation workshop | 2009

Stand-off TEI Annotation: the Case of the National Corpus of Polish

Piotr Bański; Adam Przepiórkowski

We present the annotation architecture of the National Corpus of Polish and discuss problems identified in the TEI stand-off annotation system, which, in its current version, is still very much unfinished and untested, due to both technical reasons (lack of tools implementing the TEI-defined XPointer schemes) and certain problems concerning data representation. We concentrate on two features that a stand-off system should possess and that are conspicuously missing in the current TEI Guidelines.


language and technology conference | 2009

Which XML standards for multilevel corpus annotation

Adam Przepiórkowski; Piotr Bański

The paper attempts to answer the question: Which XML standard(s) should be used for multilevel corpus annotation? Various more or less specific standards and best practices are reviewed: TEI P5, XCES, work within ISO TC37 / SC4, TIGER-XML and PAULA. The conclusion of the paper is that the approach with the best claim to following text encoding standards consists in 1) using TEI-conformant schemata that are 2) designed in a way compatible with other standards and data models.


Proceedings of the First Workshop on Language Technologies for African Languages | 2009

A Repository of Free Lexical Resources for African Languages: The Project and the Method

Piotr Bański; Beata Wójtowicz

We report on a project which we believe to have the potential to become home to, among others, bilingual dictionaries for African languages. Kept in a well-structured XML format with several possible degrees of conformance, the dictionaries will be able to get usable even in their early versions, which will be then subject to supervised improvement as user feedback accumulates. The project is FreeDict, part of SourceForge, a well-known Internet repository of open source content. We demonstrate a possible process of dictionary development on the example of one of FreeDict dictionaries, a Swahili-English dictionary that we maintain and have been developing through subsequent stages of increasing complexity and machine-processability. The aim of the paper is to show that even a small bilingual lexical resource can be submitted to this project and gradually developed into a machine-processable form that can then interact with other FreeDict resources. We also present the immediate benefits of locating bilingual African dictionaries in this project. We have found FreeDict to be a very promising project with a lot of potential, and the present paper is meant to spread the news about it, in the hope to create an active community of linguists and lexicographers of various backgrounds, where common research subprojects can be fruitfully carried out.


Proceedings of the workshop on challenges in the management of large corpora and big data and natural language processing (CMLC-5+BigNLP) 2017 including the papers from the web-as-corpus (WAC-XI) guest section. Birmingham, 24 july 2017. Edited by: Bański, Piotr <https://www.zora.uzh.ch/view/authors_for_linking_in_citation/Ba==0144ski=3APiotr=3A=3A.html>; Kupietz, Marc <https://www.zora.uzh.ch/view/authors_for_linking_in_citation/Kupietz=3AMarc=3A=3A.html>; Lüngen, Harald <https://www.zora.uzh.ch/view/authors_for_linking_in_citation/L=FCngen=3AHarald=3A=3A.html>; Rayson, Paul <https://www.zora.uzh.ch/view/authors_for_linking_in_citation/Rayson=3APaul=3A=3A.html>; Biber, Hanno <https://www.zora.uzh.ch/view/authors_for_linking_in_citation/Biber=3AHanno=3A=3A.html>; Breiteneder, Evelyn <https://www.zora.uzh.ch/view/authors_for_linking_in_citation/Breiteneder=3AEvelyn=3A=3A.html>; Clematide, Simon <https://www.zora.uzh.ch/view/authors_for_linking_in_citation/Clematide=3ASimon=3A=3A.html>; Mariani, John <https://www.zora.uzh.ch/view/authors_for_linking_in_citation/Mariani=3AJohn=3A=3A.html>; Stevenson, Mark <https://www.zora.uzh.ch/view/authors_for_linking_in_citation/Stevenson=3AMark=3A=3A.html>; Sick, Theresa <https://www.zora.uzh.ch/view/authors_for_linking_in_citation/Sick=3ATheresa=3A=3A.html> (2017). Mannheim: Institut für Deutsche Sprache. | 2017

Proceedings of the workshop on challenges in the management of large corpora and big data and natural language processing (CMLC-5+BigNLP) 2017 including the papers from the web-as-corpus (WAC-XI) guest section. Birmingham, 24 july 2017

Piotr Bański; Marc Kupietz; Harald Lüngen; Paul Rayson; Hanno Biber; Evelyn Breiteneder; Simon Clematide; John A. Mariani; Mark Stevenson; Theresa Sick

Many (modernist) works of literature can be understood by their associativeness, be it constructed or “free”. This network-like character of (modernist) literature has often been addressed by terms like “free association”, connotation”, “context” or “intertext”. This paper proposes an experimental and exemplary approach to intraconnect a literary corpus of the Austrian writer Ilse Aichinger with semantic webtechnologies to enable interactive explorations of word-associations.


language resources and evaluation | 2004

A Search Tool for Corpora with Positional Tagsets and Ambiguities

Adam Przepiórkowski; Zygmunt Krynicki; Lukasz Debowski; Marcin Woliński; Daniel Janus; Piotr Bański


7th Practical Applications in Language and Computers | 2011

XML Text Interchange Format in the National Corpus of Polish

Adam Przepiórkowski; Piotr Bański


language resources and evaluation | 2012

Evaluating Query Languages for a Corpus Processing System

Elena Frick; Carsten Schnober; Piotr Bański


language resources and evaluation | 2012

The New IDS Corpus Analysis Platform: Challenges and Prospects

Piotr Bański; Peter M. Fischer; Elena Frick; Erik Ketzan; Marc Kupietz; Carsten Schnober; Oliver Schonefeld; Andreas Witt


Archive | 2013

KorAP: the new corpus analysis platform at IDS Mannheim

Piotr Bański; Joachim Bingel; Nils Diewald; Elena Frick; Michael Hanl; Marc Kupietz; Piotr Pȩzik; Carsten Schnober; Andreas Witt


language resources and evaluation | 2014

Access control by query rewriting: the case of KorAP

Piotr Bański; Nils Diewald; Michael Hanl; Marc Kupietz; Andreas Witt

Collaboration


Dive into the Piotr Bański's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Laurent Romary

French Institute for Research in Computer Science and Automation

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Marcin Woliński

Polish Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Anna Kupść

Polish Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Elżbieta Hajnicz

Polish Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge