All publications
  2017 - Articles
Hoek, J., Zufferey, S.I., Evers-Vermeul, J. & Sanders, T.J.M. (2017). Cognitive complexity and the linguistic marking of coherence relations - A parallel corpus study. Journal of Pragmatics, 121, (pp. 113-131).
Evers-Vermeul, J., Hoek, J. & Scholman, M.C.J. (2017). On temporality in discourse annotation - Theoretical and practical considerations. Dialogue & DIscourse, 8 (2), (pp. 1-20) (20 p.).
Hoek, J., Evers-Vermeul, J. & Sanders, T.J.M. (13.01.2017). Segmenting Discourse - Incorporating interpretation into segmentation?. Corpus linguistics and Linguistic theory
  2017 - Articles in volumes / proceedings
Hoek, J. & Scholman, Merel C.J. (2017). Evaluating discourse annotation - Some recent insights and new approaches. Proceedings 13th Joint ISO - ACL Workshop on Interoperable Semantic Annotation (isa-13) (pp. 1-13).
  2016 - Book parts / chapters
Hoek, J. & de Hoop, H. (24.03.2016). On the optimal interpretation of yes and no in Dutch. In Géraldine Legendre, Michael T. Putnam, Henriëtte de Swart & Erin Zaroukian (Eds.), Optimality Theoretic Syntax, Semantics, and Pragmatics - From Uni- to Bidirectional Optimization (pp. 220-247) (28 p.). Oxford.
  2015 - Articles in volumes / proceedings
Hoek, Jet & Zufferey, Sandrine (2015). Factors influencing the implicitation of discourse relations across languages. In Harry Bunt (Eds.), Proceedings 11th Joint ACL - ISO Workshop on Interoperable Semantic Annotation (isa-11) (pp. 39-45). Tilburg: TiCC, Tilburg center for Cognition and Communication.
Hoek, J., Evers-Vermeul, J. & Sanders, T.J.M. (2015). The role of expectedness in the implicitation and explicitation of discourse relations. Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT) (pp. 41-46). Association for Computational Linguistics, Lisbon, Portugal, 17 September 2015..
^ top
Gegenereerd op 2018-02-20 22:07:09
Project:
TextLink: Structuring discourse in multilingual Europe
11.04.2014 to 10.04.2018
General project description 

EU-COST program for academic exchange


Effective discourse in any language is characterized by clear relations between sentences and coherent structure. But languages vary in how relations and structure are signalled. While monolingual dictionaries and grammars can characterise the words and sentences of a language and bilingual dictionaries can do the same between languages, there is nothing similar for discourse. For discourse, however, discourse-annotated corpora are becoming available in individual languages. The project will facilitate European multilingualism by (1) identifying and creating a portal into such resources within Europe - including annotation tools, search tools, and discourse-annotated corpora; (2) delineating the dimensions and properties of discourse annotation across corpora; (3) organising these properties into a sharable taxonomy; (4) encouraging the use of this taxonomy in subsequent discourse annotation and in cross-lingual search and studies of devices that relate and structure discourse; and (5) promoting use of the portal, its resources and sharable taxonomy. With partners from across Europe, TextLink will unify numerous but scattered linguistic resources on discourse structure. With its resources searchable by form and/or meaning and a source of valuable correspondences, TextLink will enhance the experience and performance of human translators, lexicographers, language technology and language learners alike.

Role Researcher Funding
Other
Project members UU

Completed projects

Project:
MODERN: Modeling discourse entities and relations for coherent machine translation 01.01.2014 to 01.01.2017
General project description

State-of-the-art machine translation (MT) systems, especially statistical but also rule-based
ones, operate in a sentence-by-sentence mode, and do not propagate information through the series of sentences that constitute texts. Such a propagation is however helpful, and sometimes even indispensable, to make correct translation choices for words and phrases that depend on previous ones. The goal of MODERN is to model and automatically detect such dependencies, and to study their integration within MT, with the aim of demonstrating improvement in translation quality.
The focus of MODERN is on the interplay between referring expressions such as noun
phrases and pronouns, which must be coherently translated throughout a text, and discourse relations between sentences, which are often conveyed by explicit connectives that are notoriously difficult to translate. MODERN will study joint computational models of discourse entities and discourse relations in texts, based on linguistic theories and experimental grounding, and the inclusion in such models of automatically generated domain-knowledge related to the discourse entities. MODERN will design and implement these probabilistic models, and integrate them with operational MT systems, both rule-based (Apertium) and statistical (Moses).
Particular attention will be paid to the evaluation of MT improvement, studying the e ect on
human readers of various translation options for discourse entities and connectives, and aiming to optimize MT output in this respect. The MODERN project will focus on four languages {English, French, German and Dutch} for which the partners have considerable expertise. Two domains will be used as case studies: Alpine texts from a multilingual corpus of Alpine Club yearbooks (Text+Berg) and texts on environmental legislation and debates extracted from the JRC-Acquis, DGT-Acquis, and Europarl parallel corpora.

 
Role Researcher Funding
External funding
Project members UU
Gegenereerd op 2018-02-20 22:07:10
Full name
J. Hoek MA Contact details
Transcomplex

Trans 10
Room 1.61
3512 JK  UTRECHT
The Netherlands


Phone number (direct) +31 30 253 8695
E-mail
J.Hoek@uu.nl
Gegenereerd op 2018-02-20 22:07:10
Last updated 29.01.2018