Showing 1–12 of 12 results for author: Vincent, S

  1. arXiv:2407.00108  [pdf, other]

    cs.LG cs.AI cs.CL cs.HC

    A Case Study on Contextual Machine Translation in a Professional Scenario of Subtitling

    Authors: Sebastian Vincent, Charlotte Prescott, Chris Bayliss, Chris Oakley, Carolina Scarton

    Abstract: Incorporating extra-textual context such as film metadata into the machine translation (MT) pipeline can enhance translation quality, as indicated by automatic evaluation in recent work. However, the positive impact of such systems in industry remains unproven. We report on an industrial case study carried out to investigate the benefit of MT in a professional scenario of translating TV subtitles…

    Submitted 27 June, 2024; originally announced July 2024.

    Comments: Accepted to EAMT 2024

  2. SwimXYZ: A large-scale dataset of synthetic swimming motions and videos

    Authors: Fiche Guénolé, Sevestre Vincent, Gonzalez-Barral Camila, Leglaive Simon, Séguier Renaud

    Abstract: Technologies play an increasingly important role in sports and become a real competitive advantage for the athletes who benefit from it. Among them, the use of motion capture is developing in various sports to optimize sporting gestures. Unfortunately, traditional motion capture systems are expensive and constraining. Recently developed computer vision-based approaches also struggle in certain spo…

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: ACM MIG 2023

  3. arXiv:2305.15904  [pdf, other]

    cs.CL cs.AI cs.LG

    MTCue: Learning Zero-Shot Control of Extra-Textual Attributes by Leveraging Unstructured Context in Neural Machine Translation

    Authors: Sebastian Vincent, Robert Flynn, Carolina Scarton

    Abstract: Efficient utilisation of both intra- and extra-textual context remains one of the critical gaps between machine and human translation. Existing research has primarily focused on providing individual, well-defined types of context in translation, such as the surrounding text or discrete external variables like the speaker's gender. This work introduces MTCue, a novel neural machine translation (NMT…

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings at ACL2023

  4. arXiv:2303.16618  [pdf, other]

    cs.CL cs.AI cs.LG

    Reference-less Analysis of Context Specificity in Translation with Personalised Language Models

    Authors: Sebastian Vincent, Alice Dowek, Rowanne Sumner, Charlotte Blundell, Emily Preston, Chris Bayliss, Chris Oakley, Carolina Scarton

    Abstract: Sensitising language models (LMs) to external context helps them to more effectively capture the speaking patterns of individuals with specific characteristics or in particular environments. This work investigates to what extent rich character and film annotations can be leveraged to personalise LMs in a scalable manner. We then explore the use of such models in evaluating context specificity in m…

    Submitted 5 March, 2024; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: Accepted to LREC-COLING 2024

  5. arXiv:2303.02028  [pdf, ps, other]

    cs.AI quant-ph

    Calibration of Quantum Decision Theory: Aversion to Large Losses and Predictability of Probabilistic Choices

    Authors: T. Kovalenko, S. Vincent, V. I. Yukalov, D. Sornette

    Abstract: We present the first calibration of quantum decision theory (QDT) to a dataset of binary risky choice. We quantitatively account for the fraction of choice reversals between two repetitions of the experiment, using a probabilistic choice formulation in the simplest form without model assumption or adjustable parameters. The prediction of choice reversal is then refined by introducing heterogeneity…

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: Latex file, 51 pages, 19 figures

    Journal ref: J. Phys. Complex. 4 (2023) 015009

  6. arXiv:2211.00718  [pdf]

    cs.CV cs.AI

    SleepyWheels: An Ensemble Model for Drowsiness Detection leading to Accident Prevention

    Authors: Jomin Jose, Andrew J, Kumudha Raimond, Shweta Vincent

    Abstract: Around 40 percent of accidents related to driving on highways in India occur due to the driver falling asleep behind the steering wheel. Several types of research are ongoing to detect driver drowsiness but they suffer from the complexity and cost of the models. In this paper, SleepyWheels a revolutionary method that uses a lightweight neural network in conjunction with facial landmark identificat…

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 20 pages

  7. arXiv:2205.05990  [pdf, other]

    cs.CL

    Controlling Formality in Low-Resource NMT with Domain Adaptation and Re-Ranking: SLT-CDT-UoS at IWSLT2022

    Authors: Sebastian T. Vincent, Loïc Barrault, Carolina Scarton

    Abstract: This paper describes the SLT-CDT-UoS group's submission to the first Special Task on Formality Control for Spoken Language Translation, part of the IWSLT 2022 Evaluation Campaign. Our efforts were split between two fronts: data engineering and altering the objective function for best hypothesis selection. We used language-independent methods to extract formal and informal sentence pairs from the p…

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: 8 pages, 10 figures, IWSLT22 camera-ready (system paper @ ACL-IWSLT Shared Task on Formality Control for Spoken Language Translation)

  8. arXiv:2205.04747  [pdf, other]

    cs.CL cs.AI

    Controlling Extra-Textual Attributes about Dialogue Participants -- A Case Study of English-to-Polish Neural Machine Translation

    Authors: Sebastian T. Vincent, Loïc Barrault, Carolina Scarton

    Abstract: Unlike English, morphologically rich languages can reveal characteristics of speakers or their conversational partners, such as gender and number, via pronouns, morphological endings of words and syntax. When translating from English to such languages, a machine translation model needs to opt for a certain interpretation of textual context, which may lead to serious translation errors if extra-tex…

    Submitted 30 May, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

    Comments: 9 pages, 9 figures, EAMT2022 camera-ready

    Journal ref: Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, p. 121-130, Ghent, Belgium, June 2022

  9. arXiv:2203.11383  [pdf, other]

    cs.IR cs.CY cs.LG

    DIANES: A DEI Audit Toolkit for News Sources

    Authors: Xiaoxiao Shang, Zhiyuan Peng, Qiming Yuan, Sabiq Khan, Lauren Xie, Yi Fang, Subramaniam Vincent

    Abstract: Professional news media organizations have always touted the importance that they give to multiple perspectives. However, in practice the traditional approach to all-sides has favored people in the dominant culture. Hence it has come under ethical critique under the new norms of diversity, equity, and inclusion (DEI). When DEI is applied to journalism, it goes beyond conventional notions of impart…

    Submitted 28 April, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

  10. arXiv:2102.10979  [pdf, other]

    cs.CL

    Towards Personalised and Document-level Machine Translation of Dialogue

    Authors: Sebastian T. Vincent

    Abstract: State-of-the-art (SOTA) neural machine translation (NMT) systems translate texts at sentence level, ignoring context: intra-textual information, like the previous sentence, and extra-textual information, like the gender of the speaker. Because of that, some sentences are translated incorrectly. Personalised NMT (PersNMT) and document-level NMT (DocNMT) incorporate this information into the transla…

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: Thesis Proposal, 6 pages, 7 figures, accepted to the EACL2021 Student Workshop

  11. arXiv:1805.06070  [pdf, ps, other]

    cs.CR

    A Survey of Intrusion Detection Systems Leveraging Host Data

    Authors: Tarrah R. Glass-Vanderlan, Michael D. Iannacone, Maria S. Vincent, Qian Chen, Robert A. Bridges

    Abstract: This survey focuses on intrusion detection systems (IDS) that leverage host-based data sources for detecting attacks on enterprise network. The host-based IDS (HIDS) literature is organized by the input data source, presenting targeted sub-surveys of HIDS research leveraging system logs, audit data, Windows Registry, file systems, and program analysis. While system calls are generally included in…

    Submitted 16 May, 2018; v1 submitted 15 May, 2018; originally announced May 2018.

  12. NECTAR: Non-Interactive Smart Contract Protocol using Blockchain Technology

    Authors: Alexandra Covaci, Simone Madeo, Patrick Motylinski, Stéphane Vincent

    Abstract: Blockchain-driven technologies are considered disruptive because of the availability of dis-intermediated, censorship-resistant and tamper-proof digital platforms of distributed trust. Among these technologies, smart contract platforms have the potential to take over functions usually done by intermediaries like banks, escrow or legal services. In this paper, we introduce a novel protocol aiming t…

    Submitted 13 March, 2018; originally announced March 2018.

    Comments: IEEE/ACM 1st International Workshop on Emerging Trends in Software Engineering for Blockchain (WETSEB 2018)