Showing 1–12 of 12 results for author: Min, L

  1. arXiv:2405.09901  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models

    Authors: Ziyu Wang, Lejun Min, Gus Xia

    Abstract: Recent deep music generation studies have put much emphasis on long-term generation with structures. However, we are yet to see high-quality, well-structured whole-song generation. In this paper, we make the first attempt to model a full music piece under the realization of compositional hierarchy. With a focus on symbolic representations of pop songs, we define a hierarchical language, in which e…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Proceedings of the International Conference on Learning Representations (ICLR 2024)

    MSC Class: 68Txx

  2. arXiv:2404.06393  [pdf, other]

    cs.SD cs.AI eess.AS

    MuPT: A Generative Symbolic Music Pretrained Transformer

    Authors: Xingwei Qu, Yuelin Bai, Yinghao Ma, Ziya Zhou, Ka Man Lo, Jiaheng Liu, Ruibin Yuan, Lejun Min, Xueling Liu, Tianyu Zhang, Xinrun Du, Shuyue Guo, Yiming Liang, Yizhi Li, Shangda Wu, Junting Zhou, Tianyu Zheng, Ziyang Ma, Fengze Han, Wei Xue, Gus Xia, Emmanouil Benetos, Xiang Yue, Chenghua Lin, Xu Tan, et al. (4 additional authors not shown)

    Abstract: In this paper, we explore the application of Large Language Models (LLMs) to the pre-training of music. While the prevalent use of MIDI in music modeling is well-established, our findings suggest that LLMs are inherently more compatible with ABC Notation, which aligns more closely with their design and strengths, thereby enhancing the model's performance in musical composition. To address the chal…

    Submitted 10 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  3. arXiv:2307.10304  [pdf, other]

    cs.SD cs.LG eess.AS

    Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls

    Authors: Lejun Min, Junyan Jiang, Gus Xia, Jingwei Zhao

    Abstract: We propose Polyffusion, a diffusion model that generates polyphonic music scores by regarding music as image-like piano roll representations. The model is capable of controllable music generation with two paradigms: internal control and external control. Internal control refers to the process in which users pre-define a part of the music and then let the model infill the rest, similar to the task…

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: In Proceedings of the 24th Conference of the International Society for Music Information Retrieval (ISMIR 2023), Milan, Italy

  4. arXiv:2307.03918  [pdf]

    cs.CV

    VS-TransGRU: A Novel Transformer-GRU-based Framework Enhanced by Visual-Semantic Fusion for Egocentric Action Anticipation

    Authors: Congqi Cao, Ze Sun, Qinyi Lv, Lingtong Min, Yanning Zhang

    Abstract: Egocentric action anticipation is a challenging task that aims to make advanced predictions of future actions from current and historical observations in the first-person view. Most existing methods focus on improving the model architecture and loss function based on the visual input and recurrent neural network to boost the anticipation performance. However, these methods, which merely consider v…

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: 12 pages, 7 figures

  5. arXiv:2307.02974  [pdf, other]

    cs.CV

    Cross-Spatial Pixel Integration and Cross-Stage Feature Fusion Based Transformer Network for Remote Sensing Image Super-Resolution

    Authors: Yuting Lu, Lingtong Min, Binglu Wang, Le Zheng, Xiaoxu Wang, Yongqiang Zhao, Teng Long

    Abstract: Remote sensing image super-resolution (RSISR) plays a vital role in enhancing spatial details and improving the quality of satellite imagery. Recently, Transformer-based models have shown competitive performance in RSISR. To mitigate the quadratic computational complexity resulting from global self-attention, various methods constrain attention to a local window, enhancing its efficiency. Conseque…

    Submitted 6 July, 2023; originally announced July 2023.

  6. arXiv:2112.09939  [pdf, other]

    cs.CL cs.AI cs.IR stat.AP

    Syntactic-GCN Bert based Chinese Event Extraction

    Authors: Jiangwei Liu, Jingshu Zhang, Xiaohong Huang, Liangyu Min

    Abstract: With the rapid development of information technology, online platforms (e.g., news portals and social media) generate enormous web information every moment. Therefore, it is crucial to extract structured representations of events from social streams. Generally, existing event extraction research utilizes pattern matching, machine learning, or deep learning methods to perform event extraction tasks…

    Submitted 18 December, 2021; originally announced December 2021.

    Comments: 9 pages, 4 figures, 3 tables. arXiv admin note: text overlap with arXiv:2111.03212

  7. arXiv:2111.11350  [pdf]

    cs.CV

    ShufaNet: Classification method for calligraphers who have reached the professional level

    Authors: Ge Yunfei, Diao Changyu, Li Min, Yu Ruohan, Qiu Linshan, Xu Duanqing

    Abstract: The authentication of calligraphy is a significant but difficult task in the realm of art, where the key problem is the few-shot classification of calligraphy. We propose a novel method, ShufaNet ("Shufa" is the pinyin of Chinese calligraphy), to classify Chinese calligraphers' styles based on metric learning in the case of few-shot, whose classification accuracy exceeds the level of students majoring…

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: 10 pages, 11 figures

  8. arXiv:2111.03212  [pdf, other]

    cs.CL cs.AI cs.LG

    An overview of event extraction and its applications

    Authors: Jiangwei Liu, Liangyu Min, Xiaohong Huang

    Abstract: With the rapid development of information technology, online platforms have produced enormous text resources. As a particular form of Information Extraction (IE), Event Extraction (EE) has gained increasing popularity due to its ability to automatically extract events from human language. However, there are limited literature surveys on event extraction. Existing review works either spend much eff…

    Submitted 4 November, 2021; originally announced November 2021.

  9. arXiv:2105.06284  [pdf, other]

    cs.IT eess.SP

    Ergodic Capacity of High Throughput Satellite Systems With Mixed FSO-RF Transmission

    Authors: Kong Huaicong, Lin Min, Wang Zining, Ouyang Jian, Cheng Julian

    Abstract: We study a high throughput satellite system, where the feeder link uses free-space optical (FSO) and the user link uses radio frequency (RF) communication. In particular, we first propose a transmit diversity using Alamouti space time block coding to mitigate the atmospheric turbulence in the feeder link. Then, based on the concept of average virtual signal-to-interference-plus-noise ratio and one…

    Submitted 13 May, 2021; originally announced May 2021.

  10. arXiv:2005.07225  [pdf, other]

    eess.IV cs.CV

    SAGE: Sequential Attribute Generator for Analyzing Glioblastomas using Limited Dataset

    Authors: Padmaja Jonnalagedda, Brent Weinberg, Jason Allen, Taejin L. Min, Shiv Bhanu, Bir Bhanu

    Abstract: While deep learning approaches have shown remarkable performance in many imaging tasks, most of these methods rely on availability of large quantities of data. Medical image data, however, is scarce and fragmented. Generative Adversarial Networks (GANs) have recently been very effective in handling such datasets by generating more data. If the datasets are very small, however, GANs cannot learn th…

    Submitted 3 June, 2022; v1 submitted 14 May, 2020; originally announced May 2020.

  11. arXiv:1704.03168  [pdf, other]

    cs.AR

    FMMU: A Hardware-Automated Flash Map Management Unit for Scalable Performance of NAND Flash-Based SSDs

    Authors: Yeong-Jae Woo, Sang Lyul Min

    Abstract: NAND flash-based Solid State Drives (SSDs), which are widely used from embedded systems to enterprise servers, are enhancing performance by exploiting the parallelism of NAND flash memories. To cope with the performance improvement of SSDs, storage systems have rapidly adopted the host interface for SSDs from Serial-ATA, which is used for existing hard disk drives, to high-speed PCI express. Since…

    Submitted 11 April, 2017; originally announced April 2017.

  12. arXiv:1612.04277  [pdf]

    cs.AR

    Copycat: A High Precision Real Time NAND Simulator

    Authors: Juyong Shin, Jongbo Bae, Ansu Na, Sang Lyul Min

    Abstract: In this paper, we describe the design and implementation of a high precision real time NAND simulator called Copycat that runs on a commodity multi-core desktop environment. This NAND simulator facilitates the development of embedded flash memory management software such as the flash translation layer (FTL). The simulator also allows a comprehensive fault injection for testing the reliability of t…

    Submitted 11 December, 2016; originally announced December 2016.