-
Module control of network analysis in psychopathology
Authors:
Chunyu Pan,
Quan Zhang,
Yue Zhu,
Shengzhou Kong,
Juan Liu,
Changsheng Zhang,
Fei Wang,
Xizhe Zhang
Abstract:
The network approach to characterizing psychopathology departs from traditional latent categorical and dimensional approaches. Causal interplay among symptoms contributed to dynamic psychopathology system. Therefore, analyzing the symptom clusters is critical for understanding mental disorders. Furthermore, despite extensive research studying the topological features of symptom networks, the contr…
▽ More
The network approach to characterizing psychopathology departs from traditional latent categorical and dimensional approaches. Causal interplay among symptoms contributed to dynamic psychopathology system. Therefore, analyzing the symptom clusters is critical for understanding mental disorders. Furthermore, despite extensive research studying the topological features of symptom networks, the control relationships between symptoms remain largely unclear. Here, we present a novel systematizing concept, module control, to analyze the control principle of the symptom network at a module level. We introduce Module Control Network (MCN) to identify key modules that regulate the network's behavior. By applying our approach to a multivariate psychological dataset, we discover that non-emotional modules, such as sleep-related and stress-related modules, are the primary controlling modules in the symptom network. Our findings indicate that module control can expose central symptom cluster governing psychopathology network, offering novel insights into the underlying mechanisms of mental disorders and individualized approach to psychological interventions.
△ Less
Submitted 30 May, 2024;
originally announced July 2024.
-
SynAsk: Unleashing the Power of Large Language Models in Organic Synthesis
Authors:
Chonghuan Zhang,
Qianghua Lin,
Biwei Zhu,
Haopeng Yang,
Xiao Lian,
Hao Deng,
Jiajun Zheng,
Kuangbiao Liao
Abstract:
The field of natural language processing (NLP) has witnessed a transformative shift with the emergence of large language models (LLMs), revolutionizing various language tasks and applications, and the integration of LLM into specialized domains enhances their capabilities for domain-specific applications. Notably, NLP has made significant strides in organic chemistry, particularly in predicting sy…
▽ More
The field of natural language processing (NLP) has witnessed a transformative shift with the emergence of large language models (LLMs), revolutionizing various language tasks and applications, and the integration of LLM into specialized domains enhances their capabilities for domain-specific applications. Notably, NLP has made significant strides in organic chemistry, particularly in predicting synthetic tasks, paving the way for the development of LLMs tailored to the organic chemistry field. In this work, we introduce SynAsk, a comprehensive organic chemistry domain-specific LLM platform developed by AIChemEco Inc. By finetuning an LLM with domain-specific data and integrating it with a chain of thought approach, SynAsk seamlessly accesses our knowledge base and advanced chemistry tools in a question-and-answer format. This includes functionalities such as a basic chemistry knowledge base, molecular information retrieval, reaction performance prediction, retrosynthesis prediction, chemical literature acquisition, and more. This novel methodology synergizes fine-tuning techniques with external resource integration, resulting in an organic chemistry-specific model poised to facilitate research and discovery in the field. Accessible via http://synask.aichemeco.com, SynAsk represents a significant advancement in leveraging NLP for synthetic applications.
△ Less
Submitted 13 June, 2024; v1 submitted 6 June, 2024;
originally announced June 2024.
-
Kirigami: large convolutional kernels improve deep learning-based RNA secondary structure prediction
Authors:
Marc Harary,
Chengxin Zhang
Abstract:
We introduce a novel fully convolutional neural network (FCN) architecture for predicting the secondary structure of ribonucleic acid (RNA) molecules. Interpreting RNA structures as weighted graphs, we employ deep learning to estimate the probability of base pairing between nucleotide residues. Unique to our model are its massive 11-pixel kernels, which we argue provide a distinct advantage for FC…
▽ More
We introduce a novel fully convolutional neural network (FCN) architecture for predicting the secondary structure of ribonucleic acid (RNA) molecules. Interpreting RNA structures as weighted graphs, we employ deep learning to estimate the probability of base pairing between nucleotide residues. Unique to our model are its massive 11-pixel kernels, which we argue provide a distinct advantage for FCNs on the specialized domain of RNA secondary structures. On a widely adopted, standardized test set comprised of 1,305 molecules, the accuracy of our method exceeds that of current state-of-the-art (SOTA) secondary structure prediction software, achieving a Matthews Correlation Coefficient (MCC) over 11-40% higher than that of other leading methods on overall structures and 58-400% higher on pseudoknots specifically.
△ Less
Submitted 6 June, 2024; v1 submitted 4 June, 2024;
originally announced June 2024.
-
Augmentation-based Unsupervised Cross-Domain Functional MRI Adaptation for Major Depressive Disorder Identification
Authors:
Yunling Ma,
Chaojun Zhang,
Xiaochuan Wang,
Qianqian Wang,
Liang Cao,
Limei Zhang,
Mingxia Liu
Abstract:
Major depressive disorder (MDD) is a common mental disorder that typically affects a person's mood, cognition, behavior, and physical health. Resting-state functional magnetic resonance imaging (rs-fMRI) data are widely used for computer-aided diagnosis of MDD. While multi-site fMRI data can provide more data for training reliable diagnostic models, significant cross-site data heterogeneity would…
▽ More
Major depressive disorder (MDD) is a common mental disorder that typically affects a person's mood, cognition, behavior, and physical health. Resting-state functional magnetic resonance imaging (rs-fMRI) data are widely used for computer-aided diagnosis of MDD. While multi-site fMRI data can provide more data for training reliable diagnostic models, significant cross-site data heterogeneity would result in poor model generalizability. Many domain adaptation methods are designed to reduce the distributional differences between sites to some extent, but usually ignore overfitting problem of the model on the source domain. Intuitively, target data augmentation can alleviate the overfitting problem by forcing the model to learn more generalized features and reduce the dependence on source domain data. In this work, we propose a new augmentation-based unsupervised cross-domain fMRI adaptation (AUFA) framework for automatic diagnosis of MDD. The AUFA consists of 1) a graph representation learning module for extracting rs-fMRI features with spatial attention, 2) a domain adaptation module for feature alignment between source and target data, 3) an augmentation-based self-optimization module for alleviating model overfitting on the source domain, and 4) a classification module. Experimental results on 1,089 subjects suggest that AUFA outperforms several state-of-the-art methods in MDD identification. Our approach not only reduces data heterogeneity between different sites, but also localizes disease-related functional connectivity abnormalities and provides interpretability for the model.
△ Less
Submitted 6 June, 2024; v1 submitted 31 May, 2024;
originally announced June 2024.
-
Target-Specific De Novo Peptide Binder Design with DiffPepBuilder
Authors:
Fanhao Wang,
Yuzhe Wang,
Laiyi Feng,
Changsheng Zhang,
Luhua Lai
Abstract:
Despite the exciting progress in target-specific de novo protein binder design, peptide binder design remains challenging due to the flexibility of peptide structures and the scarcity of protein-peptide complex structure data. In this study, we curated a large synthetic dataset, referred to as PepPC-F, from the abundant protein-protein interface data and developed DiffPepBuilder, a de novo target-…
▽ More
Despite the exciting progress in target-specific de novo protein binder design, peptide binder design remains challenging due to the flexibility of peptide structures and the scarcity of protein-peptide complex structure data. In this study, we curated a large synthetic dataset, referred to as PepPC-F, from the abundant protein-protein interface data and developed DiffPepBuilder, a de novo target-specific peptide binder generation method that utilizes an SE(3)-equivariant diffusion model trained on PepPC-F to co-design peptide sequences and structures. DiffPepBuilder also introduces disulfide bonds to stabilize the generated peptide structures. We tested DiffPepBuilder on 30 experimentally verified strong peptide binders with available protein-peptide complex structures. DiffPepBuilder was able to effectively recall the native structures and sequences of the peptide ligands and to generate novel peptide binders with improved binding free energy. We subsequently conducted de novo generation case studies on three targets. In both the regeneration test and case studies, DiffPepBuilder outperformed AfDesign and RFdiffusion coupled with ProteinMPNN, in terms of sequence and structure recall, interface quality, and structural diversity. Molecular dynamics simulations confirmed that the introduction of disulfide bonds enhanced the structural rigidity and binding performance of the generated peptides. As a general peptide binder de novo design tool, DiffPepBuilder can be used to design peptide binders for given protein targets with three dimensional and binding site information.
△ Less
Submitted 30 April, 2024;
originally announced May 2024.
-
BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers
Authors:
Ran Xu,
Wenqi Shi,
Yue Yu,
Yuchen Zhuang,
Yanqiao Zhu,
May D. Wang,
Joyce C. Ho,
Chao Zhang,
Carl Yang
Abstract:
Developing effective biomedical retrieval models is important for excelling at knowledge-intensive biomedical tasks but still challenging due to the deficiency of sufficient publicly annotated biomedical data and computational resources. We present BMRetriever, a series of dense retrievers for enhancing biomedical retrieval via unsupervised pre-training on large biomedical corpora, followed by ins…
▽ More
Developing effective biomedical retrieval models is important for excelling at knowledge-intensive biomedical tasks but still challenging due to the deficiency of sufficient publicly annotated biomedical data and computational resources. We present BMRetriever, a series of dense retrievers for enhancing biomedical retrieval via unsupervised pre-training on large biomedical corpora, followed by instruction fine-tuning on a combination of labeled datasets and synthetic pairs. Experiments on 5 biomedical tasks across 11 datasets verify BMRetriever's efficacy on various biomedical applications. BMRetriever also exhibits strong parameter efficiency, with the 410M variant outperforming baselines up to 11.7 times larger, and the 2B variant matching the performance of models with over 5B parameters. The training data and model checkpoints are released at \url{https://huggingface.co/BMRetriever} to ensure transparency, reproducibility, and application to new domains.
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
Emergence of cooperation under punishment: A reinforcement learning perspective
Authors:
Chenyang Zhao,
Guozhong Zheng,
Chun Zhang,
Jiqiang Zhang,
Li Chen
Abstract:
Punishment is a common tactic to sustain cooperation and has been extensively studied for a long time. While most of previous game-theoretic work adopt the imitation learning where players imitate the strategies who are better off, the learning logic in the real world is often much more complex. In this work, we turn to the reinforcement learning paradigm, where individuals make their decisions ba…
▽ More
Punishment is a common tactic to sustain cooperation and has been extensively studied for a long time. While most of previous game-theoretic work adopt the imitation learning where players imitate the strategies who are better off, the learning logic in the real world is often much more complex. In this work, we turn to the reinforcement learning paradigm, where individuals make their decisions based upon their past experience and long-term returns. Specifically, we investigate the Prisoners' dilemma game with Q-learning algorithm, and cooperators probabilistically pose punishment on defectors in their neighborhood. Interestingly, we find that punishment could lead to either continuous or discontinuous cooperation phase transitions, and the nucleation process of cooperation clusters is reminiscent of the liquid-gas transition. The uncovered first-order phase transition indicates that great care needs to be taken when implementing the punishment compared to the continuous scenario.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
Bio-Image Informatics Index BIII: A unique database of image analysis tools and workflows for and by the bioimaging community
Authors:
Chong Zhang,
Alban Gaignard,
Matus Kalas,
Florian Levet,
Felipe Delestro,
Joakim Lindblad,
Natasa Sladoje,
Laure Plantard,
Alain Latour,
Robert Haase,
Gabriel Martins,
Paula Sampaio,
Leandro Scholz,
NEUBIAS taggers,
Sébastien Tosi,
Kota Miura,
Julien Colombelli,
Perrine Paul-Gilloteaux
Abstract:
Bio image analysis has recently become one keystone of biological research but biologists tend to get lost in a plethora of available software and the way to adjust available tools to their own image analysis problem. We present BIII, BioImage Informatic Index (www.biii.eu), the result of the first large community effort to bridge the communities of algorithm and software developers, bioimage anal…
▽ More
Bio image analysis has recently become one keystone of biological research but biologists tend to get lost in a plethora of available software and the way to adjust available tools to their own image analysis problem. We present BIII, BioImage Informatic Index (www.biii.eu), the result of the first large community effort to bridge the communities of algorithm and software developers, bioimage analysts and biologists, under the form of a web-based knowledge database crowdsourced by these communities. Software tools (> 1300), image databases for benchmarking (>20) and training materials (>70) for bio image analysis are referenced and curated following standards constructed by the community and then reaching a broader audience. Software tools are organized as full protocol of analysis (workflow), specific brick (component) to construct a workflow, or software platform or library (collection). They are described using Edam Bio Imaging, which is iteratively defined using this website. All entries are exposed following FAIR principles and accessible for other usage.
△ Less
Submitted 18 December, 2023;
originally announced December 2023.
-
Transfer Learning across Different Chemical Domains: Virtual Screening of Organic Materials with Deep Learning Models Pretrained on Small Molecule and Chemical Reaction Data
Authors:
Chengwei Zhang,
Yushuang Zhai,
Ziyang Gong,
Hongliang Duan,
Yuan-Bin She,
Yun-Fang Yang,
An Su
Abstract:
Machine learning is becoming a preferred method for the virtual screening of organic materials due to its cost-effectiveness over traditional computationally demanding techniques. However, the scarcity of labeled data for organic materials poses a significant challenge for training advanced machine learning models. This study showcases the potential of utilizing databases of drug-like small molecu…
▽ More
Machine learning is becoming a preferred method for the virtual screening of organic materials due to its cost-effectiveness over traditional computationally demanding techniques. However, the scarcity of labeled data for organic materials poses a significant challenge for training advanced machine learning models. This study showcases the potential of utilizing databases of drug-like small molecules and chemical reactions to pretrain the BERT model, enhancing its performance in the virtual screening of organic materials. By fine-tuning the BERT models with data from five virtual screening tasks, the version pretrained with the USPTO-SMILES dataset achieved R2 scores exceeding 0.94 for three tasks and over 0.81 for two others. This performance surpasses that of models pretrained on the small molecule or organic materials databases and outperforms three traditional machine learning models trained directly on virtual screening data. The success of the USPTO-SMILES pretrained BERT model can be attributed to the diverse array of organic building blocks in the USPTO database, offering a broader exploration of the chemical space. The study further suggests that accessing a reaction database with a wider range of reactions than the USPTO could further enhance model performance. Overall, this research validates the feasibility of applying transfer learning across different chemical domains for the efficient virtual screening of organic materials.
△ Less
Submitted 5 March, 2024; v1 submitted 30 November, 2023;
originally announced November 2023.
-
ViDa: Visualizing DNA hybridization trajectories with biophysics-informed deep graph embeddings
Authors:
Chenwei Zhang,
Jordan Lovrod,
Boyan Beronov,
Khanh Dao Duc,
Anne Condon
Abstract:
Visualization tools can help synthetic biologists and molecular programmers understand the complex reactive pathways of nucleic acid reactions, which can be designed for many potential applications and can be modelled using a continuous-time Markov chain (CTMC). Here we present ViDa, a new visualization approach for DNA reaction trajectories that uses a 2D embedding of the secondary structure stat…
▽ More
Visualization tools can help synthetic biologists and molecular programmers understand the complex reactive pathways of nucleic acid reactions, which can be designed for many potential applications and can be modelled using a continuous-time Markov chain (CTMC). Here we present ViDa, a new visualization approach for DNA reaction trajectories that uses a 2D embedding of the secondary structure state space underlying the CTMC model. To this end, we integrate a scattering transform of the secondary structure adjacency, a variational autoencoder, and a nonlinear dimensionality reduction method. We augment the training loss with domain-specific supervised terms that capture both thermodynamic and kinetic features. We assess ViDa on two well-studied DNA hybridization reactions. Our results demonstrate that the domain-specific features lead to significant quality improvements over the state-of-the-art in DNA state space visualization, successfully separating different folding pathways and thus providing useful insights into dominant reaction mechanisms.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
Visualizing DNA reaction trajectories with deep graph embedding approaches
Authors:
Chenwei Zhang,
Khanh Dao Duc,
Anne Condon
Abstract:
Synthetic biologists and molecular programmers design novel nucleic acid reactions, with many potential applications. Good visualization tools are needed to help domain experts make sense of the complex outputs of folding pathway simulations of such reactions. Here we present ViDa, a new approach for visualizing DNA reaction folding trajectories over the energy landscape of secondary structures. W…
▽ More
Synthetic biologists and molecular programmers design novel nucleic acid reactions, with many potential applications. Good visualization tools are needed to help domain experts make sense of the complex outputs of folding pathway simulations of such reactions. Here we present ViDa, a new approach for visualizing DNA reaction folding trajectories over the energy landscape of secondary structures. We integrate a deep graph embedding model with common dimensionality reduction approaches, to map high-dimensional data onto 2D Euclidean space. We assess ViDa on two well-studied and contrasting DNA hybridization reactions. Our preliminary results suggest that ViDa's visualization successfully separates trajectories with different folding mechanisms, thereby providing useful insight to users, and is a big improvement over the current state-of-the-art in DNA kinetics visualization.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
EMPOT: partial alignment of density maps and rigid body fitting using unbalanced Gromov-Wasserstein divergence
Authors:
Aryan Tajmir Riahi,
Chenwei Zhang,
James Chen,
Anne Condon,
Khanh Dao Duc
Abstract:
Aligning EM density maps and fitting atomic models are essential steps in single particle cryogenic electron microscopy (cryo-EM), with recent methods leveraging various algorithms and machine learning tools. As aligning maps remains challenging in the presence of a map that only partially fits the other (e.g. one subunit), we here propose a new procedure, EMPOT (EM Partial alignment with Optimal…
▽ More
Aligning EM density maps and fitting atomic models are essential steps in single particle cryogenic electron microscopy (cryo-EM), with recent methods leveraging various algorithms and machine learning tools. As aligning maps remains challenging in the presence of a map that only partially fits the other (e.g. one subunit), we here propose a new procedure, EMPOT (EM Partial alignment with Optimal Transport), for partial alignment of 3D maps. EMPOT first finds a coupling between 3D point-cloud representations, which is associated with their so-called unbalanced Gromov Wasserstein divergence, and second, uses this coupling to find an optimal rigid body transformation. Upon running and benchmarking our method with experimental maps and structures, we show that EMPOT outperforms standard methods for aligning subunits of a protein complex and fitting atomic models to a density map, suggesting potential applications of Partial Optimal Transport for improving Cryo-EM pipelines.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
AI for Mathematics: A Cognitive Science Perspective
Authors:
Cedegao E. Zhang,
Katherine M. Collins,
Adrian Weller,
Joshua B. Tenenbaum
Abstract:
Mathematics is one of the most powerful conceptual systems developed and used by the human species. Dreams of automated mathematicians have a storied history in artificial intelligence (AI). Rapid progress in AI, particularly propelled by advances in large language models (LLMs), has sparked renewed, widespread interest in building such systems. In this work, we reflect on these goals from a \text…
▽ More
Mathematics is one of the most powerful conceptual systems developed and used by the human species. Dreams of automated mathematicians have a storied history in artificial intelligence (AI). Rapid progress in AI, particularly propelled by advances in large language models (LLMs), has sparked renewed, widespread interest in building such systems. In this work, we reflect on these goals from a \textit{cognitive science} perspective. We call attention to several classical and ongoing research directions from cognitive science, which we believe are valuable for AI practitioners to consider when seeking to build truly human (or superhuman)-level mathematical systems. We close with open discussions and questions that we believe necessitate a multi-disciplinary perspective -- cognitive scientists working in tandem with AI researchers and mathematicians -- as we move toward better mathematical AI systems which not only help us push the frontier of the mathematics, but also offer glimpses into how we as humans are even capable of such great cognitive feats.
△ Less
Submitted 18 October, 2023;
originally announced October 2023.
-
STW-MD: A Novel Spatio-Temporal Weighting and Multi-Step Decision Tree Method for Considering Spatial Heterogeneity in Brain Gene Expression Data
Authors:
Shanjun Mao,
Xiao Huang,
Runjiu Chen,
Chenyang Zhang,
Yizhu Diao,
Zongjin Li,
Qingzhe Wang,
Shan Tang,
Shuixia Guo
Abstract:
Motivation: Gene expression during brain development or abnormal development is a biological process that is highly dynamic in spatio and temporal. Due to the lack of comprehensive integration of spatial and temporal dimensions of brain gene expression data, previous studies have mainly focused on individual brain regions or a certain developmental stage. Our motivation is to address this gap by i…
▽ More
Motivation: Gene expression during brain development or abnormal development is a biological process that is highly dynamic in spatio and temporal. Due to the lack of comprehensive integration of spatial and temporal dimensions of brain gene expression data, previous studies have mainly focused on individual brain regions or a certain developmental stage. Our motivation is to address this gap by incorporating spatio-temporal information to gain a more complete understanding of the mechanisms underlying brain development or disorders associated with abnormal brain development, such as Alzheimer's disease (AD), and to identify potential determinants of response.
Results: In this study, we propose a novel two-step framework based on spatial-temporal information weighting and multi-step decision trees. This framework can effectively exploit the spatial similarity and temporal dependence between different stages and different brain regions, and facilitate differential gene analysis in brain regions with high heterogeneity. We focus on two datasets: the AD dataset, which includes gene expression data from early, middle, and late stages, and the brain development dataset, spanning fetal development to adulthood. Our findings highlight the advantages of the proposed framework in discovering gene classes and elucidating their impact on brain development and AD progression across diverse brain regions and stages. These findings align with existing studies and provide insights into the processes of normal and abnormal brain development.
Availability: The code of STW-MD is available at https://github.com/tsnm1/STW-MD.
△ Less
Submitted 18 October, 2023;
originally announced October 2023.
-
ARTree: A Deep Autoregressive Model for Phylogenetic Inference
Authors:
Tianyu Xie,
Cheng Zhang
Abstract:
Designing flexible probabilistic models over tree topologies is important for developing efficient phylogenetic inference methods. To do that, previous works often leverage the similarity of tree topologies via hand-engineered heuristic features which would require pre-sampled tree topologies and may suffer from limited approximation capability. In this paper, we propose a deep autoregressive mode…
▽ More
Designing flexible probabilistic models over tree topologies is important for developing efficient phylogenetic inference methods. To do that, previous works often leverage the similarity of tree topologies via hand-engineered heuristic features which would require pre-sampled tree topologies and may suffer from limited approximation capability. In this paper, we propose a deep autoregressive model for phylogenetic inference based on graph neural networks (GNNs), called ARTree. By decomposing a tree topology into a sequence of leaf node addition operations and modeling the involved conditional distributions based on learnable topological features via GNNs, ARTree can provide a rich family of distributions over the entire tree topology space that have simple sampling algorithms and density estimation procedures, without using heuristic features. We demonstrate the effectiveness and efficiency of our method on a benchmark of challenging real data tree topology density estimation and variational Bayesian phylogenetic inference problems.
△ Less
Submitted 14 October, 2023;
originally announced October 2023.
-
May the Force be with You: Unified Force-Centric Pre-Training for 3D Molecular Conformations
Authors:
Rui Feng,
Qi Zhu,
Huan Tran,
Binghong Chen,
Aubrey Toland,
Rampi Ramprasad,
Chao Zhang
Abstract:
Recent works have shown the promise of learning pre-trained models for 3D molecular representation. However, existing pre-training models focus predominantly on equilibrium data and largely overlook off-equilibrium conformations. It is challenging to extend these methods to off-equilibrium data because their training objective relies on assumptions of conformations being the local energy minima. W…
▽ More
Recent works have shown the promise of learning pre-trained models for 3D molecular representation. However, existing pre-training models focus predominantly on equilibrium data and largely overlook off-equilibrium conformations. It is challenging to extend these methods to off-equilibrium data because their training objective relies on assumptions of conformations being the local energy minima. We address this gap by proposing a force-centric pretraining model for 3D molecular conformations covering both equilibrium and off-equilibrium data. For off-equilibrium data, our model learns directly from their atomic forces. For equilibrium data, we introduce zero-force regularization and forced-based denoising techniques to approximate near-equilibrium forces. We obtain a unified pre-trained model for 3D molecular representation with over 15 million diverse conformations. Experiments show that, with our pre-training objective, we increase forces accuracy by around 3 times compared to the un-pre-trained Equivariant Transformer model. By incorporating regularizations on equilibrium data, we solved the problem of unstable MD simulations in vanilla Equivariant Transformers, achieving state-of-the-art simulation performance with 2.45 times faster inference time than NequIP. As a powerful molecular encoder, our pre-trained model achieves on-par performance with state-of-the-art property prediction tasks.
△ Less
Submitted 23 August, 2023;
originally announced August 2023.
-
Predicting Drug Solubility Using Different Machine Learning Methods -- Linear Regression Model with Extracted Chemical Features vs Graph Convolutional Neural Network
Authors:
John Ho,
Zhao-Heng Yin,
Colin Zhang,
Nicole Guo,
Yang Ha
Abstract:
Predicting the solubility of given molecules remains crucial in the pharmaceutical industry. In this study, we revisited this extensively studied topic, leveraging the capabilities of contemporary computing resources. We employed two machine learning models: a linear regression model and a graph convolutional neural network (GCNN) model, using various experimental datasets. Both methods yielded re…
▽ More
Predicting the solubility of given molecules remains crucial in the pharmaceutical industry. In this study, we revisited this extensively studied topic, leveraging the capabilities of contemporary computing resources. We employed two machine learning models: a linear regression model and a graph convolutional neural network (GCNN) model, using various experimental datasets. Both methods yielded reasonable predictions, with the GCNN model exhibiting the highest level of performance. However, the present GCNN model has limited interpretability while the linear regression model allows scientists for a greater in-depth analysis of the underlying factors through feature importance analysis, although more human inputs and evaluations on the overall dataset is required. From the perspective of chemistry, using the linear regression model, we elucidated the impact of individual atom species and functional groups on overall solubility, highlighting the significance of comprehending how chemical structure influences chemical properties in the drug development process. It is learned that introducing oxygen atoms can increase the solubility of organic molecules, while almost all other hetero atoms except oxygen and nitrogen tend to decrease solubility.
△ Less
Submitted 4 January, 2024; v1 submitted 23 August, 2023;
originally announced August 2023.
-
STGIC: a graph and image convolution-based method for spatial transcriptomic clustering
Authors:
Chen Zhang,
Junhui Gao,
Lingxin Kong,
Guangshuo cao,
Xiangyu Guo,
Wei Liu
Abstract:
Spatial transcriptomic (ST) clustering employs spatial and transcription information to group spots spatially coherent and transcriptionally similar together into the same spatial domain. Graph convolution network (GCN) and graph attention network (GAT), fed with spatial coordinates derived adjacency and transcription profile derived feature matrix are often used to solve the problem. Our proposed…
▽ More
Spatial transcriptomic (ST) clustering employs spatial and transcription information to group spots spatially coherent and transcriptionally similar together into the same spatial domain. Graph convolution network (GCN) and graph attention network (GAT), fed with spatial coordinates derived adjacency and transcription profile derived feature matrix are often used to solve the problem. Our proposed method STGIC (spatial transcriptomic clustering with graph and image convolution) utilizes an adaptive graph convolution (AGC) to get high quality pseudo-labels and then resorts to dilated convolution framework (DCF) for virtual image converted from gene expression information and spatial coordinates of spots. The dilation rates and kernel sizes are set appropriately and updating of weight values in the kernels is made to be subject to the spatial distance from the position of corresponding elements to kernel centers so that feature extraction of each spot is better guided by spatial distance to neighbor spots. Self-supervision realized by KL-divergence, spatial continuity loss and cross entropy calculated among spots with high confidence pseudo-labels make up the training objective of DCF. STGIC attains state-of-the-art (SOTA) clustering performance on the benchmark dataset of human dorsolateral prefrontal cortex (DLPFC). Besides, it's capable of depicting fine structures of other tissues from other species as well as guiding the identification of marker genes. Also, STGIC is expandable to Stereo-seq data with high spatial resolution.
△ Less
Submitted 23 October, 2023; v1 submitted 19 March, 2023;
originally announced March 2023.
-
Learnable Topological Features for Phylogenetic Inference via Graph Neural Networks
Authors:
Cheng Zhang
Abstract:
Structural information of phylogenetic tree topologies plays an important role in phylogenetic inference. However, finding appropriate topological structures for specific phylogenetic inference tasks often requires significant design effort and domain expertise. In this paper, we propose a novel structural representation method for phylogenetic inference based on learnable topological features. By…
▽ More
Structural information of phylogenetic tree topologies plays an important role in phylogenetic inference. However, finding appropriate topological structures for specific phylogenetic inference tasks often requires significant design effort and domain expertise. In this paper, we propose a novel structural representation method for phylogenetic inference based on learnable topological features. By combining the raw node features that minimize the Dirichlet energy with modern graph representation learning techniques, our learnable topological features can provide efficient structural information of phylogenetic trees that automatically adapts to different downstream tasks without requiring domain expertise. We demonstrate the effectiveness and efficiency of our method on a simulated data tree probability estimation task and a benchmark of challenging real data variational Bayesian phylogenetic inference problems.
△ Less
Submitted 17 February, 2023;
originally announced February 2023.
-
Kainate receptor modulation by NETO2
Authors:
Lingli He,
Jiahui Sun,
Yiwei Gao,
Bin Li,
Yuhang Wang,
Yanli Dong,
Weidong An,
Hang Li,
Bei Yang,
Yuhan Ge,
Xuejun Cai Zhang,
Yun Stone Shi,
Yan Zhao
Abstract:
Glutamate-gated kainate receptors (KARs) are ubiquitous in the central nervous system of vertebrates, mediate synaptic transmission on post-synapse, and modulate transmitter release on pre-synapse. In the brain, the trafficking, gating kinetics, and pharmacology of KARs are tightly regulated by Neuropilin and tolloid-like proteins (Netos). Here we report cryo-EM structures of homo-tetrameric GluK2…
▽ More
Glutamate-gated kainate receptors (KARs) are ubiquitous in the central nervous system of vertebrates, mediate synaptic transmission on post-synapse, and modulate transmitter release on pre-synapse. In the brain, the trafficking, gating kinetics, and pharmacology of KARs are tightly regulated by Neuropilin and tolloid-like proteins (Netos). Here we report cryo-EM structures of homo-tetrameric GluK2 in complex with Neto2 at inhibited and desensitized states, illustrating variable stoichiometry of GluK2-Neto2 complexes, with one or two Neto2 subunits associate with the GluK2. We find that Neto2 accesses only two broad faces of KARs, intermolecularly crosslinking the lower-lobe of ATDA/C, upper-lobe of LBDB/D, and lower-lobe of LBDA/C, illustrating how Neto2 regulates receptor-gating kinetics. The transmembrane helix of Neto2 is positioned proximal to the selectivity filter and competes with the amphiphilic H1-helix after M4 for interacting with an ICD formed by the M1-M2 linkers of the receptor, revealing how rectification is regulated by Neto2.
△ Less
Submitted 2 February, 2023;
originally announced February 2023.
-
Flow cytometry with anti-diffraction light sheet (ADLS) by spatial light modulation
Authors:
Yanyan Gong,
Ming Zeng,
Yueqiang Zhu,
Shangyu Li,
Wei Zhao,
Ce Zhang,
Tianyun Zhao,
Kaige Wang,
Jiangcun Yang,
Jintao Bai
Abstract:
Flow cytometry is a widespread and powerful technique, whose resolution is determined by its capacity to accurately distinguish fluorescently positive populations from negative ones. However, most informative results are discarded while performing the measurements of conventional flow cytometry, e.g., the cell size, shape, morphology, and distribution or location of labeled exosomes within the unp…
▽ More
Flow cytometry is a widespread and powerful technique, whose resolution is determined by its capacity to accurately distinguish fluorescently positive populations from negative ones. However, most informative results are discarded while performing the measurements of conventional flow cytometry, e.g., the cell size, shape, morphology, and distribution or location of labeled exosomes within the unpurified biological samples. We, herein, propose a novel approach using an anti-diffraction light sheet with anisotroic feature to excite fluorescent tags. Constituted by an anti-diffraction Bessel-Gaussian beam array, the light sheet is 12 $μ$m wide, 12 $μ$m high, with a thickness of $~ 0.8 μ$m. The intensity profile of the excited fluorescent signal can, therefore, reflect the size and allow samples in the range from O(100 nm) to 10 $μ$m (e.g., blood cells) to be transported via hydrodynamic focusing in a microfluidic chip. The sampling rate is 500 kHz provides a capability of high throughput without sacrificing the spatial resolution. Consequently, the proposed anti-diffraction light-sheet flow cytometry (ADLSFC) can obtain more informative results than the conventional methodologies, and is able to provide multiple characteristics (e.g., the size and distribution of fluorescent signal) helping to distinguish the target samples from the complex backgrounds.
△ Less
Submitted 23 January, 2023;
originally announced January 2023.
-
Deep Learning Enables Reduced Gadolinium Dose for Contrast-Enhanced Blood-Brain Barrier Opening
Authors:
P. Lee,
H. Wei,
A. N. Pouliopoulos,
B. T. Forsyth,
Y. Yang,
C. Zhang,
A. F. Laine,
E. E. Konofagou,
C. Wu,
J. Guo
Abstract:
Focused ultrasound (FUS) can be used to open the blood-brain barrier (BBB), and MRI with contrast agents can detect that opening. However, repeated use of gadolinium-based contrast agents (GBCAs) presents safety concerns to patients. This study is the first to propose the idea of modeling a volume transfer constant (Ktrans) through deep learning to reduce the dosage of contrast agents. The goal of…
▽ More
Focused ultrasound (FUS) can be used to open the blood-brain barrier (BBB), and MRI with contrast agents can detect that opening. However, repeated use of gadolinium-based contrast agents (GBCAs) presents safety concerns to patients. This study is the first to propose the idea of modeling a volume transfer constant (Ktrans) through deep learning to reduce the dosage of contrast agents. The goal of the study is not only to reconstruct artificial intelligence (AI) derived Ktrans images but to also enhance the intensity with low dosage contrast agent T1 weighted MRI scans. We successfully validated this idea through a previous state-of-the-art temporal network algorithm, which focused on extracting time domain features at the voxel level. Then we used a Spatiotemporal Network (ST-Net), composed of a spatiotemporal convolutional neural network (CNN)-based deep learning architecture with the addition of a three-dimensional CNN encoder, to improve the model performance. We tested the ST-Net model on ten datasets of FUS-induced BBB-openings aquired from different sides of the mouse brain. ST-Net successfully detected and enhanced BBB-opening signals without sacrificing spatial domain information. ST-Net was shown to be a promising method of reducing the need of contrast agents for modeling BBB-opening K-trans maps from time-series Dynamic Contrast-Enhanced Magnetic Resonance Imaging (DCE-MRI) scans.
△ Less
Submitted 17 January, 2023;
originally announced January 2023.
-
Learning Task-Aware Effective Brain Connectivity for fMRI Analysis with Graph Neural Networks
Authors:
Yue Yu,
Xuan Kan,
Hejie Cui,
Ran Xu,
Yujia Zheng,
Xiangchen Song,
Yanqiao Zhu,
Kun Zhang,
Razieh Nabi,
Ying Guo,
Chao Zhang,
Carl Yang
Abstract:
Functional magnetic resonance imaging (fMRI) has become one of the most common imaging modalities for brain function analysis. Recently, graph neural networks (GNN) have been adopted for fMRI analysis with superior performance. Unfortunately, traditional functional brain networks are mainly constructed based on similarities among region of interests (ROI), which are noisy and agnostic to the downs…
▽ More
Functional magnetic resonance imaging (fMRI) has become one of the most common imaging modalities for brain function analysis. Recently, graph neural networks (GNN) have been adopted for fMRI analysis with superior performance. Unfortunately, traditional functional brain networks are mainly constructed based on similarities among region of interests (ROI), which are noisy and agnostic to the downstream prediction tasks and can lead to inferior results for GNN-based models. To better adapt GNNs for fMRI analysis, we propose TBDS, an end-to-end framework based on \underline{T}ask-aware \underline{B}rain connectivity \underline{D}AG (short for Directed Acyclic Graph) \underline{S}tructure generation for fMRI analysis. The key component of TBDS is the brain network generator which adopts a DAG learning approach to transform the raw time-series into task-aware brain connectivities. Besides, we design an additional contrastive regularization to inject task-specific knowledge during the brain network generation process. Comprehensive experiments on two fMRI datasets, namely Adolescent Brain Cognitive Development (ABCD) and Philadelphia Neuroimaging Cohort (PNC) datasets demonstrate the efficacy of TBDS. In addition, the generated brain networks also highlight the prediction-related brain regions and thus provide unique interpretations of the prediction results. Our implementation will be published to https://github.com/yueyu1030/TBDS upon acceptance.
△ Less
Submitted 31 October, 2022;
originally announced November 2022.
-
Bottom-up data integration in polymer models of chromatin organisation
Authors:
Alex Chen Yi Zhang,
Angelo Rosa,
Guido Sanguinetti
Abstract:
Cellular functions crucially depend on the precise execution of complex biochemical reactions taking place on the chromatin fiber in the tightly packed environment of the cell nucleus. Despite the availability of large data sets probing this process from multiple angles, we still lack a bottom-up framework which can incorporate the sequence-specific nature of biochemistry in a unified model of 3D…
▽ More
Cellular functions crucially depend on the precise execution of complex biochemical reactions taking place on the chromatin fiber in the tightly packed environment of the cell nucleus. Despite the availability of large data sets probing this process from multiple angles, we still lack a bottom-up framework which can incorporate the sequence-specific nature of biochemistry in a unified model of 3D chromatin dynamics. Here we propose SEMPER (Sequence Enhanced Magnetic PolymER), a novel stochastic polymer model which naturally incorporates observational data about sequence-driven biochemical processes, such as binding of transcription factor proteins, in a 3D model of chromatin structure. By introducing a new algorithm for approximate Bayesian inference, we discuss how to estimate in a robust manner the relative importance of biochemical vs. polymer signals in the determination of the chromatin epigenetic states which is leading to a significant revision of the interpretation of previous models. Furthermore we show that, without additional input from the genome 3D structure, our model can predict with reasonable accuracy some notable and non trivial conformational features of chromatin folding within the nucleus. Our work highlights the importance of introducing physically realistic statistical models for predicting chromatin states from epigenetic data, and opens the way to a new class of more systematic approaches to interpret epigenomic data.
△ Less
Submitted 16 March, 2023; v1 submitted 20 October, 2022;
originally announced October 2022.
-
Current and perspective sensing methods for monkeypox virus: a reemerging zoonosis in its infancy
Authors:
Ijaz Gul,
Changyue Liu,
Yuan Xi,
Zhicheng Du,
Shiyao Zhai,
Zhengyang Lei,
Chen Qun,
Muhammad Akmal Raheem,
Qian He,
Zhang Haihui,
Canyang Zhang,
Runming Wang,
Sanyang Han,
Du Ke,
Peiwu Qin
Abstract:
Objectives The review is dedicated to evaluate the current monkeypox virus (MPXV) detection methods, discuss their pros and cons, and provide recommended solutions to the problems.
Methods The literature for this review is identified through searches in PubMed, Web of Science, Google Scholar, ResearchGate, and Science Direct advanced search for articles published in English without any start dat…
▽ More
Objectives The review is dedicated to evaluate the current monkeypox virus (MPXV) detection methods, discuss their pros and cons, and provide recommended solutions to the problems.
Methods The literature for this review is identified through searches in PubMed, Web of Science, Google Scholar, ResearchGate, and Science Direct advanced search for articles published in English without any start date until June, 2022, by use of the terms "monkeypox virus" or "poxvirus" along with "diagnosis"; "PCR"; "real-time PCR"; "LAMP"; "RPA"; "immunoassay"; "reemergence"; "biothreat"; "endemic", and "multi-country outbreak" and also, by tracking citations of the relevant papers. The most relevant articles are included in the review.
Results Our literature review shows that PCR is the gold standard method for MPXV detection. In addition, loop-mediated isothermal amplification (LAMP) and recombinase polymerase amplification (RPA) have been reported as alternatives to PCR. Immunodiagnostics, whole particle detection, and image-based detection are the non-nucleic acid-based MPXV detection modalities.
Conclusions PCR is easy to leverage and adapt for a quick response to an outbreak, but the PCR-based MPXV detection approaches may not be suitable for marginalized settings. Limited progress has been made towards innovations in MPXV diagnostics, providing room for the development of novel detection techniques for this virus.
△ Less
Submitted 10 August, 2022;
originally announced August 2022.
-
Superficial White Matter Analysis: An Efficient Point-cloud-based Deep Learning Framework with Supervised Contrastive Learning for Consistent Tractography Parcellation across Populations and dMRI Acquisitions
Authors:
Tengfei Xue,
Fan Zhang,
Chaoyi Zhang,
Yuqian Chen,
Yang Song,
Alexandra J. Golby,
Nikos Makris,
Yogesh Rathi,
Weidong Cai,
Lauren J. O'Donnell
Abstract:
Diffusion MRI tractography is an advanced imaging technique that enables in vivo mapping of the brain's white matter connections. White matter parcellation classifies tractography streamlines into clusters or anatomically meaningful tracts. It enables quantification and visualization of whole-brain tractography. Currently, most parcellation methods focus on the deep white matter (DWM), whereas few…
▽ More
Diffusion MRI tractography is an advanced imaging technique that enables in vivo mapping of the brain's white matter connections. White matter parcellation classifies tractography streamlines into clusters or anatomically meaningful tracts. It enables quantification and visualization of whole-brain tractography. Currently, most parcellation methods focus on the deep white matter (DWM), whereas fewer methods address the superficial white matter (SWM) due to its complexity. We propose a novel two-stage deep-learning-based framework, Superficial White Matter Analysis (SupWMA), that performs an efficient and consistent parcellation of 198 SWM clusters from whole-brain tractography. A point-cloud-based network is adapted to our SWM parcellation task, and supervised contrastive learning enables more discriminative representations between plausible streamlines and outliers for SWM. We train our model on a large-scale tractography dataset including streamline samples from labeled long- and medium-range (over 40 mm) SWM clusters and anatomically implausible streamline samples, and we perform testing on six independently acquired datasets of different ages and health conditions (including neonates and patients with space-occupying brain tumors). Compared to several state-of-the-art methods, SupWMA obtains highly consistent and accurate SWM parcellation results on all datasets, showing good generalization across the lifespan in health and disease. In addition, the computational speed of SupWMA is much faster than other methods.
△ Less
Submitted 23 January, 2023; v1 submitted 18 July, 2022;
originally announced July 2022.
-
Graph-based Molecular Representation Learning
Authors:
Zhichun Guo,
Kehan Guo,
Bozhao Nan,
Yijun Tian,
Roshni G. Iyer,
Yihong Ma,
Olaf Wiest,
Xiangliang Zhang,
Wei Wang,
Chuxu Zhang,
Nitesh V. Chawla
Abstract:
Molecular representation learning (MRL) is a key step to build the connection between machine learning and chemical science. In particular, it encodes molecules as numerical vectors preserving the molecular structures and features, on top of which the downstream tasks (e.g., property prediction) can be performed. Recently, MRL has achieved considerable progress, especially in methods based on deep…
▽ More
Molecular representation learning (MRL) is a key step to build the connection between machine learning and chemical science. In particular, it encodes molecules as numerical vectors preserving the molecular structures and features, on top of which the downstream tasks (e.g., property prediction) can be performed. Recently, MRL has achieved considerable progress, especially in methods based on deep molecular graph learning. In this survey, we systematically review these graph-based molecular representation techniques, especially the methods incorporating chemical domain knowledge. Specifically, we first introduce the features of 2D and 3D molecular graphs. Then we summarize and categorize MRL methods into three groups based on their input. Furthermore, we discuss some typical chemical applications supported by MRL. To facilitate studies in this fast-developing area, we also list the benchmarks and commonly used datasets in the paper. Finally, we share our thoughts on future research directions.
△ Less
Submitted 28 November, 2023; v1 submitted 8 July, 2022;
originally announced July 2022.
-
Detecting Schizophrenia with 3D Structural Brain MRI Using Deep Learning
Authors:
Junhao Zhang,
Vishwanatha M. Rao,
Ye Tian,
Yanting Yang,
Nicolas Acosta,
Zihan Wan,
Pin-Yu Lee,
Chloe Zhang,
Lawrence S. Kegeles,
Scott A. Small,
Jia Guo
Abstract:
Schizophrenia is a chronic neuropsychiatric disorder that causes distinct structural alterations within the brain. We hypothesize that deep learning applied to a structural neuroimaging dataset could detect disease-related alteration and improve classification and diagnostic accuracy. We tested this hypothesis using a single, widely available, and conventional T1-weighted MRI scan, from which we e…
▽ More
Schizophrenia is a chronic neuropsychiatric disorder that causes distinct structural alterations within the brain. We hypothesize that deep learning applied to a structural neuroimaging dataset could detect disease-related alteration and improve classification and diagnostic accuracy. We tested this hypothesis using a single, widely available, and conventional T1-weighted MRI scan, from which we extracted the 3D whole-brain structure using standard post-processing methods. A deep learning model was then developed, optimized, and evaluated on three open datasets with T1-weighted MRI scans of patients with schizophrenia. Our proposed model outperformed the benchmark model, which was also trained with structural MR images using a 3D CNN architecture. Our model is capable of almost perfectly (area under the ROC curve = 0.987) distinguishing schizophrenia patients from healthy controls on unseen structural MRI scans. Regional analysis localized subcortical regions and ventricles as the most predictive brain regions. Subcortical structures serve a pivotal role in cognitive, affective, and social functions in humans, and structural abnormalities of these regions have been associated with schizophrenia. Our finding corroborates that schizophrenia is associated with widespread alterations in subcortical brain structure and the subcortical structural information provides prominent features in diagnostic classification. Together, these results further demonstrate the potential of deep learning to improve schizophrenia diagnosis and identify its structural neuroimaging signatures from a single, standard T1-weighted brain MRI.
△ Less
Submitted 7 July, 2022; v1 submitted 26 June, 2022;
originally announced June 2022.
-
Generative Models Improve Radiomics Reproducibility in Low Dose CTs: A Simulation Study
Authors:
Junhua Chen,
Chong Zhang,
Alberto Traverso,
Ivan Zhovannik,
Andre Dekker,
Leonard Wee,
Inigo Bermejo
Abstract:
Radiomics is an active area of research in medical image analysis, the low reproducibility of radiomics has limited its applicability to clinical practice. This issue is especially prominent when radiomic features are calculated from noisy images, such as low dose computed tomography (CT) scans. In this article, we investigate the possibility of improving the reproducibility of radiomic features c…
▽ More
Radiomics is an active area of research in medical image analysis, the low reproducibility of radiomics has limited its applicability to clinical practice. This issue is especially prominent when radiomic features are calculated from noisy images, such as low dose computed tomography (CT) scans. In this article, we investigate the possibility of improving the reproducibility of radiomic features calculated on noisy CTs by using generative models for denoising.One traditional denoising method - non-local means - and two generative models - encoder-decoder networks (EDN) and conditional generative adversarial networks (CGANs) - were selected as the test models. We added noise to the sinograms of full dose CTs to mimic low dose CTs with two different levels of noise: low-noise CT and high-noise CT. Models were trained on high-noise CTs and used to denoise low-noise CTs without re-training. We also test the performance of our model in real data, using dataset of same-day repeat low dose CTs to assess the reproducibility of radiomic features in denoised images. The EDN and the CGAN improved the concordance correlation coefficients (CCC) of radiomic features for low-noise images from 0.87 to 0.92 and for high-noise images from 0.68 to 0.92 respectively. Moreover, the EDN and the CGAN improved the test-retest reliability of radiomic features (mean CCC increased from 0.89 to 0.94) based on real low dose CTs. The results show that denoising using EDN and CGANs can improve the reproducibility of radiomic features calculated on noisy CTs. Moreover, images with different noise levels can be denoised to improve the reproducibility using these models without re-training, as long as the noise intensity is equal or lower than that in high-noise CTs. To the authors' knowledge, this is the first effort to improve the reproducibility of radiomic features calculated on low dose CT scans.
△ Less
Submitted 13 August, 2021; v1 submitted 30 April, 2021;
originally announced April 2021.
-
Variational Bayesian Supertrees
Authors:
Michael Karcher,
Cheng Zhang,
Frederick A Matsen IV
Abstract:
Given overlapping subsets of a set of taxa (e.g. species), and posterior distributions on phylogenetic tree topologies for each of these taxon sets, how can we infer a posterior distribution on phylogenetic tree topologies for the entire taxon set? Although the equivalent problem for in the non-Bayesian case has attracted substantial research, the Bayesian case has not attracted the attention it d…
▽ More
Given overlapping subsets of a set of taxa (e.g. species), and posterior distributions on phylogenetic tree topologies for each of these taxon sets, how can we infer a posterior distribution on phylogenetic tree topologies for the entire taxon set? Although the equivalent problem for in the non-Bayesian case has attracted substantial research, the Bayesian case has not attracted the attention it deserves. In this paper we develop a variational Bayes approach to this problem and demonstrate its effectiveness.
△ Less
Submitted 22 April, 2021;
originally announced April 2021.
-
Improved Variational Bayesian Phylogenetic Inference with Normalizing Flows
Authors:
Cheng Zhang
Abstract:
Variational Bayesian phylogenetic inference (VBPI) provides a promising general variational framework for efficient estimation of phylogenetic posteriors. However, the current diagonal Lognormal branch length approximation would significantly restrict the quality of the approximating distributions. In this paper, we propose a new type of VBPI, VBPI-NF, as a first step to empower phylogenetic poste…
▽ More
Variational Bayesian phylogenetic inference (VBPI) provides a promising general variational framework for efficient estimation of phylogenetic posteriors. However, the current diagonal Lognormal branch length approximation would significantly restrict the quality of the approximating distributions. In this paper, we propose a new type of VBPI, VBPI-NF, as a first step to empower phylogenetic posterior estimation with deep learning techniques. By handling the non-Euclidean branch length space of phylogenetic models with carefully designed permutation equivariant transformations, VBPI-NF uses normalizing flows to provide a rich family of flexible branch length distributions that generalize across different tree topologies. We show that VBPI-NF significantly improves upon the vanilla VBPI on a benchmark of challenging real data Bayesian phylogenetic inference problems. Further investigation also reveals that the structured parameterization in those permutation equivariant transformations can provide additional amortization benefit.
△ Less
Submitted 1 December, 2020;
originally announced December 2020.
-
Parameter estimation in FACS-seq enables high-throughput characterization of phenotypic heterogeneity
Authors:
Huibao Feng,
Chong Zhang
Abstract:
Phenotypic heterogeneity is a most fascinating property of a population of cells, which shows the differences among individuals even with the same genetic background and extracellular environmental conditions. However, the lack of high-throughput analysis of phenotypic diversity has limited our research progress. To deal with it, we constructed a novel parameter estimation method in FACS-seq, a co…
▽ More
Phenotypic heterogeneity is a most fascinating property of a population of cells, which shows the differences among individuals even with the same genetic background and extracellular environmental conditions. However, the lack of high-throughput analysis of phenotypic diversity has limited our research progress. To deal with it, we constructed a novel parameter estimation method in FACS-seq, a commonly used experimental framework, to achieve simultaneous characterization of thousands of variants in a library. We further demonstrated the model's ability in estimating the expression properties of each variant, which we believe can help to decipher the mechanisms of phenotypic heterogeneity.
△ Less
Submitted 7 October, 2020;
originally announced October 2020.
-
SumGNN: Multi-typed Drug Interaction Prediction via Efficient Knowledge Graph Summarization
Authors:
Yue Yu,
Kexin Huang,
Chao Zhang,
Lucas M. Glass,
Jimeng Sun,
Cao Xiao
Abstract:
Thanks to the increasing availability of drug-drug interactions (DDI) datasets and large biomedical knowledge graphs (KGs), accurate detection of adverse DDI using machine learning models becomes possible. However, it remains largely an open problem how to effectively utilize large and noisy biomedical KG for DDI detection. Due to its sheer size and amount of noise in KGs, it is often less benefic…
▽ More
Thanks to the increasing availability of drug-drug interactions (DDI) datasets and large biomedical knowledge graphs (KGs), accurate detection of adverse DDI using machine learning models becomes possible. However, it remains largely an open problem how to effectively utilize large and noisy biomedical KG for DDI detection. Due to its sheer size and amount of noise in KGs, it is often less beneficial to directly integrate KGs with other smaller but higher quality data (e.g., experimental data). Most of the existing approaches ignore KGs altogether. Some try to directly integrate KGs with other data via graph neural networks with limited success. Furthermore, most previous works focus on binary DDI prediction whereas the multi-typed DDI pharmacological effect prediction is a more meaningful but harder task. To fill the gaps, we propose a new method SumGNN: knowledge summarization graph neural network, which is enabled by a subgraph extraction module that can efficiently anchor on relevant subgraphs from a KG, a self-attention based subgraph summarization scheme to generate a reasoning path within the subgraph, and a multi-channel knowledge and data integration module that utilizes massive external biomedical knowledge for significantly improved multi-typed DDI predictions. SumGNN outperforms the best baseline by up to 5.54\%, and the performance gain is particularly significant in low data relation types. In addition, SumGNN provides interpretable prediction via the generated reasoning paths for each prediction.
△ Less
Submitted 6 May, 2021; v1 submitted 3 October, 2020;
originally announced October 2020.
-
A calibration-free method for biosensing in cell manufacturing
Authors:
Jialei Chen,
Zhaonan Liu,
Kan Wang,
Chen Jiang,
Chuck Zhang,
Ben Wang
Abstract:
Chimeric antigen receptor T cell therapy has demonstrated innovative therapeutic effectiveness in fighting cancers; however, it is extremely expensive due to the intrinsic patient-to-patient variability in cell manufacturing. We propose in this work a novel calibration-free statistical framework to effectively recover critical quality attributes under the patient-to-patient variability. Specifical…
▽ More
Chimeric antigen receptor T cell therapy has demonstrated innovative therapeutic effectiveness in fighting cancers; however, it is extremely expensive due to the intrinsic patient-to-patient variability in cell manufacturing. We propose in this work a novel calibration-free statistical framework to effectively recover critical quality attributes under the patient-to-patient variability. Specifically, we model this variability via a patient-specific calibration parameter, and use readings from multiple biosensors to construct a patient-invariance statistic, thereby alleviating the effect of the calibration parameter. A carefully formulated optimization problem and an algorithmic framework are presented to find the best patient-invariance statistic and the model parameters. Using the patient-invariance statistic, we can recover the critical quality attribute of interest, free from the calibration parameter. We demonstrate improvements of the proposed calibration-free method in different simulation experiments. In the cell manufacturing case study, our method not only effectively recovers viable cell concentration for monitoring, but also reveals insights for the cell manufacturing process.
△ Less
Submitted 27 July, 2020;
originally announced July 2020.
-
Neural encoding and interpretation for high-level visual cortices based on fMRI using image caption features
Authors:
Kai Qiao,
Chi Zhang,
Jian Chen,
Linyuan Wang,
Li Tong,
Bin Yan
Abstract:
On basis of functional magnetic resonance imaging (fMRI), researchers are devoted to designing visual encoding models to predict the neuron activity of human in response to presented image stimuli and analyze inner mechanism of human visual cortices. Deep network structure composed of hierarchical processing layers forms deep network models by learning features of data on specific task through big…
▽ More
On basis of functional magnetic resonance imaging (fMRI), researchers are devoted to designing visual encoding models to predict the neuron activity of human in response to presented image stimuli and analyze inner mechanism of human visual cortices. Deep network structure composed of hierarchical processing layers forms deep network models by learning features of data on specific task through big dataset. Deep network models have powerful and hierarchical representation of data, and have brought about breakthroughs for visual encoding, while revealing hierarchical structural similarity with the manner of information processing in human visual cortices. However, previous studies almost used image features of those deep network models pre-trained on classification task to construct visual encoding models. Except for deep network structure, the task or corresponding big dataset is also important for deep network models, but neglected by previous studies. Because image classification is a relatively fundamental task, it is difficult to guide deep network models to master high-level semantic representations of data, which causes into that encoding performance for high-level visual cortices is limited. In this study, we introduced one higher-level vision task: image caption (IC) task and proposed the visual encoding model based on IC features (ICFVEM) to encode voxels of high-level visual cortices. Experiment demonstrated that ICFVEM obtained better encoding performance than previous deep network models pre-trained on classification task. In addition, the interpretation of voxels was realized to explore the detailed characteristics of voxels based on the visualization of semantic words, and comparative analysis implied that high-level visual cortices behaved the correlative representation of image content.
△ Less
Submitted 26 March, 2020;
originally announced March 2020.
-
BigGAN-based Bayesian reconstruction of natural images from human brain activity
Authors:
Kai Qiao,
Jian Chen,
Linyuan Wang,
Chi Zhang,
Li Tong,
Bin Yan
Abstract:
In the visual decoding domain, visually reconstructing presented images given the corresponding human brain activity monitored by functional magnetic resonance imaging (fMRI) is difficult, especially when reconstructing viewed natural images. Visual reconstruction is a conditional image generation on fMRI data and thus generative adversarial network (GAN) for natural image generation is recently i…
▽ More
In the visual decoding domain, visually reconstructing presented images given the corresponding human brain activity monitored by functional magnetic resonance imaging (fMRI) is difficult, especially when reconstructing viewed natural images. Visual reconstruction is a conditional image generation on fMRI data and thus generative adversarial network (GAN) for natural image generation is recently introduced for this task. Although GAN-based methods have greatly improved, the fidelity and naturalness of reconstruction are still unsatisfactory due to the small number of fMRI data samples and the instability of GAN training. In this study, we proposed a new GAN-based Bayesian visual reconstruction method (GAN-BVRM) that includes a classifier to decode categories from fMRI data, a pre-trained conditional generator to generate natural images of specified categories, and a set of encoding models and evaluator to evaluate generated images. GAN-BVRM employs the pre-trained generator of the prevailing BigGAN to generate masses of natural images, and selects the images that best matches with the corresponding brain activity through the encoding models as the reconstruction of the image stimuli. In this process, the semantic and detailed contents of reconstruction are controlled by decoded categories and encoding models, respectively. GAN-BVRM used the Bayesian manner to avoid contradiction between naturalness and fidelity from current GAN-based methods and thus can improve the advantages of GAN. Experimental results revealed that GAN-BVRM improves the fidelity and naturalness, that is, the reconstruction is natural and similar to the presented image stimuli.
△ Less
Submitted 13 March, 2020;
originally announced March 2020.
-
Protein structure and sequence re-analysis of 2019-nCoV genome does not indicate snakes as its intermediate host or the unique similarity between its spike protein insertions and HIV-1
Authors:
Chengxin Zhang,
Wei Zheng,
Xiaoqiang Huang,
Eric W. Bell,
Xiaogen Zhou,
Yang Zhang
Abstract:
As the infection of 2019-nCoV coronavirus is quickly developing into a global pneumonia epidemic, careful analysis of its transmission and cellular mechanisms is sorely needed. In this report, we re-analyzed the computational approaches and findings presented in two recent manuscripts by Ji et al. (https://doi.org/10.1002/jmv.25682) and by Pradhan et al. (https://doi.org/10.1101/2020.01.30.927871)…
▽ More
As the infection of 2019-nCoV coronavirus is quickly developing into a global pneumonia epidemic, careful analysis of its transmission and cellular mechanisms is sorely needed. In this report, we re-analyzed the computational approaches and findings presented in two recent manuscripts by Ji et al. (https://doi.org/10.1002/jmv.25682) and by Pradhan et al. (https://doi.org/10.1101/2020.01.30.927871), which concluded that snakes are the intermediate hosts of 2019-nCoV and that the 2019-nCoV spike protein insertions shared a unique similarity to HIV-1. Results from our re-implementation of the analyses, built on larger-scale datasets using state-of-the-art bioinformatics methods and databases, do not support the conclusions proposed by these manuscripts. Based on our analyses and existing data of coronaviruses, we concluded that the intermediate hosts of 2019-nCoV are more likely to be mammals and birds than snakes, and that the "novel insertions" observed in the spike protein are naturally evolved from bat coronaviruses.
△ Less
Submitted 8 February, 2020;
originally announced February 2020.
-
Predicting the Impact of Electric Field Stimulation in a Detailed Computational Model of Cortical Tissue
Authors:
Frances Hutchings,
Christopher Thornton,
Chencheng Zhang,
Yujiang Wang,
Marcus Kaiser
Abstract:
Neurostimulation using weak electric fields has generated excitement in recent years due to its potential as a medical intervention. However, study of this stimulation modality has been hampered by inconsistent results and large variability within and between studies. In order to begin addressing this variability we need to properly characterise the impact of the current on the underlying neuron p…
▽ More
Neurostimulation using weak electric fields has generated excitement in recent years due to its potential as a medical intervention. However, study of this stimulation modality has been hampered by inconsistent results and large variability within and between studies. In order to begin addressing this variability we need to properly characterise the impact of the current on the underlying neuron populations. To develop and test a computational model capable of capturing the impact of electric field stimulation on networks of neurons. We construct a cortical tissue model with distinct layers and explicit neuron morphologies. We then apply a model of electrical stimulation and carry out multiple test case simulations. The cortical slice model is compared to experimental literature and shown to capture the main features of the electrophysiological response to stimulation. Namely, the model showed 1) a similar level of depolarisation in individual pyramidal neurons, 2) acceleration of intrinsic oscillations, and 3) retention of the spatial profile of oscillations in different layers. We then apply alternative electric fields to demonstrate how the model can capture differences in neuronal responses to the electric field. We demonstrate that the tissue response is dependent on layer depth, the angle of the apical dendrite relative to the field, and stimulation strength. We present publicly available computational modelling software that predicts the neuron network population response to electric field stimulation.
△ Less
Submitted 28 January, 2020;
originally announced January 2020.
-
Multiplex stimulated Raman scattering imaging cytometry reveals cancer metabolic signatures in a spatially, temporally, and spectrally resolved manner
Authors:
Kai-Chih Huang,
Junjie Li,
Chi Zhang,
Yuying Tan,
Ji-Xin Cheng
Abstract:
In situ measurement of cellular metabolites is still a challenge in biology. Conventional methods, such as mass spectrometry or fluorescence microscopy, would either destruct the sample or introduce strong perturbations to the functions of target molecules. Here, we present multiplex stimulated Raman scattering (SRS) imaging cytometry as a label-free single-cell analysis platform with chemical spe…
▽ More
In situ measurement of cellular metabolites is still a challenge in biology. Conventional methods, such as mass spectrometry or fluorescence microscopy, would either destruct the sample or introduce strong perturbations to the functions of target molecules. Here, we present multiplex stimulated Raman scattering (SRS) imaging cytometry as a label-free single-cell analysis platform with chemical specifity, and high-throughput capabilities. Cellular compartments such as lipid droplets, endoplasmic reticulum, and nuclei are seperated from the cytoplasm. Based on these chemical segmentations, 260 features from both morphology and molecular composition were generated and analyzed for each cell. Using SRS imaging cytometry, we studied the metabolic responses of human pancreatic cancer cells under stress by starvation and chemotherapy drug treatments. We unveiled lipid-facilitated protrusion as a metabolic marker for stress-resistant cancer cells through statistical analysis of thousands of cells. Our findings also demonstrate the potential of targeting lipid metabolism for selective treatment of starvation-resistant and chemotherapy-resistant cancers. These results highlight our SRS imaging cytometry as a powerful label-free tool for biological discoveries with a high-throughput, high-content capacity.
△ Less
Submitted 17 December, 2019;
originally announced December 2019.
-
Effective and efficient ROI-wise visual encoding using an end-to-end CNN regression model and selective optimization
Authors:
Kai Qiao,
Chi Zhang,
Jian Chen,
Linyuan Wang,
Li Tong,
Bin Yan
Abstract:
Recently, visual encoding based on functional magnetic resonance imaging (fMRI) have realized many achievements with the rapid development of deep network computation. Visual encoding model is aimed at predicting brain activity in response to presented image stimuli. Currently, visual encoding is accomplished mainly by firstly extracting image features through convolutional neural network (CNN) mo…
▽ More
Recently, visual encoding based on functional magnetic resonance imaging (fMRI) have realized many achievements with the rapid development of deep network computation. Visual encoding model is aimed at predicting brain activity in response to presented image stimuli. Currently, visual encoding is accomplished mainly by firstly extracting image features through convolutional neural network (CNN) model pre-trained on computer vision task, and secondly training a linear regression model to map specific layer of CNN features to each voxel, namely voxel-wise encoding. However, the two-step manner model, essentially, is hard to determine which kind of well features are well linearly matched for beforehand unknown fMRI data with little understanding of human visual representation. Analogizing computer vision mostly related human vision, we proposed the end-to-end convolution regression model (ETECRM) in the region of interest (ROI)-wise manner to accomplish effective and efficient visual encoding. The end-to-end manner was introduced to make the model automatically learn better matching features to improve encoding performance. The ROI-wise manner was used to improve the encoding efficiency for many voxels. In addition, we designed the selective optimization including self-adapting weight learning and weighted correlation loss, noise regularization to avoid interfering of ineffective voxels in ROI-wise encoding. Experiment demonstrated that the proposed model obtained better predicting accuracy than the two-step manner of encoding models. Comparative analysis implied that end-to-end manner and large volume of fMRI data may drive the future development of visual encoding.
△ Less
Submitted 27 July, 2019;
originally announced July 2019.
-
Category decoding of visual stimuli from human brain activity using a bidirectional recurrent neural network to simulate bidirectional information flows in human visual cortices
Authors:
Kai Qiao,
Jian Chen,
Linyuan Wang,
Chi Zhang,
Lei Zeng,
Li Tong,
Bin Yan
Abstract:
Recently, visual encoding and decoding based on functional magnetic resonance imaging (fMRI) have realized many achievements with the rapid development of deep network computation. Despite the hierarchically similar representations of deep network and human vision, visual information flows from primary visual cortices to high visual cortices and vice versa based on the bottom-up and top-down manne…
▽ More
Recently, visual encoding and decoding based on functional magnetic resonance imaging (fMRI) have realized many achievements with the rapid development of deep network computation. Despite the hierarchically similar representations of deep network and human vision, visual information flows from primary visual cortices to high visual cortices and vice versa based on the bottom-up and top-down manners, respectively. Inspired by the bidirectional information flows, we proposed a bidirectional recurrent neural network (BRNN)-based method to decode the categories from fMRI data. The forward and backward directions in the BRNN module characterized the bottom-up and top-down manners, respectively. The proposed method regarded the selected voxels of each visual cortex region (V1, V2, V3, V4, and LO) as one node in the sequence fed into the BRNN module and combined the output of the BRNN module to decode the categories with the subsequent fully connected layer. This new method allows the efficient utilization of hierarchical information representations and bidirectional information flows in human visual cortices. Experiment results demonstrated that our method improved the accuracy of three-level category decoding than other methods, which implicitly validated the hierarchical and bidirectional human visual representations. Comparative analysis revealed that the category representations of human visual cortices were hierarchical, distributed, complementary, and correlative.
△ Less
Submitted 18 March, 2019;
originally announced March 2019.
-
Molecular Polar Belief Propagation Decoder and Successive Cancellation Decoder
Authors:
Zhiwei Zhong,
Lulu Ge,
Zaichen Zhang,
Xiaohu You,
Chuan Zhang
Abstract:
By constructing chemical reaction networks (CRNs), this paper proposes a method of synthesizing polar decoder using belief propagation (BP) algorithm and successive cancellation (SC) algorithm, respectively. Theoretical analysis and simulation results have validated the feasibility of the method. Reactions in the proposed design could be experimentally implemented with DNA strand displacement reac…
▽ More
By constructing chemical reaction networks (CRNs), this paper proposes a method of synthesizing polar decoder using belief propagation (BP) algorithm and successive cancellation (SC) algorithm, respectively. Theoretical analysis and simulation results have validated the feasibility of the method. Reactions in the proposed design could be experimentally implemented with DNA strand displacement reactions, making the proposed polar decoders promising for wide application in nanoscale devices.
△ Less
Submitted 16 March, 2019;
originally announced March 2019.
-
A visual encoding model based on deep neural networks and transfer learning
Authors:
Chi Zhang,
Kai Qiao,
Linyuan Wang,
Li Tong,
Guoen Hu,
Ruyuan Zhang,
Bin Yan
Abstract:
Background: Building visual encoding models to accurately predict visual responses is a central challenge for current vision-based brain-machine interface techniques. To achieve high prediction accuracy on neural signals, visual encoding models should include precise visual features and appropriate prediction algorithms. Most existing visual encoding models employ hand-craft visual features (e.g.,…
▽ More
Background: Building visual encoding models to accurately predict visual responses is a central challenge for current vision-based brain-machine interface techniques. To achieve high prediction accuracy on neural signals, visual encoding models should include precise visual features and appropriate prediction algorithms. Most existing visual encoding models employ hand-craft visual features (e.g., Gabor wavelets or semantic labels) or data-driven features (e.g., features extracted from deep neural networks (DNN)). They also assume a linear mapping between feature representation to brain activity. However, it remains unknown whether such linear mapping is sufficient for maximizing prediction accuracy. New Method: We construct a new visual encoding framework to predict cortical responses in a benchmark functional magnetic resonance imaging (fMRI) dataset. In this framework, we employ the transfer learning technique to incorporate a pre-trained DNN (i.e., AlexNet) and train a nonlinear mapping from visual features to brain activity. This nonlinear mapping replaces the conventional linear mapping and is supposed to improve prediction accuracy on brain activity. Results: The proposed framework can significantly predict responses of over 20% voxels in early visual areas (i.e., V1-lateral occipital region, LO) and achieve unprecedented prediction accuracy. Comparison with Existing Methods: Comparing to two conventional visual encoding models, we find that the proposed encoding model shows consistent higher prediction accuracy in all early visual areas, especially in relatively anterior visual areas (i.e., V4 and LO). Conclusions: Our work proposes a new framework to utilize pre-trained visual features and train non-linear mappings from visual features to brain activity.
△ Less
Submitted 23 February, 2019;
originally announced February 2019.
-
Cameraless High-throughput 3D Imaging Flow Cytometry
Authors:
Yuanyuan Han,
Rui Tang,
Yi Gu,
Alex Ce Zhang,
Wei Cai,
Violet Castor,
Sung Hwan Cho,
William Alaynick,
Yu-Hwa Lo
Abstract:
Increasing demand for understanding the vast heterogeneity of cellular phenotypes has driven the development of imaging flow cytometry (IFC), that combines features of flow cytometry with fluorescence and bright field microscopy. IFC combines the throughput and statistical advantage of flow cytometry with the ability to discretely measure events based on a real or computational image, as well as c…
▽ More
Increasing demand for understanding the vast heterogeneity of cellular phenotypes has driven the development of imaging flow cytometry (IFC), that combines features of flow cytometry with fluorescence and bright field microscopy. IFC combines the throughput and statistical advantage of flow cytometry with the ability to discretely measure events based on a real or computational image, as well as conventional flow cytometry metrics. A limitation of existing IFC systems is that, regardless of detection methodology, only two-dimensional (2D) cell images are obtained. Without tomographic three-dimensional (3D) resolution the projection problem remains: collapsing 3D information onto a 2D image, limiting the reliability of spot counting or co-localization crucial to cell phenotyping. Here we present a solution to the projection problem: three-dimensional imaging flow cytometry (3D-IFC), a high-throughput 3D cell imager based on optical sectioning microscopy. We combine orthogonal light-sheet scanning illumination with our previous spatiotemporal transformation detection to produce 3D cell image reconstruction from a cameraless single-pixel photodetector readout. We further demonstrate this capability by co-capturing 3D fluorescence and label-free side-scattering images of single cells in flow at a velocity of 0.2 m s-1, corresponding to a throughput of approximately 500 cells per second with 60,000 voxels (resized subsequently to 106 voxels) for each cell image at a resolution of less than 1 micron in X, Y, and Z dimensions. Improved high-throughput imaging tools are needed to phenotype-genotype recognized heterogeneity in the fields of immunology, oncology, cell- and gene- therapy, and drug discovery.
△ Less
Submitted 2 February, 2019;
originally announced February 2019.
-
A^2-Net: Molecular Structure Estimation from Cryo-EM Density Volumes
Authors:
Kui Xu,
Zhe Wang,
Jiangping Shi,
Hongsheng Li,
Qiangfeng Cliff Zhang
Abstract:
Constructing of molecular structural models from Cryo-Electron Microscopy (Cryo-EM) density volumes is the critical last step of structure determination by Cryo-EM technologies. Methods have evolved from manual construction by structural biologists to perform 6D translation-rotation searching, which is extremely compute-intensive. In this paper, we propose a learning-based method and formulate thi…
▽ More
Constructing of molecular structural models from Cryo-Electron Microscopy (Cryo-EM) density volumes is the critical last step of structure determination by Cryo-EM technologies. Methods have evolved from manual construction by structural biologists to perform 6D translation-rotation searching, which is extremely compute-intensive. In this paper, we propose a learning-based method and formulate this problem as a vision-inspired 3D detection and pose estimation task. We develop a deep learning framework for amino acid determination in a 3D Cryo-EM density volume. We also design a sequence-guided Monte Carlo Tree Search (MCTS) to thread over the candidate amino acids to form the molecular structure. This framework achieves 91% coverage on our newly proposed dataset and takes only a few minutes for a typical structure with a thousand amino acids. Our method is hundreds of times faster and several times more accurate than existing automated solutions without any human intervention.
△ Less
Submitted 12 February, 2019; v1 submitted 3 January, 2019;
originally announced January 2019.
-
Dissociable neural representations of adversarially perturbed images in convolutional neural networks and the human brain
Authors:
Chi Zhang,
Xiaohan Duan,
Linyuan Wang,
Yongli Li,
Bin Yan,
Guoen Hu,
Ruyuan Zhang,
Li Tong
Abstract:
Despite the remarkable similarities between convolutional neural networks (CNN) and the human brain, CNNs still fall behind humans in many visual tasks, indicating that there still exist considerable differences between the two systems. Here, we leverage adversarial noise (AN) and adversarial interference (AI) images to quantify the consistency between neural representations and perceptual outcome…
▽ More
Despite the remarkable similarities between convolutional neural networks (CNN) and the human brain, CNNs still fall behind humans in many visual tasks, indicating that there still exist considerable differences between the two systems. Here, we leverage adversarial noise (AN) and adversarial interference (AI) images to quantify the consistency between neural representations and perceptual outcomes in the two systems. Humans can successfully recognize AI images as corresponding categories but perceive AN images as meaningless noise. In contrast, CNNs can correctly recognize AN images but mistakenly classify AI images into wrong categories with surprisingly high confidence. We use functional magnetic resonance imaging to measure brain activity evoked by regular and adversarial images in the human brain, and compare it to the activity of artificial neurons in a prototypical CNN-AlexNet. In the human brain, we find that the representational similarity between regular and adversarial images largely echoes their perceptual similarity in all early visual areas. In AlexNet, however, the neural representations of adversarial images are inconsistent with network outputs in all intermediate processing layers, providing no neural foundations for perceptual similarity. Furthermore, we show that voxel-encoding models trained on regular images can successfully generalize to the neural responses to AI images but not AN images. These remarkable differences between the human brain and AlexNet in the representation-perception relation suggest that future CNNs should emulate both behavior and the internal neural presentations of the human brain.
△ Less
Submitted 19 July, 2020; v1 submitted 21 December, 2018;
originally announced December 2018.
-
Simultaneous Measurement Imputation and Outcome Prediction for Achilles Tendon Rupture Rehabilitation
Authors:
Charles Hamesse,
Ruibo Tu,
Paul Ackermann,
Hedvig Kjellström,
Cheng Zhang
Abstract:
Achilles Tendon Rupture (ATR) is one of the typical soft tissue injuries. Rehabilitation after such a musculoskeletal injury remains a prolonged process with a very variable outcome. Accurately predicting rehabilitation outcome is crucial for treatment decision support. However, it is challenging to train an automatic method for predicting the ATR rehabilitation outcome from treatment data, due to…
▽ More
Achilles Tendon Rupture (ATR) is one of the typical soft tissue injuries. Rehabilitation after such a musculoskeletal injury remains a prolonged process with a very variable outcome. Accurately predicting rehabilitation outcome is crucial for treatment decision support. However, it is challenging to train an automatic method for predicting the ATR rehabilitation outcome from treatment data, due to a massive amount of missing entries in the data recorded from ATR patients, as well as complex nonlinear relations between measurements and outcomes. In this work, we design an end-to-end probabilistic framework to impute missing data entries and predict rehabilitation outcomes simultaneously. We evaluate our model on a real-life ATR clinical cohort, comparing with various baselines. The proposed method demonstrates its clear superiority over traditional methods which typically perform imputation and prediction in two separate stages.
△ Less
Submitted 13 August, 2019; v1 submitted 8 September, 2018;
originally announced October 2018.
-
DNA Computing for Combinational Logic
Authors:
Chuan Zhang,
Lulu Ge,
Yuchen Zhuang,
Ziyuan Shen,
Zhiwei Zhong,
Zaichen Zhang,
Xiaohu You
Abstract:
With the progressive scale-down of semiconductor's feature size, people are looking forward to More Moore and More than Moore. In order to offer a possible alternative implementation process, people are trying to figure out a feasible transfer from silicon to molecular computing. Such transfer lies on bio-based modules programming with computer-like logic, aiming at realizing the Turing machine. T…
▽ More
With the progressive scale-down of semiconductor's feature size, people are looking forward to More Moore and More than Moore. In order to offer a possible alternative implementation process, people are trying to figure out a feasible transfer from silicon to molecular computing. Such transfer lies on bio-based modules programming with computer-like logic, aiming at realizing the Turing machine. To accomplish this, the DNA-based combinational logic is inevitably the first step we have taken care of. This timely overview paper introduces combinational logic synthesized in DNA computing from both analog and digital perspectives separately. State-of-the-art research progress is summarized for interested readers to quick understand DNA computing, initiate discussion on existing techniques and inspire innovation solutions. We hope this paper can pave the way for the future DNA computing synthesis.
△ Less
Submitted 5 July, 2018;
originally announced July 2018.
-
Ultrafast population coding and axo-somatic compartmentalization
Authors:
Chenfei Zhang,
David Hofmann,
Andreas Neef,
Fred Wolf
Abstract:
Cortical neurons in the fluctuation driven regime can realize ultrafast population encoding. The underlying biophysical mechanisms, however, are not well understood. Reducing the sharpness of the action potential onset can impair ultrafast population encoding, but it is not clear whether a sharp action potential onset is sufficient for ultrafast population encoding. One hypothesis proposes that th…
▽ More
Cortical neurons in the fluctuation driven regime can realize ultrafast population encoding. The underlying biophysical mechanisms, however, are not well understood. Reducing the sharpness of the action potential onset can impair ultrafast population encoding, but it is not clear whether a sharp action potential onset is sufficient for ultrafast population encoding. One hypothesis proposes that the sharp action potential onset is caused by the electrotonic separation of the site of action potential initiation from the soma, and that this spatial separation also results in ultrafast population encoding. Here we examined this hypothesis by studying the linear response properties of model neurons with a defined initiation site. We find that placing the initiation site at different axonal positions has only a weak impact on the linear response function of the model. It fails to generate the ultrafast response and high bandwidth that is observed in cortical neurons. Furthermore, the high frequency regime of the linear response function of this model is insensitive to correlation times of the input current contradicting empirical evidence. When we increase the voltage sensitivity of sodium channels at the initiation site, the two empirically observed phenomena can be recovered. We provide an explanation for the dissociation of sharp action potential onset and ultrafast response. By investigating varying soma sizes, we furthermore highlight the effect of neuron morphology on the linear response. Our results show that a sharp onset of action potentials is not sufficient for the ultrafast response. In the light of recent reports of activity-dependent repositioning of the axon initial segment, our study predicts that a more distal initiation site can lead to an increased sharpness of the somatic waveform but it does not affect the linear response of a population of neurons.
△ Less
Submitted 3 July, 2018; v1 submitted 2 July, 2018;
originally announced July 2018.
-
Non-bifurcating phylogenetic tree inference via the adaptive LASSO
Authors:
Cheng Zhang,
Vu Dinh,
Frederick A. Matsen IV
Abstract:
Phylogenetic tree inference using deep DNA sequencing is reshaping our understanding of rapidly evolving systems, such as the within-host battle between viruses and the immune system. Densely sampled phylogenetic trees can contain special features, including "sampled ancestors" in which we sequence a genotype along with its direct descendants, and "polytomies" in which multiple descendants arise s…
▽ More
Phylogenetic tree inference using deep DNA sequencing is reshaping our understanding of rapidly evolving systems, such as the within-host battle between viruses and the immune system. Densely sampled phylogenetic trees can contain special features, including "sampled ancestors" in which we sequence a genotype along with its direct descendants, and "polytomies" in which multiple descendants arise simultaneously. These features are apparent after identifying zero-length branches in the tree. However, current maximum-likelihood based approaches are not capable of revealing such zero-length branches. In this paper, we find these zero-length branches by introducing adaptive-LASSO-type regularization estimators to phylogenetics, deriving their properties, and showing regularization to be a practically useful approach for phylogenetics.
△ Less
Submitted 1 June, 2020; v1 submitted 28 May, 2018;
originally announced May 2018.