Yahoo Search Búsqueda en la Web

Resultado de búsqueda

  1. DeepLoc - 2.0 Prediction of eukaryotic protein subcellular localization using deep learning. DeepLoc 2.0 predicts the subcellular localization(s) of eukaryotic proteins. DeepLoc 2.0 is a multi-label predictor, which means that is able to predict one or more localizations for any given protein.

    • News

      Department of Health Technology Ørsteds Plads, Building 345C...

    • Education

      The department’s outstanding research profile, the...

    • Services and Products

      Based on unique facilities and specialized knowledge on a...

    • Cookie Overview

      Anker Engelunds Vej 101 2800 Kongens Lyngby CVR-nr. 30 06 09...

    • Frontpage

      At DTU Health Tech, we aspire to improve good health and...

    • DeepLoc 1.0

      DeepLoc-1.0 predicts the subcellular localization of...

    • Overview
    • Data
    • Models
    • Setup
    • Training Workflow
    • Citation

    Multi-label subcellular localization and sorting signal prediction based on protein foundation models (https://github.com/agemagician/ProtTrans, https://github.com/facebookresearch/esm).

    Prediction webserver is available at https://services.healthtech.dtu.dk/services/DeepLoc-2.0/

    The 'data_files' folder contains the data for training

    1.multisub_5_partitions_unique.csv: Annotated SwissProt Sequences, labels, and partitions for subcellular localization

    2.multisub_ninesignals.pkl, sorting_signals.csv: Annotated SwissProt Sequences and sorting signal annotations

    3.Processed FASTA files for generating embeddings

    Two models dubbed Fast (ESM1b) and Accurate (ProtT5) are used. refers to one of these.

    It is recommened to setup a conda environment using

    conda env create -f environment.yml

    Training is divided into two stages:

    python train_sl.py --model

    1.Generate and store embeddings for faster training. Note: h5 files of ~30-40 GB are stored in "data_files/embeddings".

    2.Train subcellular localization and interpretable attention.

    3.Generate predictions and intermediate representations for sorting signal prediction.

    4.Compute metrics on the SwissProt CV dataset.

    If you found this useful please consider citing

  2. 30 de abr. de 2022 · DeepLoc 2.0 uses a transformer-based protein language model to predict multi-label subcellular localization and provides interpretability via the attention and sorting signal prediction. Issue Section: Web Server issue.

  3. 5 de jul. de 2022 · Protein Sorting Signals. The prediction of protein subcellular localization is of great relevance for proteomics research. Here, we propose an update to the popular tool DeepLoc with multi-localization prediction and improvements in both performance and interpretability.

  4. 8 de abr. de 2024 · DeepLoc 2.0 is a popular web server for the prediction of protein subcellular localization and sorting signals. Here, we introduce DeepLoc 2.1, which additionally classifies the input proteins into the membrane protein types Transmembrane, Peripheral, Lipid-anchored and Soluble.

  5. For training and validation, we curate eukaryotic and human multi-location protein datasets with stringent homology partitioning and enriched with sorting signal information compiled from the liter-ature. We achieve state-of-the-art performance in DeepLoc 2.0 by using a pre-trained protein language model.