SIB Home page Computational Cancer Genomics Swiss EMBnet node ExPASy ISREC
CleanEx Tutorial



Target Batch Search : Formats for Input Files


Accepted identifiers
  • Gene symbols
    Only officially approved symbols. For Human, the HUGO nomenclature is used. For Mouse the MGD nomenclature is applied.
  • RefSeq accession numbers
    Only verified RefSeq sequences beginnig with "NM_"
  • Unigene accession numbers
  • Targets identifiers
    Standard format : "target-type-code"_ followed by the clone identifier. Specific targets codes are :

    IMAGE clones IMAGE_
    RESGEN clones RESGEN_
    Other clones OTHER_
    EST sequences EMBL_
    DNA sequences EMBL_
    RNA sequences EMBL_
    Affymetrix probesets AFFY_"Chip name"_

    For Affymetrix probesets, the code contains the regular Affymetrix chip identifier (for example, the code for probeset "1053_at" on the AFFY chip "HG-U133A" is : AFFY_HG-U133A_1053_at ).

File format
Text file, identifier separator can be space(s), carriage return, tab.
Favorite one is a text file with one space as input separator.