|
Manual of Patent Examining Procedure - MPEP
Table of Contents
Previous section
- Administrative Instructions Under the PCT
Appendices
Appendix 1: Numeric Identifiers
Appendix 2: Nucleotide and Amino Acid Symbols and Feature Table
|
|
Table 1: |
List of Nucleotides |
|
|
Table 2: |
List of Modified Nucleotides |
|
|
Table 3: |
List of Amino Acids |
|
|
Table 4: |
List of Modified and Unusual Amino Acids |
|
|
Table 5: |
List of Feature Keys Related to Nucleotide Sequences |
|
|
Table 6: |
List of Feature Keys Related to Protein Sequences |
Appendix 3: Specimen Sequence Listing
[Appendices 1 to 3 to Annex C follow]
Annex C, Appendix 1Numeric Identifiers
Only numeric identifiers as defined below may be used in sequence listings submitted in applications. The text of the data element headings given below shall not be included in the sequence listings.
Numeric identifiers of mandatory data elements, that is, data elements which must be included in all sequence listings (see paragraph 25 of this Standard: items 110, 120, 160, 210, 211, 212, 213 and 400) and numeric identifiers of data elements which must be included in circumstances specified in this Standard (see paragraphs 26, 27, 28, 29 and 30 of this Standard: items 130, 140, 141, 150 and 151, and 220 to 223) are marked by the symbol "M."
Numeric identifiers of optional data elements (see paragraph 31 of this Standard) are marked by the symbol "O."
Table 3: List of Amino Acids
|
Numeric Identifier |
Numeric Identifier Description |
Mandatory (M) or Optional (O) |
Comment |
|
110 |
Applicant name |
M |
where the name of the applicant is written in characters other than those of the Latin alphabet, the same shall also be indicated in characters of the Latin alphabet either as a mere transliteration or through translation into English |
|
120 |
Title of Invention |
M |
|
|
130 |
File Reference |
M, in the circumstances specified in paragraph 26 of this Standard |
see paragraph 26 of this Standard |
|
140 |
Current patent application |
M, in the circumstances specified in paragraph 27 of this Standard |
see paragraph 27 of this Standard; the current patent application shall be identified, in the following order, by the two-letter code indicated in accordance with WIPO Standard ST.3 and the application number (in the format used by the industrial property Office with which the current patent application is filed) or, for an international application, by the international application number |
|
141 |
Current filing date |
M, in the circumstances specified in paragraph 27 of this Standard |
see paragraph 27 of this Standard; the date shall be indicated in accordance with WIPO Standard ST.2 (CCYY MM DD) |
|
150 |
Earlier patent application |
M, in the circumstances specified in paragraph 28 of this Standard |
see paragraph 28 of this Standard; the earlier patent application shall be identified, in the following order, by the two-letter code indicated in accordance with WIPO Standard ST.3 and the application number (in the format used by the industrial property Office with which the earlier patent application was filed) or, for an international application, by the international application number |
|
151 |
Earlier application filing date |
M, in the circumstances specified in paragraph 28 of this Standard |
see paragraph 28 of this Standard; the date shall be indicated in accordance with WIPO Standard ST.2 (CCYY MM DD) |
|
160 |
Number of SEQ ID NOs |
M |
|
|
170 |
Software |
O |
|
|
210 |
Information for SEQ ID NO: x |
M |
response shall be an integer representing the SEQ ID NO shown |
|
211 |
Length |
M |
sequence length expressed in number of base pairs or amino acids |
|
212 |
Type |
M |
type of molecule sequenced in SEQ ID NO: x, either DNA, RNA or PRT; if a nucleotide sequence contains both DNA and RNA fragments, the value shall be "DNA"; in addition, the combined DNA/RNA molecule shall be further described in the 220 to 223 feature section |
|
213 |
Organism |
M |
Genus Species (that is, scientific name) or "Artificial Sequence" or "Unknown" |
|
220 |
Feature |
M, in the circumstances specified in paragraph 29 and 30 of this Standard |
leave blank; see paragraphs 29 and 30 of this Standard; description of points of biological significance in the sequence in SEQ ID NO: x) (may be repeated depending on the number of features indicated) |
|
221 |
Name/key |
M in the circumstances specified in paragraph 29 of this Standard |
see paragraph 29 of this Standard; only those keys as described in Table 5 or 6 of Appendix 2 shall be used |
|
222 |
Location |
M, in the circumstances specified in paragraph 29 of this Standard |
see paragraph 29 of this Standard; - from (number of first base/amino acid in the feature) - to (number of last base/amino acid in the feature) - base pairs (numbers refer to positions of base pairs in a nucleotide sequence) - amino acids (numbers refer to positions of amino acid residues in an amino acid sequence) - whether feature is located on the complementary strand to that filed in the sequence listing |
|
223 |
Other information: |
M, in the circumstances specified in paragraphs 29 and 30 of this Standard |
see paragraphs 29 and 30 of this Standard; any other relevant information, using language neutral vocabulary, or free text (preferably in English); any free text is to be repeated in the main part of the description in the language thereof (see paragraph 36 of this Standard); where any modified base or modified/unusual L-amino acid appearing in Appendix 2, Tables 2 and 4, is in the sequence, the symbol associated with that base or amino acid from Appendix 2, Tables 2 and 4, should be used |
|
300 |
Publication information |
O |
leave blank; repeat section for each relevant publication |
|
301 |
Authors |
O |
|
|
302 |
Title |
O |
title of publication |
|
303 |
Journal |
O |
journal name in which data published |
|
304 |
Volume |
O |
journal volume in which data published |
|
305 |
Issue |
O |
journal issue number in which data published |
|
306 |
Pages |
O |
journal page numbers on which data published |
|
307 |
Date |
O |
journal date on which data published; if possible, the date shall be indicated in accordance with WIPO Standard ST.2 (CCYY MM DD) |
|
308 |
Database accession number |
O |
accession number assigned by database including database name |
|
309 |
Database entry date |
O |
date of entry in database; the date shall be indicated in accordance with WIPO Standard ST.2 (CCYY MM DD) |
|
310 |
Document number |
O |
document number, for patent type citations only; the full document shall specify, in the following order, the two-letter code indicated in accordance with WIPO Standard ST.3, the publication number indicated in accordance with WIPO Standard ST.6, and the kind-of-document code indicated in accordance with WIPO Standard ST.16 |
|
311 |
Filing date |
O |
document filing date, for patent-type citations only; the date shall be indicated in accordance with WIPO Standard ST.2 (CCYY MM DD) |
|
312 |
Publication date |
O |
document publication date; for patent-type citations only; the date shall be indicated in accordance with WIPO Standard ST.2 (CCYY MM DD) |
|
313 |
Relevant residues in SEQ ID NO: x: from to |
O |
|
|
400 |
Sequence |
M |
SEQ ID NO: x should follow the numeric identifier and should appear on the line preceding the sequence (see Appendix 3) |
Annex C, Appendix 2 Nucleotide and Amino Acid Symbols and Feature Table Table 1: List of Nucleotides
|
Symbol |
Meaning |
Origin of designation |
|
a |
a |
adenine |
|
g |
g |
guanine |
|
c |
c |
cytosine |
|
t |
t |
thymine |
|
u |
u |
uracil |
|
r |
g or a |
purine |
|
y |
t/u or c |
pyrimidine |
|
m |
a or c |
amino |
|
k |
g or t/u |
keto |
|
s |
g or c |
strong interactions 3H-bonds |
|
w |
a or t/u |
weak interactions 2H-bonds |
|
b |
g or c or t/u |
not a |
|
d |
a or g or t/u |
not c |
|
h |
a or c or t/u |
not g |
|
v |
a or g or c |
not t, not u |
|
n |
a or g or c or t/u, unknown, or other |
any |
Table 2: List of Modified Nucleotides
|
Symbol |
Meaning |
|
ac4c |
4-acetylcytidine |
|
chm5u |
5-(carboxyhydroxymethyl)uridine |
|
cm |
2'-O-methylcytidine |
|
cmnm5s2u |
5-carboxymethylaminomethyl-2-thiouridine |
|
cmnm5u |
5-carboxymethylaminomethyluridine |
|
d |
dihydrouridine |
|
fm |
2'-O-methylpseudouridine |
|
gal q |
beta, D-galactosylqueuosine |
|
gm |
2'-O-methylguanosine |
|
i |
inosine |
|
i6a |
N6-isopentenyladenosine |
|
m1a |
1-methyladenosine |
|
m1f |
1-methylpseudouridine |
|
m1g |
1-methylguanosine |
|
m1i |
1-methylinosine |
|
m22g |
2,2-dimethylguanosine |
|
m2a |
2-methyladenosine |
|
m2g |
2-methylguanosine |
|
m3c |
3-methylcytidine |
|
m5c |
5-methylcytidine |
|
m6a |
N6-methyladenosine |
|
m7g |
7-methylguanosine |
|
mam5u |
5-methylaminomethyluridine |
|
mam5s2u |
5-methoxyaminomethyl-2-thiouridine |
|
man q |
beta, D-mannosylqueuosine |
|
mcm5s2u |
5-methoxycarbonylmethyl-2-thiouridine |
|
mcm5u |
5-methoxycarbonylmethyluridine |
|
mo5u |
5-methoxyuridine |
|
ms2i6a |
2-methylthio-N6-isopentenyladenosine |
|
ms2t6a |
N-((9-beta-D-ribofuranosyl-2-methylthiopurine-6-yl)carbamoyl)threonine |
|
mt6a |
N-((9-beta-D-ribofuranosylpurine-6-yl)N-methylcarbamoyl)threonine |
|
mv |
uridine-5-oxyacetic acid-methylester |
|
o5u |
uridine-5-oxyacetic acid |
|
osyw |
wybutoxosine |
|
p |
pseudouridine |
|
q |
queuosine |
|
s2c |
2-thiocytidine |
|
s2t |
5-methyl-2-thiouridine |
|
s2u |
2-thiouridine |
|
s4u |
4-thiouridine |
|
t |
5-methyluridine |
|
t6a |
N-((9-beta-D-ribofuranosylpurine-6-yl)-carbamoyl)threonine |
|
tm |
2'-O-methyl-5-methyluridine |
|
um |
2'-O-methyluridine |
|
yw |
wybutosine |
|
x |
3-(3-amino-3-carboxy-propyl)uridine, (acp3)u |
Table 3: List of Amino Acids
|
Symbol |
Meaning |
|
Ala |
Alanine |
|
Cys |
Cysteine |
|
Asp |
Aspartic Acid |
|
Glu |
Glutamic Acid |
|
Phe |
Phenylalanine |
|
Gly |
Glycine |
|
His |
Histidine |
|
Ile |
Isoleucine |
|
Lys |
Lysine |
|
Leu |
Leucine |
|
Met |
Methionine |
|
Asn |
Asparagine |
|
Pro |
Proline |
|
Gln |
Glutamine |
|
Arg |
Arginine |
|
Ser |
Serine |
|
Thr |
Threonine |
|
Val |
Valine |
|
Trp |
Tryptophan |
|
Tyr |
Tyrosine |
|
Asx |
Asp or Asn |
|
Glx |
Glu or Gln |
|
Xaa |
unknown or other |
Table 4: List of Modified and Unusual Amino Acids
|
Symbol |
Meaning |
|
Aad |
2-Aminoadipic acid |
|
bAad |
3-Aminoadipic acid |
|
bAla |
beta-Alanine, beta-Aminopropionic acid |
|
Abu |
2-Aminobutyric acid |
|
4Abu |
4-Aminobutyric acid, piperidinic acid |
|
Acp |
6-Aminocaproic acid |
|
Ahe |
2-Aminoheptanoic acid |
|
Aib |
2-Aminoisobutyric acid |
|
bAib |
3-Aminoisobutyric acid |
|
Apm |
2-Aminopimelic acid |
|
Dbu |
2,4 Diaminobutyric acid |
|
Des |
Desmosine |
|
Dpm |
2,2'-Diaminopimelic acid |
|
Dpr |
2,3-Diaminopropionic acid |
|
EtGly |
N-Ethylglycine |
|
EtAsn |
N-Ethylasparagine |
|
Hyl |
Hydroxylysine |
|
aHyl |
allo-Hydroxylysine |
|
3Hyp |
3-Hydroxyproline |
|
4Hyp |
4-Hydroxyproline |
|
Ide |
Isodesmosine |
|
aIle |
allo-Isoleucine |
|
MeGly |
N-Methylglycine, sarcosine |
|
MeIle |
N-Methylisoleucine |
|
MeLys |
6-N-Methyllysine |
|
MeVal |
N-Methylvaline |
|
Nva |
Norvaline |
|
Nle |
Norleucine |
|
Orn |
Ornithine |
Table 5: List of Feature Keys Related to Nucleotide Sequences
|
Key |
Description |
|
allele |
a related individual or strain contains stable, alternative forms of the same gene which differs from the presented sequence at this location (and perhaps others) |
|
attenuator |
(1) region of DNA at which regulation of termination of transcription occurs, which controls the expression of some bacterial operons; (2) sequence segment located between the promoter and the first structural gene that causes partial termination of transcription |
|
C_region |
constant region of immunoglobulin light and heavy chains, and T-cell receptor alpha, beta, and gamma chains; includes one or more exons depending on the particular chain |
|
CAAT_signal |
CAAT box; part of a conserved sequence located about 75 bp up-stream of the start point of eukaryotic transcription units which may be involved in RNA polymerase binding; consensus=GG (C or T) CAATCT |
|
CDS |
coding sequence; sequence of nucleotides that corresponds with the sequence of amino acids in a protein (location includes stop codon); feature includes amino acid conceptual translation |
|
conflict |
independent determinations of the "same" sequence differ at this site or region |
|
D-loop |
displacement loop; a region within mitochondrial DNA in which a short stretch of RNA is paired with one strand of DNA, displacing the original partner DNA strand in this region; also used to describe the displacement of a region of one strand of duplex DNA by a single stranded invader in the reaction catalyzed by RecA protein |
|
D-segment |
diversity segment of immunoglobulin heavy chain, and T-cell receptor beta chain |
|
enhancer |
a cis-acting sequence that increases the utilization of (some) eukaryotic promoters, and can function in either orientation and in any location (upstream or downstream) relative to the promoter |
|
exon |
region of genome that codes for portion of spliced mRNA; may contain 5'UTR all CDSs, and 3'UTR |
|
GC_signal |
GC box; a conserved GC-rich region located upstream of the start point of eukaryotic transcription units which may occur in multiple copies or in either orientation; consensus=GGGCGG |
|
gene |
region of biological interest identified as a gene and for which a name has been assigned |
|
iDNA |
intervening DNA; DNA which is eliminated through any of several kinds of recombination |
|
intron |
a segment of DNA that is transcribed, but removed from within the transcript by splicing together the sequences (exons) on either side of it |
|
J_segment |
joining segment of immunoglobulin light and heavy chains, and T-cell receptor alpha, beta, and gamma chains |
|
LTR |
long terminal repeat, a sequence directly repeated at both ends of a defined sequence, of the sort typically found in retroviruses |
|
mat_peptide |
mature peptide or protein coding sequence; coding sequence for the mature or final peptide or protein product following post-translational modification; the location does not include the stop codon (unlike the corresponding CDS) |
|
misc_binding |
site in nucleic acid which covalently or non-covalently binds another moiety that cannot be described by any other Binding key (primer_bind or protein_bind) |
|
misc_difference |
feature sequence is different from that presented in the entry and cannot be described by any other Difference key (conflict, unsure, old_sequence, mutation, variation, allele, or modified_base) |
|
misc_feature |
region of biological interest which cannot be described by any other feature key; a new or rare feature |
|
misc_recomb |
site of any generalized, site-specific or replicative recombination event where there is a breakage and reunion of duplex DNA that cannot be described by other recombination keys (iDNA and virion) or qualifiers of source key (/insertion_seq, /transposon, /proviral) |
|
misc_RNA |
any transcript or RNA product that cannot be defined by other RNA keys (prim_transcript, precursor_RNA, mRNA, 5'clip, 3'clip, 5'UTR, 3'UTR, exon, CDS, sig_peptide, transit_peptide, mat_peptide, intron, polyA_site, rRNA, tRNA, scRNA, and snRNA) |
|
misc_signal |
any region containing a signal controlling or altering gene function or expression that cannot be described by other Signal keys (promoter, CAAT_signal, TATA_signal, -35_signal, -10_signal, GC_signal, RBS, polyA_signal, enhancer, attenuator, terminator, and rep_origin) |
|
misc_structure |
any secondary or tertiary structure or conformation that cannot be described by other Structure keys (stem_loop and D-loop) |
|
modified_base |
the indicated nucleotide is a modified nucleotide and should be substituted for by the indicated molecule (given in the mod_base qualifier value) |
|
mRNA |
messenger RNA; includes 5' untranslated region (5'UTR), coding sequences (CDS, exon) and 3' untranslated region (3'UTR) |
|
mutation |
a related strain has an abrupt, inheritable change in the sequence at this location |
|
N_region |
extra nucleotides inserted between rearranged immunoglobulin segments |
|
old_sequence |
the presented sequence revises a previous version of the sequence at this location |
|
polyA_signal |
recognition region necessary for endonuclease cleavage of an RNA transcript that is followed by polyadenylation; consensus=AATAAA |
|
polyA_site |
site on an RNA transcript to which will be added adenine residues by post-transcriptional polyadenylation |
|
precursor_RNA |
any RNA species that is not yet the mature RNA product; may include 5' clipped region (5'clip), 5' untranslated region (5'UTR), coding sequences (CDS, exon), intervening sequences (intron), 3' untranslated region (3'UTR), and 3' clipped region (3'clip) |
|
prim_transcript |
primary (initial, unprocessed) transcript; includes 5' clipped region (5'clip), 5' untranslated region (5'UTR), coding sequences (CDS, exon), intervening sequences (intron), 3' untranslated region (3'UTR), and 3' clipped region (3' clip) |
|
primer_bind |
non-covalent primer binding site for initiation of replication, transcription, or reverse transcription; includes site(s) for synthetic, for example, PCR primer elements |
|
promoter |
region on a DNA molecule involved in RNA polymerase binding to initiate transcription |
|
protein_bind |
non-covalent protein binding site on nucleic acid |
|
RBS |
ribosome binding site |
|
repeat_region |
region of genome containing repeating units |
|
repeat_unit |
single repeat element |
|
rep_origin |
origin of replication; starting site for duplication of nucleic acid to give two identical copies |
|
rRNA |
mature ribosomal RNA; the RNA component of the ribonucleoprotein particle (ribosome) which assembles amino acids into proteins |
|
S_region |
switch region of immunoglobulin heavy chains; involved in the rearrangement of heavy chain DNA leading to the expression of a different immunoglobulin class from the same B-cell |
|
satellite |
many tandem repeats (identical or related) of a short basic repeating unit; many have a base composition or other property different from the genome average that allows them to be separated from the bulk (main band) genomic DNA |
|
scRNA |
small cytoplasmic RNA; any one of several small cytoplasmic RNA molecules present in the cytoplasm and (sometimes) nucleus of a eukaryote |
|
sig_peptide |
signal peptide coding sequence; coding sequence for an N-terminal domain of a secreted protein; this domain is involved in attaching nascent polypeptide to the membrane; leader sequence |
|
snRNA |
small nuclear RNA; any one of many small RNA species confined to the nucleus; several of the snRNAs are involved in splicing or other RNA processing reactions |
|
source |
identifies the biological source of the specified span of the sequence; this key is mandatory; every entry will have, as a minimum, a single source key spanning the entire sequence; more than one source key per sequence is permissable |
|
stem_loop |
hairpin; a double-helical region formed by base-pairing between adjacent (inverted) complementary sequences in a single strand of RNA or DNA |
|
STS |
Sequence Tagged Site; short, single-copy DNA sequence that characterizes a mapping landmark on the genome and can be detected by PCR; a region of the genome can be mapped by determining the order of a series of STSs |
|
TATA_signal |
TATA box; Goldberg-Hogness box; a conserved AT-rich septamer found about 25 bp before the start point of each eukaryotic RNA polymerase II transcript unit which may be involved in positioning the enzyme for correct initiation; consensus=TATA(A or T)A(A or T) |
|
terminator |
sequence of DNA located either at the end of the transcript or adjacent to a promoter region that causes RNA polymerase to terminate transcription; may also be site of binding of repressor protein |
|
transit_peptide |
transit peptide coding sequence; coding sequence for an N-terminal domain of a nuclear-encoded organellar protein; this domain is involved in post-translational import of the protein into the organelle |
|
tRNA |
mature transfer RNA, a small RNA molecule (75-85 bases long) that mediates the translation of a nucleic acid sequence into an amino acid sequence |
|
unsure |
author is unsure of exact sequence in this region |
|
V_region |
variable region of immunoglobulin light and heavy chains, and T-cell receptor alpha, beta, and gamma chains; codes for the variable amino terminal portion; can be made up from V_segments, D_segments, N_regions, and J_segments |
|
V_segment |
variable segment of immunoglobulin light and heavy chains, and T-cell receptor alpha, beta, and gamma chains; codes for most of the variable region (V_region) and the last few amino acids of the leader peptide |
|
variation |
a related strain contains stable mutations from the same gene (for example, RFLPs, polymorphisms, etc.) which differ from the presented sequence at this location (and possibly others) |
|
3'clip |
3'-most region of a precursor transcript that is clipped off during processing |
|
3'UTR |
region at the 3′ end of a mature transcript (following the stop codon) that is not translated into a protein |
|
5'clip |
5'-most region of a precursor transcript that is clipped off during processing |
|
5'UTR |
region at the 5' end of a mature transcript (preceding the initiation codon) that is not translated into a protein |
|
-10_signal |
pribnow box; a conserved region about 10 bp upstream of the start point of bacterial transcription units which may be involved in binding RNA polymerase; consensus=TAtAaT |
|
-35_signal |
a conserved hexamer about 35 bp upstream of the start point of bacterial transcription units; consensus=TTGACa [ ] or TGTTGACA [ ] |
Table 6: List of Feature Keys Related to Protein Sequences
|
Key |
Description |
|
CONFLICT |
different papers report differing sequences |
|
VARIANT |
authors report that sequence variants exist |
|
VARSPLIC |
description of sequence variants produced by alternative splicing |
|
MUTAGEN |
site which has been experimentally altered |
|
MOD_RES |
post-translational modification of a residue |
|
|
ACETYLATION |
N-terminal or other |
|
|
AMIDATION |
generally at the C-terminal of a mature active peptide |
|
|
BLOCKED |
undetermined N- or C-terminal blocking group |
|
|
FORMYLATION |
of the N-terminal methionine |
|
|
GAMMA-CARBOXYGLUTAMIC ACID HYDROXYLATION |
of asparagine, aspartic acid, proline or lysine |
|
|
METHYLATION |
generally of lysine or arginine |
|
|
PHOSPHORYLATION |
of serine, threonine, tyrosine, aspartic acid or histidine |
|
|
PYRROLIDONE CARBOXYLIC ACID |
N-terminal glutamate which has formed an internal cyclic lactam |
|
|
SULFATATION |
generally of tyrosine |
|
LIPID |
covalent binding of a lipidic moiety |
|
|
MYRISTATE |
myristate group attached through an amide bond to the N-terminal glycine residue of the mature form of a protein or to an internal lysine residue |
|
|
PALMITATE |
palmitate group attached through a thioether bond to a cysteine residue or through an ester bond to a serine or threonine residue |
|
|
FARNESYL |
farnesyl group attached through a thioether bond to a cysteine residue |
|
|
GERANYL-GERANYL |
geranyl-geranyl group attached through a thioether bond to a cysteine residue |
|
|
GPI-ANCHOR |
glycosyl-phosphatidylinositol (GPI) group linked to the alpha-carboxyl group of the C-terminal residue of the mature form of a protein |
|
|
N-ACYL DIGLYCERIDE |
N-terminal cysteine of the mature form of a prokaryotic lipoprotein with an amide-linked fatty acid and a glyceryl group to which two fatty acids are linked by ester linkages |
|
DISULFID |
disulfide bond; the `FROM' and `TO' endpoints represent the two residues which are linked by an intra-chain disulfide bond; if the `FROM' and `TO' endpoints are identical, the disulfide bond is an interchain one and the description field indicates the nature of the cross-link |
|
THIOLEST |
thiolester bond; the `FROM' and `TO' endpoints represent the two residues which are linked by the thiolester bond |
|
THIOETH |
thioether bond; the `FROM' and `TO'endpoints represent the two residues which are linked by the thioether bond |
|
CARBOHYD |
glycosylation site; the nature of the carbohydrate (if known) is given in the description field |
|
METAL |
binding site for a metal ion; the description field indicates the nature of the metal |
|
BINDING |
binding site for any chemical group (co-enzyme, prosthetic group, etc.); the chemical nature of the group is given in the description field |
|
SIGNAL |
extent of a signal sequence (prepeptide) |
|
TRANSIT |
extent of a transit peptide (mitochondrial, chloroplastic, or for a microbody) |
|
PROPEP |
extent of a propeptide |
|
CHAIN |
extent of a polypeptide chain in the mature protein |
|
PEPTIDE |
extent of a released active peptide |
|
DOMAIN |
extent of a domain of interest on the sequence; the nature of that domain is given in the description field |
|
CA_BIND |
extent of a calcium-binding region |
|
DNA_BIND |
extent of a DNA-binding region |
|
NP_BIND |
extent of a nucleotide phosphate binding region; the nature of the nucleotide phosphate is indicated in the description field |
|
TRANSMEM |
extent of a transmembrane region |
|
ZN_FING |
extent of a zinc finger region |
|
SIMILAR |
extent of a similarity with another protein sequence; precise information, relative to that sequence is given in the description field |
|
REPEAT |
extent of an internal sequence repetition |
|
HELIX |
secondary structure: Helices, for example, Alpha-helix, 3(10) helix, or Pi-helix |
|
STRAND |
secondary structure: Beta-strand, for example, Hydrogen bonded beta-strand, or Residue in an isolated beta-bridge |
|
TURN |
secondary structure Turns, for example, H-bonded turn (3-turn, 4-turn or 5-turn) |
|
ACT_SITE |
amino acid(s) involved in the activity of an enzyme |
|
SITE |
any other interesting site on the sequence |
|
INIT_MET |
the sequence is known to start with an initiator methionine |
|
NON_TER |
the residue at an extremity of the sequence is not the terminal residue; if applied to position 1, this signifies that the first position is not the N-terminus of the complete molecule; if applied to the last position, it signifies that this position is not the C-terminus of the complete molecule; there is no description field for this key |
|
NON_CONS |
non consecutive residues; indicates that two residues in a sequence are not consecutive and that there are a number of unsequenced residues between them |
|
UNSURE |
uncertainties in the sequence; used to describe region(s) of a sequence for which the authors are unsure about the sequence assignment |
[Annex C, Appendix 3, follows]
Specimen Sequence Listing


[Annex C-bis follows]
ANNEX C-bisTECHNICAL REQUIREMENTS FOR THE PRESENTATION OF TABLES RELATED TO NUCLEOTIDE AND AMINO ACID SEQUENCE LISTINGS IN INTERNATIONAL PATENT APPLICATIONS UNDER THE PCT
Introduction
1. These technical requirements have been elaborated so as to provide standardization of the presentation of tables related to nucleotide and amino acid sequence listings in international patent applications. These technical requirements are intended to allow the applicant to draw up such tables in a manner which is acceptable to all receiving Offices, International Searching Authorities, International Preliminary Examining Authorities and to the International Bureau for the purposes of the international phase and to all designated and elected Offices for the purposes of the national phase.
Definition
2. For the purposes of these technical requirements, "competent Authority" is the International Searching Authority that is to carry out the international search on the international application, or the International Preliminary Examining Authority that is to carry out the international preliminary examination on the international application, or the designated/elected Office before which the processing of the international application has started.
Tables related to sequence listings
3. Tables filed in electronic form under Section 801(a) shall comply with one of the following character formats: (i) UTF-8-encoded Unicode 3.0; or
(ii) XML format conforming to the "Application-Body" Document Type Definition referred to in Appendix I of Annex F;
at the option of the competent Authority.
4. The spatial relationships (e.g., columns and rows) of the table elements shall be maintained.
5. At the option of the competent Authority, file compression is acceptable, so long as the compressed file is in a self-extracting format that will decompress on a Personal Computer Operating system that is acceptable to the competent Authority and to the International Bureau.
6. Each table shall be contained within a separate electronic file on any electronic medium that is acceptable to the competent Authority. The file recorded on the electronic medium that is acceptable to the competent Authority shall be encoded using IBM Code Page 437, IBM Code Page 932 or a compatible code page. A compatible code page, as would be required for, for example, Japanese, Chinese, Cyrillic, Arabic, Greek or Hebrew characters, is one that assigns the Roman alphabet and numerals to the same hexadecimal positions as do the specified code pages.
7. Tables filed in electronic form may be created by any means, as long as the table on an electronic medium that is acceptable to the competent Authority is readable under a Personal Computer Operating system that is acceptable to the competent Authority and to the International Bureau.
8. If the electronic medium that is acceptable to the competent Authority is submitted after the date of filing of an application, the labels shall also include the filing date of the application and the application number.
[Annex D follows]
ANNEX DINFORMATION FROM PAMPHLET FRONT PAGE TO BE INCLUDED IN THE GAZETTE UNDER RULE 86.1(a)(i)
The following information shall be extracted from the front page of the pamphlet of each published international application and shall, in accordance with Rule 86.1(a)(i), appear in the corresponding entry of the Gazette:
1. as to the international publication: 1.1 the international publication number
1.2 the date of the international publication
1.3 an indication whether the following items were published in the pamphlet: 1.31 international search report
1.32 declaration under Article 17(2)
1.33 claims amended under Article 19(1)
1.34 statement under Article 19(1)
1.35 [Deleted]
1.36 request for rectification under the third sentence of Rule 91.1(f)
1.37 information concerning a priority claim which was considered not to have been made, published upon request made under Rule 26bis.2(c)
1.4 the language in which the international application was filed
1.5 the language of publication of the international application
2. as to the international application: 2.1 the title of the invention
2.2 the symbol(s) of the International Patent Classification (IPC)
2.3 the international application number
2.4 the international filing date
3. as to any priority claim: 3.1 the application number of the earlier application
3.2 the date on which the earlier application was filed
3.3 where the earlier application is: 3.31 a national application: the country in which the earlier application was filed
3.32 a regional application: the authority entrusted with the granting of regional patents under the applicable regional patent treaty and, in the case referred to in Rule 4.10(b)(ii), a country party to the Paris Convention for the Protection of Industrial Property for which that earlier application was filed
3.33 an international application: the receiving Office with which it was filed
4. as to the applicant, inventor and agent: 4.1 their name(s)
4.2 their mailing address(es)
5. as to the designated States: 5.1 their names
5.2 the indication of any wish for a regional patent
5.3 the indication that every kind of protection available is sought, unless otherwise indicated
6. as to a statement concerning non-prejudicial disclosure or exception to lack of novelty: 6.1 the date of the disclosure
6.2 the place of the disclosure
6.3 the kind of the disclosure (e.g., exhibition, scientific publication, conference reports, etc.)
6.4 the title of the exhibition, publication or conference
7. as to any indication in relation to deposited biological material furnished under Rule 13bis separately from the description: 7.1 the fact that such indication is published
7.2 the date on which the International Bureau received such indication
8. as to any declaration referred to in Rule 4.17 which was received by the International Bureau before the expiration of the time limit under Rule 26ter.1: 8.1 the fact that such a declaration was made and a reference to the applicable item in Rule 4.17 under which it was made
8.2 an indication of those designations for the purposes of which such declaration was made.
[Annex E follows]
ANNEX EINFORMATION TO BE PUBLISHED IN THE GAZETTE UNDER RULE 86.1(a)(v)
1. The time limits applicable under Articles 22 and 39 in respect of each Contracting State.
2. The list of the non-patent literature agreed upon by the International Searching Authorities for inclusion in the minimum documentation.
3. The names of the national Offices which do not wish to receive copies under Article 13(2)(c).
4. The provisions of the national laws of Contracting States concerning international-type search.
5. The text of the agreements entered into between the International Bureau and the International Searching Authorities or the International Preliminary Examining Authorities.
6. The names of the national Offices which entirely or in part waived their rights to any communication under Article 20.
7. The names of the Contracting States which are bound by Chapter II of the PCT.
8. Index of concordance of international application numbers and international publication numbers, listed according to international application numbers.
9. Index of applicants' names giving, for each name, the corresponding international publication number(s).
10. Index of international publication numbers, grouped according to the International Patent Classification symbols.
11. Indication of any subject matter that will not be searched or examined by the various International Searching and Preliminary Examining Authorities under Rules 39 and 67.
12. Requirements of designated and elected Offices under Rules 49.5 and 76.5 in relation to the furnishing of translations.
13. The dates defining the period referred to in Rule 32.1(b) during which the international application, whose effects may be extended to a successor State under Rule 32.1, must have been filed.
[Annex F follows]
ANNEX FSTANDARD FOR THE ELECTRONIC FILING AND PROCESSING OF INTERNATIONAL APPLICATIONS
[The text of Annex F, which is not reproduced here, is available from WIPO's website at http://www.wipo.org/pct/en/index">1. Introduction
2. The E-PCT standard: Overview and vision
3. E-PCT submission structure and format
4. IA documents packaging
5. Transmission
6. Electronic filing software
7. [Deleted]
8. Principles of electronic records management
9. Abbreviated expressions, interpretation and glossary
Appendix I XML DTDs for the E-PCT Standard
Appendix II PKI Architecture for the E-PCT Standard
Appendix III Basic Common Standard for Electronic Filing
Appendix IV Use of Physical Media for the E-PCT Standard] [End of Appendix, Annex and document]
>
|