«  Secondary Structure Prediction   ::   Contents   ::   Smith-Waterman Search  »


SITECON

SITECON — is a program package for recognition of potential transcription factor binding sites basing on the data about conservative conformational and physicochemical properties revealed on the basis of the binding sites sets analysis.

To cite SITECON use the following article:

“Oshchepkov D.Y., Vityaev E.E., Grigorovich D.A., Ignatieva E.V., Khlebodarova T.M.SITECON: a tool for detecting conservative conformational and physicochemical properties in transcription factor binding site alignments and for siterecognition. //Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W208-12.”

UGENE version of SITECON provides a tool for recognition of potential binding sites for over 90 types of transcription factors. Also UGENE version of SITECON provides a tool for recognition of potential binding sites basing site alignment proposed by user. For the detailed method description see the original SITECON site.

Data about used context-dependent conformational and physicochemical properties are available in the PROPERTY Database.

SITECON Searching Transcription Factors Binding Sites

To search transcription factor binding sites in a DNA sequence select the Analyze ‣ Search TFBS with SITECON... context menu item.

In the appeared search dialog you must select a file with TFBS profile. The profiles supplied with UGENE are placed in the $UGENE/data/sitecon_models folder.

After the profile is loaded the threshold-filter is populated with values read from profile. You can use the filter to remove low-scoring regions from the result.

../_images/sitecon_dialog.png

The regions found by SITECON algorithm can be saved as annotations to the DNA sequence in the Genbank format.

Every SITECON profile supplied with UGENE contains complete information about calibration settings provided to UGENE team by the author of SITECON.

The original TFBS alignments used to calculate profiles can be requested directly from the author of SITECON.

Types of SITECON Models

Eukaryotic

Name Description
CEBP_a CCAAT-enhancer-binding protein_alpha
CEBP_all CCAAT-enhancer-binding proteins
CLOCK Circadian Locomotor Output Cycles Kaput
cMyc_can Myc (c-Myc) is a regulator gene that codes for a transcription factor. A mutated version of Myc is found in many cancers.
CRE Cyclic AMP response element
E2F1 Transcription factor E2F1 is a protein that in humans is encoded by the E2F1 gene.
E2F1/DP1sel1 E2F factors bind to DNA as homodimers or heterodimers in association with dimerization partner DP1.
EGR1 Early growth response protein 1
EKLf Erythroid Kruppel-like Factor
ER2 Estrogen receptor beta
GATA_all GATA transcription factors are a family of transcription factors characterized by their ability to bind to the DNA sequence “GATA”
GATA-1 GATA-binding factor 1
GATA-2 GATA-binding protein 2
GATA-3 Trans-acting T-cell-specific transcription factor GATA-3
HMG-1 High-mobility group protein 1
HNF-1 Hepatocyte nuclear factor 1
HNF-3 Hepatocyte nuclear factor 3
HNF-4 Hepatocyte nuclear factor 4
IRF Interferon regulatory factors
isre Interferon stimulation response element
MyoD MyoD belongs to a family of proteins known as myogenic regulatory factors (MRFs)
MyOGsel3 Myogenin
NF-1 Neurofibromin 1
NF-E2 Transcription factor NF-E2 45 kDa subunit is a protein that in humans is encoded by the NFE2 gene.
NFATp Pre-existing component of the NFAT(Nuclear factor of activated T-cells) transcription complex.
NFkB_all Nuclear factor kappa-light-chain-enhancer of activated B cells
NFkB_hetero The p50 (NFKB1)/p65 (RELA) heterodimer is the most abundant form of NF-kB
NFkB_ homo The c-Rel protein is a member of the NF-kB family of transcription factors and contains a Rel homology domain
Nfy Nuclear transcription factor Y
Nrf2 Nuclear factor (erythroid-derived 2)-like 2
Oct-1 Octamer transcription factor 1
Oct_all Octamer transcription factors
p53 Protein 53
PPRF Paramedian pontine reticular formation
Pu1 Is a protein that in humans is encoded by the SPI1 gene
setCREB cAMP response element-binding
setCREBzag cAMP response element-binding
SRE_san Serum response element
SRF Serum response factor
STAT1 Signal Transducer and Activator of Transcription 1
STAT Signal Transducer and Activator of Transcription
TTF1 Thyroid transcription factor 1
USF Upstream stimulatory factors
yy1 Is a protein that in humans is encoded by the YY1 gene

Prokaryotic

Name Description
AgaR N-acetylgalactosamine repressor, AgaR, negatively controls the expression of the aga gene cluster
AgaC AgaC is the Enzyme IIC domain of a predicted N-acetylgalactosamine-transporting PEP-dependent phosphotransferase system
ArcA ArcA transcriptional dual regulator
ArgR ArgR complexed with L-arginine represses the transcription of several genes involved in biosynthesis and transport of arginine, transport of histidine, and its own synthesis and activates genes for arginine catabolism.
CpxR DNA-binding response regulator in two-component regulatory system with CpxA
Crp cAMP receptor protein
CysB Cysteine B
CytR Cytidine Regulator
DeoR Deoxyribose Regulator
DnaA DnaA is the linchpin element in the initiation of DNA replication in E. coli.
FadR Fatty acid degradation Regulon
fis Factor for inversion stimulation
FlhDC Operon that encodes two transcriptional regulators
Fnr FNR is the primary transcriptional regulator that mediates the transition from aerobic to anaerobic growth through the regulation of hundreds of genes.
Frur Fructose repressor
FUR Ferric Uptake Regulation
GALR Galactose repressor
GALS Galactose isorepressor
GLPR sn-Glycerol-3-phosphate repressor
GNTP Is a member of the GntP family transporters
HNS Histone-like nucleoid structuring protein
ICLR Isocitrate lyase Regulator
IHF Integration host factor
ISCR1 Iron-sulfur cluster Regulator 1
ISCR3 Iron-sulfur cluster Regulator 3
LEXA LexA represses the transcription of several genes involved in the cellular response to DNA damage or inhibition of DNA replication
Lrp Leucine-responsive regulatory protein
MALT Maltose regulator
MARA Multiple antibiotic resistance
MELR Melibiose regulator
MEtJ MetJ represses the expression of genes involved in biosynthesis and transport of methionine
MetR1 MetR participates in controlling several genes involved in methionine biosynthesis [ Weissbach91 ] and a gene involved in protection against nitric oxide
MLC DgsA, better known as Mlc, “makes large colonies,” is a transcriptional dual regulator that controls the expression of a number of genes encoding enzymes of the Escherichia coli phosphotransferase (PTS) and phosphoenolpyruvate (PEP) systems
MODE Molybdate-responsive transcription factor
NAC Nitrogen assimilation control
NAGC_new2 N-acetylglucosamine
NANR N-acetyl-neuraminic acid regulator
NARL2 Nitrate/nitrite response regulator NarL
NARL Nitrate/nitrite response regulator NarL
NARP Nitrate/nitrite response regulator NarP
NIRC NirC is a nitrite transporter which is a member of the FNT family of formate and nitrite transporters
OmpC OmpC is a member of the GMP family
OxyR Oxidative stress regulator
PHOB PhoB is a dual transcription regulator that activates expression of the Pho regulon in response to environmental Pi
PHOP Member of the two-component regulatory system phoQ/phoP involved in adaptation to low Mg2+ environments and the control of acid resistance genes
PurR PurR dimer controls several genes involved in purine nucleotide biosynthesis and its own synthesis
RcsB_1 Regulator capsule synthesis B
RcsB_2 Regulator capsule synthesis B
Rob2 Right origin-binding protein
ROB Right origin-binding protein
soxS SoxS is a dual transcriptional activator and participates in the removal of superoxide and nitric oxide and protection from organic solvents and antibiotics
TORR TorR response regulator
TRPR Tryptophan (trp) transcriptional repressor
TyrR Tyrosine repressor