-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathaligned_data_sample.json
More file actions
10 lines (10 loc) · 26.1 KB
/
Copy pathaligned_data_sample.json
File metadata and controls
10 lines (10 loc) · 26.1 KB
1
2
3
4
5
6
7
8
9
10
{"seq": "MSLEQKKGADIISKILQIQNSIGKTTSPSTLKTKLSEISRKEQENARIQSKLSDLQKKKIDIDNKLLKEKQNLIKEEILERKKLEVLTKKQQKDEIEHQKKLKREIDAIKASTQYITDVSISSYNNTIPETEPEYDLFISHASEDKEDFVRPLAETLQQLGVNVWYDEFTLKVGDSLRQKIDSGLRNSKYGTVVLSTDFIKKDWTNYELDGLVAREMNGHKMILPIWHKITKNDVLDYSPNLADKVALNTSVNSIEEIAHQLADVILNR", "text": "PROTEIN NAME: 2' cyclic ADP-D-ribose synthase AbTIR (2'cADPR synthase AbTIR) (EC 3.2.2.-) (NAD(+) hydrolase AbTIR) (EC 3.2.2.6) (TIR domain-containing protein in A.baumannii) (AbTIR). FUNCTION: NAD(+) hydrolase (NADase) that catalyzes cleavage of NAD(+) into ADP-D-ribose (ADPR) and nicotinamide (PubMed:29395922). In addition to ADPR, also generates a cyclization variant of cyclic ADPR (cADPR), termed 2'cADPR (v-cADPR) (PubMed:29395922, PubMed:36048923). Cleaves NADP(+), but does not cyclize the product (PubMed:36048923). {ECO:0000269|PubMed:29395922, ECO:0000269|PubMed:36048923}."}
{"seq": "MRFQVIVAAATITMITSYIPGVASQSTSDGDDLFVPVSNFDPKSIFPEIKHPFEPMYANTENGKIVPTNSWISNLFYPSADNLAPTTPDPYTLRLLDGYGGNPGLTIRQPSAKVLGSYPPTNDVPYTDAGYMINSVVVDLRLTSSEWSDVVPDRQVTDWDHLSANLRLSTPQDSNSYIDFPIVRGMAYITANYNNLTPQFLSQHAIISVEADEKKSDDNTSTFSGRKFKITMNDDPTSTFIIYSLGDKPLELRKQDNSNLVASKPYTGVIRVAKLPAPEFETLLDASRAVWPTGGDISARSDDNNGASYTIKWKTNSNEAPLLTYAYAHHLTSIDDSNVKRTDMTLQSATKGPMTALVGNEWTLRETELSPVEWLPLQAAPNPTTINEIMTEINKDIASNYTQETAKEDNYFSGKGLQKFAMLALILNKSDQTQLRNPELAQIALDKLKAAFLPYLQNEQADPFRYDTLYKGIVAKAGLPTSMGGTDDLSAEFGHSYYSDHHYHQGYFVVTAAIIHHLDPTWNADRLKAWTEALIRDVNNANDGDEYFAAFRNWDWFAGHSWAGGIKPDGALDGRDQESVPESVNFYWGAKLWGLATGNTPLTKLASLQLAVTKRTTYEYFWMLDGNKNRPENIVRNKVIGIYFEQKTDYTTYFGRFLEYIHGIQQLPMTPELMEYIRTPEFVSQEWDEKLGAIAPTVQSPWAGVLYLNYAIINPAEAYPALRKVQMDDGQTRSYSLYLTATRPHFFRRSLLAALARHGSTRRPSLPSSGDDDKHEDGFLLRFRRLNPFNLKHRIY", "text": "PROTEIN NAME: Glucan endo-1,3-beta-D-glucosidase 1 (Endo-1,3-beta-glucanase 1) (EC 3.2.1.39) (Laminarinase) (RmLam81A). FUNCTION: Cleaves internal linkages in 1,3-beta-glucan. {ECO:0000269|PubMed:34801773}. SUBCELLULAR LOCATION: Secreted, cell wall {ECO:0000250|UniProtKB:P53753}. SIMILARITY: Belongs to the glycosyl hydrolase 81 family. {ECO:0000255|PROSITE-ProRule:PRU01352, ECO:0000305}."}
{"seq": "MKNPKKKSGGFRIVNMLKRGVARVSPFGGLKRLPAGLLLGHGPIRMVLAILAFLRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINARKEKKRRGADTSVGIVGLLLTTAMAAEVTRRGSAYYMYLDRNDAGEAISFPTTLGMNKCYIQIMDLGHMCDATMSYECPMLDEGVEPDDVDCWCNTTSTWVVYGTCHHKKGEARRSRRAVTLPSHSTRKLQTRSQTWLESREYTKHLIRVENWIFRNPGFALAAAAIAWLLGSSTSQKVIYLVMILLIAPAYSIRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRCPTQGEAYLDKQSDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHETDENRAKVEITPNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVEFKDAHAKRQTVVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTVEVQYAGTDGPCKVPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEATVRGAKRMAVLGDTAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLSTAVSADVGCSVDFSKKETRCGTGVFVYNDVEAWRDRYKYHPDSPRRLAAAVKQAWEDGICGISSVSRMENIMWRSVEGELNAILEENGVQLTVVVGSVKNPMWRGPQRLPVPVNELPHGWKAWGKSYFVRAAKTNNSFVVDGDTLKECPLKHRAWNSFLVEDHGFGVFHTSVWLKVREDYSLECDPAVIGTAVKGKEAVHSDLGYWIESEKNDTWRLKRAHLIEMKTCEWPKSHTLWTDGIEESDLIIPKSLAGPLSHHNTREGYRTQMKGPWHSEELEIRFEECPGTKVHVEETCGTRGPSLRSTTASGRVIEEWCCRECTMPPLSFRAKDGCWYGMEIRPRKEPESNLVRSMVTAGSTDHMDHFSLGVLVILLMVQEGLKKRMTTKIIISTSMAVLVAMILGGFSMSDLAKLAILMGATFAEMNTGGDVAHLALIAAFKVRPALLVSFIFRANWTPRESMLLALASCLLQTAISALEGDLMVLINGFALAWLAIRAMVVPRTDNITLAILAALTPLARGTLLVAWRAGLATCGGFMLLSLKGKGSVKKNLPFVMALGLTAVRLVDPINVVGLLLLTRSGKRSWPPSEVLTAVGLICALAGGFAKADIEMAGPMAAVGLLIVSYVVSGKSVDMYIERAGDITWEKDAEVTGNSPRLDVALDESGDFSLVEDDGPPMREIILKVVLMTICGMNPIAIPFAAGAWYVYVKTGKRSGALWDVPAPKEVKKGETTDGVYRVMTRRLLGSTQVGVGVMQEGVFHTMWHVTKGSALRSGEGRLDPYWGDVKQDLVSYCGPWKLDAAWDGHSEVQLLAVPPGERARNIQTLPGIFKTKDGDIGAVALDYPAGTSGSPILDKCGRVIGLYGNGVVIKNGSYVSAITQGRREEETPVECFEPSMLKKKQLTVLDLHPGAGKTRRVLPEIVREAIKTRLRTVILAPTRVVAAEMEEALRGLPVRYMTTAVNVTHSGTEIVDLMCHATFTSRLLQPIRVPNYNLYIMDEAHFTDPSSIAARGYISTRVEMGEAAAIFMTATPPGTRDAFPDSNSPIMDTEVEVPERAWSSGFDWVTDHSGKTVWFVPSVRNGNEIAACLTKAGKRVIQLSRKTFETEFQKTKHQEWDFVVTTDISEMGANFKADRVIDSRRCLKPVILDGERVILAGPMPVTHASAAQRRGRIGRNPNKPGDEYLYGGGCAETDEDHAHWLEARMLLDNIYLQDGLIASLYRPEADKVAAIEGEFKLRTEQRKTFVELMKRGDLPVWLAYQVASAGITYTDRRWCFDGTTNNTIMEDSVPAEVWTRHGEKRVLKPRWMDARVCSDHAALKSFKEFAAGKRGAAFGVMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEPEKQRSPQDNQMAIIIMVAVGLLGLITANELGWLERTKSDLSHLMGRREEGATIGFSMDIDLRPASAWAIYAALTTFITPAVQHAVTTSYNNYSLMAMATQAGVLFGMGKGMPFYAWDFGVPLLMIGCYSQLTPLTLIVAIILLVAHYMYLIPGLQAAAARAAQKRTAAGIMKNPVVDGIVVTDIDTMTIDPQVEKKMGQVLLIAVAVSSAILSRTAWGWGEAGALITAATSTLWEGSPNKYWNSSTATSLCNIFRGSYLAGASLIYTVTRNAGLVKRRGGGTGETLGEKWKARLNQMSALEFYSYKKSGITEVCREEARRALKDGVATGGHAVSRGSAKLRWLVERGYLQPYGKVIDLGCGRGGWSYYAATIRKVQEVKGYTKGGPGHEEPMLVQSYGWNIVRLKSGVDVFHMAAEPCDTLLCDIGESSSSPEVEEARTLRVLSMVGDWLEKRPGAFCIKVLCPYTSTMMETLERLQRRYGGGLVRVPLSRNSTHEMYWVSGAKSNTIKSVSTTSQLLLGRMDGPRRPVKYEEDVNLGSGTRAVVSCAEAPNMKIIGNRIERIRSEHAETWFFDENHPYRTWAYHGSYEAPTQGSASSLINGVVRLLSKPWDVVTGVTGIAMTDTTPYGQQRVFKEKVDTRVPDPQEGTRQVMSMVSSWLWKELGKHKRPRVCTKEEFINKVRSNAALGAIFEEEKEWKTAVEAVNDPRFWALVDKEREHHLRGECQSCVYNMMGKREKKQGEFGKAKGSRAIWYMWLGARFLEFEALGFLNEDHWMGRENSGGGVEGLGLQRLGYVLEEMSRIPGGRMYADDTAGWDTRISRFDLENEALITNQMEKGHRALALAIIKYTYQNKVVKVLRPAEKGKTVMDIISRQDQRGSGQVVTYALNTFTNLVVQLIRNMEAEEVLEMQDLWLLRRSEKVTNWLQSNGWDRLKRMAVSGDDCVVKPIDDRFAHALRFLNDMGKVRKDTQEWKPSTGWDNWEEVPFCSHHFNKLHLKDGRSIVVPCRHQDELIGRARVSPGAGWSIRETACLAKSYAQMWQLLYFHRRDLRLMANAICSSVPVDWVPTGRTTWSIHGKGEWMTTEDMLVVWNRVWIEENDHMEDKTPVTKWTDIPYLGKREDLWCGSLIGHRPRTTWAENIKNTVNMVRRIIGDEEKYMDYLSTQVRYLGEEGSTPGVL", "text": "PROTEIN NAME: Genome polyprotein [Cleaved into: Capsid protein C (Capsid protein) (Core protein); Protein prM (Precursor membrane protein); Peptide pr (Peptide precursor); Small envelope protein M (Matrix protein); Envelope protein E; Non-structural protein 1 (NS1); Non-structural protein 2A (NS2A); Serine protease subunit NS2B (Flavivirin protease NS2B regulatory subunit) (Non-structural protein 2B); Serine protease NS3 (EC 3.4.21.91) (EC 3.6.1.15) (EC 3.6.4.13) (Flavivirin protease NS3 catalytic subunit) (Non-structural protein 3); Non-structural protein 4A (NS4A); Peptide 2k; Non-structural protein 4B (NS4B); RNA-directed RNA polymerase NS5 (EC 2.1.1.56) (EC 2.1.1.57) (EC 2.7.7.48) (NS5)]. FUNCTION: [Capsid protein C]: Plays a role in virus budding by binding to the cell membrane and gathering the viral RNA into a nucleocapsid that forms the core of the mature virus particle (By similarity). During virus entry, may induce genome penetration into the host cytoplasm after hemifusion induced by the surface proteins (By similarity). Can migrate to the cell nucleus where it modulates host functions (By similarity). Inhibits the integrated stress response (ISR) in the infected cell (PubMed:28592527). {ECO:0000250|UniProtKB:P17763, ECO:0000269|PubMed:28592527}.; FUNCTION: [Capsid protein C]: Inhibits RNA silencing by interfering with host Dicer. {ECO:0000250|UniProtKB:P03314}.; FUNCTION: [Peptide pr]: Prevents premature fusion activity of envelope proteins in trans-Golgi by binding to envelope protein E at pH 6.0. After virion release in extracellular space, gets dissociated from E dimers. {ECO:0000250|UniProtKB:P17763}.; FUNCTION: [Protein prM]: Plays a role in host immune defense modulation and protection of envelope protein E during virion synthesis. PrM-E cleavage is inefficient, many virions are only partially matured and immature prM-E proteins could play a role in immune evasion. Contributes to fetal microcephaly in humans. Acts as a chaperone for envelope protein E during intracellular virion assembly by masking and inactivating envelope protein E fusion peptide. prM is the only viral peptide matured by host furin in the trans-Golgi network probably to avoid catastrophic activation of the viral fusion activity in acidic Golgi compartment prior to virion release. {ECO:0000250|UniProtKB:P17763}.; FUNCTION: [Small envelope protein M]: May play a role in virus budding. Exerts cytotoxic effects by activating a mitochondrial apoptotic pathway through M ectodomain. May display a viroporin activity. {ECO:0000250|UniProtKB:P17763}.; FUNCTION: [Envelope protein E]: Binds to host cell surface receptors and mediates fusion between viral and cellular membranes. Efficient virus attachment to cell is, at least in part, mediated by host HAVCR1 in a cell-type specific manner (By similarity). In addition, host NCAM1 can also be used as entry receptor (By similarity). Interaction with host HSPA5 plays an important role in the early stages of infection as well (By similarity). Envelope protein is synthesized in the endoplasmic reticulum and forms a heterodimer with protein prM. The heterodimer plays a role in virion budding in the ER, and the newly formed immature particle is covered with 60 spikes composed of heterodimers between precursor prM and envelope protein E. The virion is transported to the Golgi apparatus where the low pH causes the dissociation of PrM-E heterodimers and formation of E homodimers. PrM-E cleavage is inefficient, many virions are only partially matured and immature prM-E proteins could play a role in immune evasion (By similarity). {ECO:0000250|UniProtKB:A0A142I5B9, ECO:0000250|UniProtKB:P17763}.; FUNCTION: [Non-structural protein 1]: Plays a role in the inhibition of host RLR-induced interferon-beta activation by targeting TANK-binding kinase 1/TBK1. In addition, recruits the host deubiquitinase USP8 to cleave 'Lys-11'-linked polyubiquitin chains from caspase-1/CASP1 thus inhibiting its proteasomal degradation. In turn, stabilized CASP1 promotes cleavage of cGAS, which inhibits its ability to recognize mitochondrial DNA release and initiate type I interferon signaling. {ECO:0000250|UniProtKB:Q32ZE1}.; FUNCTION: [Non-structural protein 2A]: Component of the viral RNA replication complex that recruits genomic RNA, the structural protein prM/E complex, and the NS2B/NS3 protease complex to the virion assembly site and orchestrates virus morphogenesis (By similarity). Antagonizes also the host MDA5-mediated induction of alpha/beta interferon antiviral response (By similarity). May disrupt adherens junction formation and thereby impair proliferation of radial cells in the host cortex (By similarity). {ECO:0000250|UniProtKB:A0A142I5B9, ECO:0000250|UniProtKB:Q32ZE1}.; FUNCTION: [Serine protease subunit NS2B]: Required cofactor for the serine protease function of NS3. {ECO:0000250|UniProtKB:Q32ZE1}.; FUNCTION: [Serine protease NS3]: Displays three enzymatic activities: serine protease, NTPase and RNA helicase. NS3 serine protease, in association with NS2B, performs its autocleavage and cleaves the polyprotein at dibasic sites in the cytoplasm: C-prM, NS2A-NS2B, NS2B-NS3, NS3-NS4A, NS4A-2K and NS4B-NS5. NS3 RNA helicase binds RNA and unwinds dsRNA in the 3' to 5' direction (By similarity). Inhibits the integrated stress response (ISR) in the infected cell by blocking stress granules assembly (PubMed:28592527). Disrupts host centrosome organization in a CEP63-dependent manner to degrade host TBK1 and inhibits innate immune response (PubMed:35793002). {ECO:0000250|UniProtKB:Q32ZE1, ECO:0000269|PubMed:28592527, ECO:0000269|PubMed:35793002}.; FUNCTION: [Non-structural protein 4A]: Regulates the ATPase activity of the NS3 helicase activity (By similarity). NS4A allows NS3 helicase to conserve energy during unwinding (By similarity). Cooperatively with NS4B suppresses the Akt-mTOR pathway and leads to cellular dysregulation (PubMed:27524440). By inhibiting host ANKLE2 functions, may cause defects in brain development, such as microcephaly (PubMed:30550790). Antagonizes also the host MDA5-mediated induction of alpha/beta interferon antiviral response (By similarity). Inhibits the integrated stress response (ISR) in the infected cell by blocking stress granules assembly (PubMed:28592527). {ECO:0000250|UniProtKB:Q32ZE1, ECO:0000250|UniProtKB:Q9Q6P4, ECO:0000269|PubMed:27524440, ECO:0000269|PubMed:28592527, ECO:0000269|PubMed:30550790}.; FUNCTION: [Peptide 2k]: Functions as a signal peptide for NS4B and is required for the interferon antagonism activity of the latter. {ECO:0000250|UniProtKB:P17763}.; FUNCTION: [Non-structural protein 4B]: Induces the formation of ER-derived membrane vesicles where the viral replication takes place (By similarity). Also plays a role in the inhibition of host RLR-induced interferon-beta production at TANK-binding kinase 1/TBK1 level (By similarity). Cooperatively with NS4A suppresses the Akt-mTOR pathway and leads to cellular dysregulation (PubMed:27524440). {ECO:0000250|UniProtKB:Q32ZE1, ECO:0000250|UniProtKB:Q9Q6P4, ECO:0000269|PubMed:27524440}.; FUNCTION: [RNA-directed RNA polymerase NS5]: Replicates the viral (+) and (-) RNA genome, and performs the capping of genomes in the cytoplasm (PubMed:30951555). Methylates viral RNA cap at guanine N-7 and ribose 2'-O positions. Once sufficient NS5 is expressed, binds to the cap-proximal structure and inhibits further translation of the viral genome (By similarity). Besides its role in RNA genome replication, also prevents the establishment of a cellular antiviral state by blocking the interferon-alpha/beta (IFN-alpha/beta) signaling pathway. Mechanistically, interferes with host kinases TBK1 and IKKE upstream of interferon regulatory factor 3/IRF3 to inhibit the RIG-I pathway (By similarity). Antagonizes also type I interferon signaling by targeting STAT2 for degradation by the proteasome thereby preventing activation of JAK-STAT signaling pathway (By similarity). Mechanistically, acts as a scaffold protein to connect host ZSWIM8/CUL3 ligase complex and STAT2, leading to STAT2 degradation. Within the host nucleus, disrupts host SUMO1 and STAT2 co-localization with PML, resulting in PML degradation (PubMed:32699085). May also reduce immune responses by preventing the recruitment of the host PAF1 complex to interferon-responsive genes (PubMed:30550790). {ECO:0000250|UniProtKB:Q32ZE1, ECO:0000269|PubMed:30550790, ECO:0000269|PubMed:30951555, ECO:0000269|PubMed:32699085}. SUBCELLULAR LOCATION: [Capsid protein C]: Virion {ECO:0000250|UniProtKB:P17763}. Host nucleus {ECO:0000269|PubMed:36594413}. Host cytoplasm {ECO:0000250|UniProtKB:P06935}. Host cytoplasm, host perinuclear region {ECO:0000250|UniProtKB:P06935}.; SUBCELLULAR LOCATION: [Peptide pr]: Secreted {ECO:0000250|UniProtKB:P17763}.; SUBCELLULAR LOCATION: [Small envelope protein M]: Virion membrane {ECO:0000250|UniProtKB:P17763}; Multi-pass membrane protein {ECO:0000255}. Host endoplasmic reticulum membrane {ECO:0000250|UniProtKB:P17763}; Multi-pass membrane protein {ECO:0000255}.; SUBCELLULAR LOCATION: [Envelope protein E]: Virion membrane {ECO:0000250|UniProtKB:P17763}; Multi-pass membrane protein {ECO:0000255}. Host endoplasmic reticulum membrane {ECO:0000250|UniProtKB:P17763}; Multi-pass membrane protein {ECO:0000255}.; SUBCELLULAR LOCATION: [Non-structural protein 1]: Secreted {ECO:0000250|UniProtKB:P17763}. Host endoplasmic reticulum membrane {ECO:0000250|UniProtKB:Q32ZE1}; Peripheral membrane protein {ECO:0000250|UniProtKB:Q32ZE1}; Lumenal side {ECO:0000250|UniProtKB:P17763}. Note=Located in RE-derived vesicles hosting the replication complex. {ECO:0000250|UniProtKB:Q9Q6P4}.; SUBCELLULAR LOCATION: [Non-structural protein 2A]: Host endoplasmic reticulum membrane {ECO:0000250|UniProtKB:P17763}; Multi-pass membrane protein {ECO:0000250|UniProtKB:P17763}.; SUBCELLULAR LOCATION: [Serine protease NS3]: Host endoplasmic reticulum membrane {ECO:0000255|PROSITE-ProRule:PRU00860}; Peripheral membrane protein {ECO:0000255|PROSITE-ProRule:PRU00860}; Cytoplasmic side {ECO:0000255|PROSITE-ProRule:PRU00860}. Note=Remains non-covalently associated to serine protease subunit NS2B. {ECO:0000255|PROSITE-ProRule:PRU00860}.; SUBCELLULAR LOCATION: [Non-structural protein 4A]: Host endoplasmic reticulum membrane {ECO:0000250|UniProtKB:P17763}; Multi-pass membrane protein {ECO:0000250|UniProtKB:P17763}. Note=Located in RE-associated vesicles hosting the replication complex. {ECO:0000250|UniProtKB:P17763}.; SUBCELLULAR LOCATION: [Non-structural protein 4B]: Host endoplasmic reticulum membrane {ECO:0000250|UniProtKB:P17763}; Multi-pass membrane protein {ECO:0000250|UniProtKB:P17763}. Note=Located in RE-derived vesicles hosting the replication complex. {ECO:0000250|UniProtKB:Q9Q6P4}.; SUBCELLULAR LOCATION: [RNA-directed RNA polymerase NS5]: Host endoplasmic reticulum membrane {ECO:0000250|UniProtKB:Q32ZE1}; Peripheral membrane protein {ECO:0000250|UniProtKB:Q32ZE1}; Cytoplasmic side {ECO:0000250|UniProtKB:Q32ZE1}. Host nucleus {ECO:0000269|PubMed:32699085}. Note=Located in RE-associated vesicles hosting the replication complex. NS5 protein is mainly localized in the nucleus rather than in ER vesicles. {ECO:0000250|UniProtKB:P17763}. SIMILARITY: In the N-terminal section; belongs to the class I-like SAM-binding methyltransferase superfamily. mRNA cap 0-1 NS5-type methyltransferase family. {ECO:0000255|PROSITE-ProRule:PRU00924}."}
{"seq": "MYRKLAVISAFLATARAQSACTLQSETHPPLTWQKCSSGGTCTQQTGSVVIDANWRWTHATNSSTNCYDGNTWSSTLCPDNETCAKNCCLDGAAYASTYGVTTSGNSLSIGFVTQSAQKNVGARLYLMASDTTYQEFTLLGNEFSFDVDVSQLPCGLNGALYFVSMDADGGVSKYPTNTAGAKYGTGYCDSQCPRDLKFINGQANVEGWEPSSNNANTGIGGHGSCCSEMDIWEANSISEALTPHPCTTVGQEICEGDGCGGTYSDNRYGGTCDPDGCDWNPYRLGNTSFYGPGSSFTLDTTKKLTVVTQFETSGAINRYYVQNGVTFQQPNAELGSYSGNELNDDYCTAEEAEFGGSSFSDKGGLTQFKKATSGGMVLVMSLWDDYYANMLWLDSTYPTNETSSTPGAVRGSCSTSSGVPAQVESQSPNAKVTFSNIKFGPIGSTGNPSGGNPPGGNPPGTTTTRRPATTTGSSPGPTQSHYGQCGGIGYSGPTVCASGTTCQVLNPYYSQCL", "text": "PROTEIN NAME: Exoglucanase 1 (EC 3.2.1.91) (1,4-beta-cellobiohydrolase) (Cellobiohydrolase 7A) (Cel7A) (Exocellobiohydrolase I) (CBHI) (Exoglucanase I). FUNCTION: Exocellobiohydrolases (CBH) that catalyzes the hydrolysis of 1,4-beta-D-glucosidic bonds in cellulose to release the disaccharide cellobiose. The degradation of cellulose involves an interplay between different cellulolytic enzymes. Hydrolysis starts with endoglucanases (EGs), which cut internal beta-1,4-glucosidic bonds in cellulose to reduce the polymerization degree of the substrate and create new chain ends for exocellobiohydrolases (CBHs). The CBHs release the disaccharide cellobiose from the non-reducing end of the cellulose polymer chain. Finally, beta-1,4-glucosidases hydrolyze the cellobiose and other short cello-oligosaccharides into glucose units. {ECO:0000250|UniProtKB:P62694}. SUBCELLULAR LOCATION: Secreted {ECO:0000250|UniProtKB:P62694}. SIMILARITY: Belongs to the glycosyl hydrolase 7 (cellulase C) family. {ECO:0000305}."}
{"seq": "MRSLAILTTLLAGHAFAYPKPAPQSVNRRDWPSINEFLSELAKVMPIGDTITAACDLISDGEDAAASLFGISETENDPCGDVTVLFARGTCDPGNVGVLVGPWFFDSLQTALGSRTLGVKGVPYPASVQDFLSGSVQNGINMANQIKSVLQSCPNTKLVLGGYSQGSMVVHNAASNLDAATMSKISAVVLFGDPYYGKPVANFDAAKTLVVCHDGDNICQGGDIILLPHLTYAEDADTAAAFVVPLVS", "text": "PROTEIN NAME: Cutinase (EC 3.1.1.74). FUNCTION: Catalyzes the hydrolysis of complex carboxylic polyesters found in the cell wall of plants (PubMed:25219509). Degrades cutin, a macromolecule that forms the structure of the plant cuticle (PubMed:25219509). {ECO:0000269|PubMed:25219509}. SUBCELLULAR LOCATION: Secreted {ECO:0000255|RuleBase:RU361263}. SIMILARITY: Belongs to the cutinase family. {ECO:0000255|RuleBase:RU361263}."}
{"seq": "MIVGILTTLATLATLAASVPLEERQACSSVWGQCGGQNWSGPTCCASGSTCVYSNDYYSQCLPGAASSSSSTRAASTTSRVSPTTSRSSSATPPPGSTTTRVPPVGSGTATYSGNPFVGVTPWANAYYASEVSSLAIPSLTGAMATAAAAVAKVPSFMWLDTLDKTPLMEQTLADIRTANKNGGNYAGQFVVYDLPDRDCAALASNGEYSIADGGVAKYKNYIDTIRQIVVEYSDIRTLLVIEPDSLANLVTNLGTPKCANAQSAYLECINYAVTQLNLPNVAMYLDAGHAGWLGWPANQDPAAQLFANVYKNASSPRALRGLATNVANYNGWNITSPPSYTQGNAVYNEKLYIHAIGPLLANHGWSNAFFITDQGRSGKQPTGQQQWGDWCNVIGTGFGIRPSANTGDSLLDSFVWVKPGGECDGTSDSSAPRFDSHCALPDALQPAPQAGAWFQAYFVQLLTNANPSFL", "text": "PROTEIN NAME: Exoglucanase 2 (EC 3.2.1.91) (1,4-beta-cellobiohydrolase) (Cellobiohydrolase 6A) (Cel6A) (Exocellobiohydrolase II) (CBHII) (Exoglucanase II). FUNCTION: Exocellobiohydrolases (CBH) that catalyzes the hydrolysis of 1,4-beta-D-glucosidic bonds in cellulose to release the disaccharide cellobiose. The degradation of cellulose involves an interplay between different cellulolytic enzymes. Hydrolysis starts with endoglucanases (EGs), which cut internal beta-1,4-glucosidic bonds in cellulose to reduce the polymerization degree of the substrate and create new chain ends for exocellobiohydrolases (CBHs). The CBHs release the disaccharide cellobiose from the non-reducing end of the cellulose polymer chain. Finally, beta-1,4-glucosidases hydrolyze the cellobiose and other short cello-oligosaccharides into glucose units. {ECO:0000250|UniProtKB:P07987}. SUBCELLULAR LOCATION: Secreted {ECO:0000250|UniProtKB:P07987}. SIMILARITY: Belongs to the glycosyl hydrolase 6 (cellulase B) family. {ECO:0000305}."}
{"seq": "MMKMKQQGLVADLLPNIRVMKTFGHFVFNYYNDNSSKYLHKVYCCVNLFMLLLQFGLCAVNLIVESADVDDLTANTITLLFFTHSIVKICYFAIRSKYFYRTWAIWNNPNSHPLFAESNARYHAIALKKMRLLLFLVGGTTMLAAVAWTVLTFFEHPIRKIVDPVTNETEIIELPQLLIRSFYPFDAGKGITHVLVLVYQFYWVLFMLIDANSLDVLFCSWLLFACEQLQHLKQIMKPLMELSATLDTVVPNSSELFKAGSADHLRDGDNPPPPPPPQSDNMLDLDLRNIYSNRQDFTATFRPTAGMTFNGGVGPNGLTKKQEALVRSAIKYWVERHKHIVRLVTAVGDAYGFALLLHMLTTTITLTLLAYQATKVNGINVYAASTIGYILYTFGQVFLFCIFGNRLIEESTSVMEAAYSCHWYDGSEEAKTFVQIVCQQCQKAMSISGAKFFTVSLDLFASVLGAVVTYFMVLVQLK", "text": "PROTEIN NAME: Odorant receptor coreceptor. FUNCTION: Odorant coreceptor which complexes with conventional odorant receptors (ORs) to form odorant-sensing units, providing sensitive and prolonged odorant signaling and calcium permeability (By similarity). Obligate coreceptor of all odorant receptors (By similarity). Orco is a universal and integral part of the functional odorant receptor, involved in the dendritic localization of other olfactory receptors. Can form functional ion channels in the absence of an odor-binding odorant receptor (By similarity). Plays a central role in the perception of olfactory stimuli in ants and is essential for ant social organization (PubMed:28802042). Required for pheromone sensing (PubMed:28802042). Also required for the development and maintenance of odorant receptor neurons (ORNs) and of antennal lobe glomeruli (PubMed:28802042). {ECO:0000250|UniProtKB:Q7QCC7, ECO:0000269|PubMed:28802042}. SUBCELLULAR LOCATION: Cell membrane {ECO:0000305}; Multi-pass membrane protein {ECO:0000255}. SIMILARITY: Belongs to the insect chemoreceptor superfamily. Heteromeric odorant receptor channel (TC 1.A.69) family. Orco subfamily. {ECO:0000305}."}
{"seq": "MYWQLVRILVLFDCLQKILAIEHDSICIADVDDACPEPSHTVMRLRERNDKKAHLIAKQHGLEIRGQPFLDGKSYFVTHISKQRSRRRKREIISRLQEHPDILSIEEQRPRVRRKRDFLYPDIAHELAGSSTNIRHTGLISNTEPRIDFIQHDAPVLPFPDPLYKEQWYLNNGAQGGFDMNVQAAWLLGYAGRNISVSILDDGIQRDHPDLAANYDPLASTDINGHDDDPTPQDDGDNKHGTRCAGEVASIAGNVYCGVGVAFHAKIGGVRMLDGPVSDSVEAASLSLNRHHIDIYSASWGPEDDGRTFDGPGPLAREAFYRGVKAGRGGKGSIFVWASGNGGSRQDSCSADGYTTSVYTLSVSSATIDNRSPWYLEECPSTIATTYSSANMNQPAIITVDVPHGCTRSHTGTSASAPLAAGIIALALEANPNLTWRDMQHIVLRTANPVPLLNNPGWSVNGVGRRINNKFGYGLMDAGALVKLALIWKTVPEQHICTYDYKLEKPNPRPITGNFQMNFSLEVNGCESGTPVLYLEHVQVLATFRFGKRGDLKLTLFSPRGTSSVLLPPRPQDFNSNGIHKWPFLSVQTWGEDPRGKWTLMVESVSTNRNVGGTFHDWSLLLYGTAEPAQPNDPRHSSVVPSSVSAESPFDRITQHIASQEKKKKQRDSRDWQPKKVENKKSLLVSAQPELRV", "text": "PROTEIN NAME: Endoprotease bli (EC 3.4.21.75) (Blisterase). FUNCTION: Serine endoprotease which cleaves substrates at the RX(K/R)R consensus motif. {ECO:0000269|PubMed:12855702}. SUBCELLULAR LOCATION: Secreted {ECO:0000305|PubMed:12855702}. SIMILARITY: Belongs to the peptidase S8 family. Furin subfamily. {ECO:0000305}."}
{"seq": "MRSVSGQVVCVTGAGGFIASWLVKILLEKGYTVRGTVRNPDDPKNGHLRELEGAKERLTLCKADLLDYQSLREAINGCDGVFHTASPVTDDPEQMVEPAVIGTKNVINAAAEANVRRVVFTSSIGAVYMDPNRDPETVVDETCWSDPDFCKNTKNWYCYGKMVAEQAAWEEAKEKGVDLVVINPVLVQGPLLQTTVNASVLHILKYLTGSAKTYANSVQAYVDVKDVALAHILLYETPEASGRYLCAESVLHRGDVVEILSKFFPEYPIPTKCSDVTKPRVKPYKFSNQKLKDLGLEFTPVKQCLYETVKSLQEKGHLPIPTQKDEPIIRIQP", "text": "PROTEIN NAME: Cinnamoyl-CoA reductase 1 (Ph-CCR1) (EC 1.2.1.44) (Coniferylaldehyde synthase) (Coumaroyl-CoA reductase) (Feruloyl-CoA reductase) (Sinapoyl-CoA reductase). FUNCTION: Involved in the latter stages of lignin biosynthesis (PubMed:24985707). Catalyzes one of the last steps of monolignol biosynthesis, the conversion of cinnamoyl-CoAs into their corresponding cinnamaldehydes (PubMed:24985707, PubMed:25217505). Mediates the conversion of feruloyl CoA to coniferylaldehyde (PubMed:24985707, PubMed:25217505). Also active toward p-coumaroyl-CoA and sinapoyl-CoA (PubMed:24985707, PubMed:25217505). Involved in the production of floral volatile phenylpropanoids in flowers of fragrant cultivars (e.g. cv. Mitchell and cv. V26) from cinnamic acid, a common precursor with the anthocyanin biosynthesis pathway involved in flower pigmentation (PubMed:24985707). {ECO:0000269|PubMed:24985707, ECO:0000269|PubMed:25217505}. SUBCELLULAR LOCATION: Cytoplasm {ECO:0000269|PubMed:24985707}. SIMILARITY: Belongs to the NAD(P)-dependent epimerase/dehydratase family. Dihydroflavonol-4-reductase subfamily. {ECO:0000305}."}
{"seq": "MKFLAFGLIYFHFCILNRCEYITSSTIQKCYNSSNEPNNCSQKAVIVLSLENGQIANTEQVVATLNQLSDSGVNKQLQNSFIFEVTKSPVTALFPLIYLQDFNSQPLEQVIATTLFSCKDGFYDSSPTCKFQYDSKGQKILDSQGYCCYCSLSDILGMGNDLSRGKVCYALNLGAGSATAHCLKFSPLWYSAFKIQQYQLYFEVNINIYTVDSQNQKNLKQTLKLSTSNPTMKSSDNSTISKIIGTFTPTQPPADLSSYYLVKPSFPATDPRVLQGISSWMFVDKTMFTLDGTQCNKIGVSYSGFRQQSSSCSQPVGSCLQNQLENLYQSDLILLSQNKQPKYLLESQGNFNQVQFQGQTILQQGLSGSASTLITIEIDAAQIKFVTNLGIGCISQCSINNFESHSGNGKLVALVQNQGNYSAEFVLGFNCSSNVQPIQGQKLFLTANQLYNFNCSVSVNSDISAINNNCTINLYDAIGNQLDSKNILFNTTSTNHTSNQGNNTGQQQSSQEYKSSQSCSDKCSSFWSFWCYFSAGCIKEAFKSIASIAGVASALALVIFLAKNGYLVPIIRFLCCCCCKSKKKENEKNKDKTDKKSIQESCSYDRSCCSHSISQSYQVENKNKYKRSKIQRSFSSESCQDKSKKIINELSNLEETFEANKLYANIDKNSSIFEYFGFKKSFTFILYERNDILFLPQNSTILDMIGALQPQKGSYLAQKFLEIVNKNALKVVSTSPLYLLIE", "text": "PROTEIN NAME: Hapless 2 (Generative cell specific-1). FUNCTION: During fertilization, required for the formation of intercellular membrane pores and subsequent exchange of gametic pronuclei between cells. Probably initiates the formation of intercellular membrane pores by inserting part of its extracellular domain into the cell membrane of the adjoining cell in the mating pair. Mating requires the presence of HAP2 on at least one of the two cells. Mating efficiency is high when HAP2 is present on both cells, and is strongly reduced when HAP2 is present on only one of the two cells. {ECO:0000269|PubMed:25155508, ECO:0000269|PubMed:28238660}. SUBCELLULAR LOCATION: Cell membrane {ECO:0000305|PubMed:25155508, ECO:0000305|PubMed:28238660}; Single-pass type I membrane protein {ECO:0000255}. Cell junction {ECO:0000269|PubMed:25155508, ECO:0000269|PubMed:28238660}. Note=Detected at the mating junction. {ECO:0000269|PubMed:25155508, ECO:0000269|PubMed:28238660}. SIMILARITY: Belongs to the HAP2/GCS1 family. {ECO:0000305}."}