Predicting protein stability change upon double mutation from partial sequence information using data mining approach

Lien Fu Lai, Chao Chin Wu, Liang Tsung Huang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

The prediction of stability change for protein mutants is one of the important issues in protein design. Recently, the prediction upon double mutation has attracted more and more attention. In this work, we have employed a data mining approach to discriminating stability change for protein double mutants. We incorporated a reliable rule induction algorithm along with accuracy of 82.2% to construct rule-based knowledge patterns. Further, a fuzzy query method was utilized to value important and similar rule patterns for an input with partial sequence information. The results showed that the approach has two major advantages: (i) A rule-based knowledge representation offers intuitive interpretation on raw data, which is helpful to understand the content; and (ii) A fuzzy query method incorporates the concept of uncertainty, which can make predictions from partial information. Based on the proposed approach, we have also developed a web service for predicting protein stability change upon double mutation from partial sequence information and it is available at http://bioinformatics.myweb.hinet.net/tandem.htm.

Original languageEnglish
Title of host publicationAdvanced Intelligent Computing Theories and Applications - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings
Pages664-671
Number of pages8
DOIs
Publication statusPublished - 2010 Oct 29
Event6th International Conference on Intelligent Computing, ICIC 2010 - Changsha, China
Duration: 2010 Aug 182010 Aug 21

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6215 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other6th International Conference on Intelligent Computing, ICIC 2010
CountryChina
CityChangsha
Period10-08-1810-08-21

Fingerprint

Data mining
Data Mining
Mutation
Fuzzy Query
Proteins
Protein
Partial
Mutant
Prediction
Rule Induction
Partial Information
Knowledge representation
Bioinformatics
Knowledge Representation
Web services
Web Services
Intuitive
Uncertainty

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Lai, L. F., Wu, C. C., & Huang, L. T. (2010). Predicting protein stability change upon double mutation from partial sequence information using data mining approach. In Advanced Intelligent Computing Theories and Applications - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings (pp. 664-671). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6215 LNCS). https://doi.org/10.1007/978-3-642-14922-1_83
Lai, Lien Fu ; Wu, Chao Chin ; Huang, Liang Tsung. / Predicting protein stability change upon double mutation from partial sequence information using data mining approach. Advanced Intelligent Computing Theories and Applications - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings. 2010. pp. 664-671 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{1296939859ad402ea1cfefba19489ee6,
title = "Predicting protein stability change upon double mutation from partial sequence information using data mining approach",
abstract = "The prediction of stability change for protein mutants is one of the important issues in protein design. Recently, the prediction upon double mutation has attracted more and more attention. In this work, we have employed a data mining approach to discriminating stability change for protein double mutants. We incorporated a reliable rule induction algorithm along with accuracy of 82.2{\%} to construct rule-based knowledge patterns. Further, a fuzzy query method was utilized to value important and similar rule patterns for an input with partial sequence information. The results showed that the approach has two major advantages: (i) A rule-based knowledge representation offers intuitive interpretation on raw data, which is helpful to understand the content; and (ii) A fuzzy query method incorporates the concept of uncertainty, which can make predictions from partial information. Based on the proposed approach, we have also developed a web service for predicting protein stability change upon double mutation from partial sequence information and it is available at http://bioinformatics.myweb.hinet.net/tandem.htm.",
author = "Lai, {Lien Fu} and Wu, {Chao Chin} and Huang, {Liang Tsung}",
year = "2010",
month = "10",
day = "29",
doi = "10.1007/978-3-642-14922-1_83",
language = "English",
isbn = "3642149219",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "664--671",
booktitle = "Advanced Intelligent Computing Theories and Applications - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings",

}

Lai, LF, Wu, CC & Huang, LT 2010, Predicting protein stability change upon double mutation from partial sequence information using data mining approach. in Advanced Intelligent Computing Theories and Applications - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 6215 LNCS, pp. 664-671, 6th International Conference on Intelligent Computing, ICIC 2010, Changsha, China, 10-08-18. https://doi.org/10.1007/978-3-642-14922-1_83

Predicting protein stability change upon double mutation from partial sequence information using data mining approach. / Lai, Lien Fu; Wu, Chao Chin; Huang, Liang Tsung.

Advanced Intelligent Computing Theories and Applications - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings. 2010. p. 664-671 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6215 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Predicting protein stability change upon double mutation from partial sequence information using data mining approach

AU - Lai, Lien Fu

AU - Wu, Chao Chin

AU - Huang, Liang Tsung

PY - 2010/10/29

Y1 - 2010/10/29

N2 - The prediction of stability change for protein mutants is one of the important issues in protein design. Recently, the prediction upon double mutation has attracted more and more attention. In this work, we have employed a data mining approach to discriminating stability change for protein double mutants. We incorporated a reliable rule induction algorithm along with accuracy of 82.2% to construct rule-based knowledge patterns. Further, a fuzzy query method was utilized to value important and similar rule patterns for an input with partial sequence information. The results showed that the approach has two major advantages: (i) A rule-based knowledge representation offers intuitive interpretation on raw data, which is helpful to understand the content; and (ii) A fuzzy query method incorporates the concept of uncertainty, which can make predictions from partial information. Based on the proposed approach, we have also developed a web service for predicting protein stability change upon double mutation from partial sequence information and it is available at http://bioinformatics.myweb.hinet.net/tandem.htm.

AB - The prediction of stability change for protein mutants is one of the important issues in protein design. Recently, the prediction upon double mutation has attracted more and more attention. In this work, we have employed a data mining approach to discriminating stability change for protein double mutants. We incorporated a reliable rule induction algorithm along with accuracy of 82.2% to construct rule-based knowledge patterns. Further, a fuzzy query method was utilized to value important and similar rule patterns for an input with partial sequence information. The results showed that the approach has two major advantages: (i) A rule-based knowledge representation offers intuitive interpretation on raw data, which is helpful to understand the content; and (ii) A fuzzy query method incorporates the concept of uncertainty, which can make predictions from partial information. Based on the proposed approach, we have also developed a web service for predicting protein stability change upon double mutation from partial sequence information and it is available at http://bioinformatics.myweb.hinet.net/tandem.htm.

UR - http://www.scopus.com/inward/record.url?scp=77958501180&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77958501180&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-14922-1_83

DO - 10.1007/978-3-642-14922-1_83

M3 - Conference contribution

AN - SCOPUS:77958501180

SN - 3642149219

SN - 9783642149214

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 664

EP - 671

BT - Advanced Intelligent Computing Theories and Applications - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings

ER -

Lai LF, Wu CC, Huang LT. Predicting protein stability change upon double mutation from partial sequence information using data mining approach. In Advanced Intelligent Computing Theories and Applications - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings. 2010. p. 664-671. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-642-14922-1_83