Mechanisms through which tissues are formed and maintained remain unknown but are fundamental aspects in biology. Tissue-specific gene expression is a valuable tool to study such mechanisms. But in many biomedical studies, cell lines, rather than human body tissues, are used to investigate biological mechanisms Whether or not cell lines maintain their tissue-specific characteristics after they are isolated and cultured outside the human body remains to be explored. In this study, we applied a novel computational method to identify core genes that contribute to the differentiation of cell lines from various tissues. Several advanced computational techniques, such as Monte Carlo feature selection method, incremental feature selection method, and support vector machine (SVM) algorithm, were incorporated in the proposed method, which extensively analyzed the gene expression profiles of cell lines from different tissues. As a result, we extracted a group of functional genes that can indicate the differences of cell lines in different tissues and built an optimal SVM classifier for identifying cell lines in different tissues. In addition, a set of rules for classifying cell lines were also reported, which can give a clearer picture of cell lines in different issues although its performance was not better than the optimal SVM classifier. Finally, we compared such genes with the tissue-specific genes identified by the Genotype-tissue Expression project. Results showed that most expression patterns between tissues remained in the derived cell lines despite some uniqueness that some genes show tissue specificity.

, , , ,
doi.org/10.1002/jcb.27977, hdl.handle.net/1765/111924
Journal of Cellular Biochemistry
Department of Medical Informatics

Chen, L. (Lei), Pan, X., Zhang, Y.-H. (Yu-Hang), Kong, X. (Xiangyin), Huang, T. (Tao), & Cai, Y.-D. (Yu-Dong). (2018). Tissue differences revealed by gene expression profiles of various cell lines. Journal of Cellular Biochemistry. doi:10.1002/jcb.27977