Novel key gene tenascin C related to extracellular matrix accumulation in diabetic nephropathy kidney tubules: Results of integrative bioinformatics analysis

Liang Zhang; Zheng Wang; Manrong He; Yongdi Zuo; Jun Li; Wanxin Tang; Liang Zhang; Zheng Wang; Manrong He; Yongdi Zuo; Jun Li; Wanxin Tang

doi:10.17352/2455-8583.000047

ISSN: 2455-8583

Global Journal of Obesity, Diabetes and Metabolic Syndrome

Research Article Open Access Peer-Reviewed

Novel key gene tenascin C related to extracellular matrix accumulation in diabetic nephropathy kidney tubules: Results of integrative bioinformatics analysis

Liang Zhang¹, Zheng Wang¹, Manrong He¹, Yongdi Zuo¹, Jun Li^2,3 and Wanxin Tang^1*

Author and article information

¹Department of Nephrology, West China Hospital, Sichuan University, Chengdu, Sichuan, China
²The Center of Gerontology and Geriatrics, West China Hospital, Sichuan University, Chengdu, Sichuan, China
³National Clinical Research Center of Geriatrics, West China Hospital, Sichuan University, Chengdu, Sichuan, China

*Corresponding authors: Wanxin Tang, MD, Department of Nephrology, West China Hospital, Sichuan University, No.37, Guoxue alley, Chengdu, Sichuan, China, E-mail: [email protected]

DOI: 10.17352/2455-8583.000047

Received: 11 August, 2020 | Accepted: 24 August, 2020 | Published: 25 August, 2020

Keywords: Bioinformatics analysis; Diabetic nephropathy; Tenascin C; Extracellular matrix

Cite this as

Zhang L, Wang Z, He M, Zuo Y, Tang W, et al. (2020) Novel key gene tenascin C related to extracellular matrix accumulation in diabetic nephropathy kidney tubules: Results of integrative bioinformatics analysis. Glob J Obes Diabetes Metab Syndr 7(2): 042-052. DOI: 10.17352/2455-8583.000047

Abstract

Background: As the incidence of type 2 diabetes increases year by year, the number of individuals diagnosed with Diabetic Nephropathy (DN) has increased steeply. DN is characterized by glomerular sclerosis, tubulointerstitial fibrosis and atrophy. However, most of the previous studies on the pathogenesis of DN were focused on glomeruli, and now more and more evidences show that tubulointerstitial fibrosis plays an important role in the progress of DN. Bioinformatics analysis can be used in discovering disease-causing genes, biomarkers, and therapeutic targets through global analysis. In this study, we used this method to find and verify novel genes in DN.

Materials and methods: Microarray expression levels were downloaded from Gene Expression Omnibus datasets. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were conducted after Differentially Expressed Genes (DEGs) were identified. Hub genes were filtered on the basis of the result of GO enrichment, and functional analysis was performed by browsing the GeneCards website and the latest literatures. The expression levels of all the hub genes in HK2 cells stimulated by high glucose were verified, and TeNascin C (TNC), which is important and interesting, was verified through animal experiment.

Result: The relative expression levels of 54 samples were obtained, and 382 DEGs were identified. 286 GO terms and 100 KEGG pathways were enriched remarkably. Nine of the key genes of interest: MMP7, TNC, LUM, LTF, IGLC1, LYZ, CXCL6, CYP27B1 and FOS were selected and verified in HK2 cells stimulated by high glucose. quantitative Real-Time Polymerase Chain Reaction (qRT-PCR) indicated that the mRNA levels of most of the hub genes were up-regulated and had an accordance rate of approximately 66.7%. In addition, the expression of TNC in renal tubular tissue of STZ-induced diabetic nephropathy rat gradually increased with time.

Conclusion: Several novel key genes were discovered and verified in this study, and TNC plays an important role in the renal fibrosis of DN and is expected to be a new therapeutic target. This study provides a theoretical basis and data resources for further research.

Main article text

Abbreviations

DN: Diabetic Nephropathy; ESRD: End-Stage Renal Disease; ECM: ExtraCellular Matrix; TGF-β: Transforming Growth Factor Beta; MMP9: Matrix MetalloPeptidase 9; APC: Adenomatous Polyposis Coli; GEO: Gene Expression Omnibus; DEGs: Differentially Expressed Genes; GO: Gene Ontology; KEGG: Kyoto Encyclopedia of Genes and Genomes; NCBI: National Center for Biotechnology Information; Limma: Liner models for microarray data; FC: Fold Change; CC: Cellular Component; MF: Molecular Function; BP: Biological Process; HK2: Human Kidney 2; qRT-PCR: quantitative Real-Time Polymerase Chain Reaction; CT: Cycle Threshold; PBS: Phosphate-Buffered Saline; MMP7: Matrix MetalloPeptidase 7; TNC: TeNascin C; LUM: LUMican; LTF: LacTotransFerrin; IGLC1: ImmunoGlobulin Lambda Constant 1; LYZ: LYsoZyme; (CXCL6): Chemokine (C-X-C motif) Ligand 6; CYP27B1; CYtochrome P450 family 27 subfamily B member 1; ALB: ALBumin; C3: Complement component 3; IOD: Integrated Optical Density

Background

The incidence of diabetes, especially type 2 diabetes continues to increase globally. In 2015, approximately 415 million people have diabetes, and It is estimated that the number of diabetes will rise to 642 million by 2040 [1]. As we all know, diabetes (both type 1 and type 2) can cause various complications such as diabetic retinopathy, diabetic peripheral vascular disease, and diabetic autonomic neuropathy. What`s more, Diabetic Nephropathy (DN) is also one of the main complications of diabetes, and it is a major microvascular complication caused by uncontrolled high blood glucose for a long time. DN is characterized by proteinuria and progressive reduction in kidney function. The number of individuals diagnosed with DN has increased sharply owing to the high incidence rate of type 2 diabetes [2-4]. About 10% of deaths in type 2 diabetes patients were attributed to renal failure, and in the United States End-Stage Renal Disease (ESRD) caused by diabetes accounts for as high as 44% [4], even else, in some western countries, DN has become the most common cause of ESRD [5,6].

However, the pathogenesis of DN is extremely complex and has not been fully understood as yet. DN is characterized by glomerulosclerosis, tubulointerstitial fibrosis, and atrophy [7] and most of previous studies on the mechanism of DN are focused on glomeruli. Growing evidence reveals that the tubulointerstitial fibrosis plays an important role in the progression of DN [8], and the accumulation of ExtraCellular Matrix (ECM) is one of the most important mechanisms of tubulointerstitial fibrosis [9].

Current studies suggest that Transforming Growth Factor beta (TGF-β) is involved in the progression of tubulointerstitial fibrosis in DN [10,11]. Matrix MetalloPeptidase 9 (MMP9) [12-14], and Adenomatous Polyposis Coli (APC) [15] et al. may also play a role in the progression of tubulointerstitial fibrosis in DN. However, these scattered studies cannot provide a global impression for DN, because of the complexity of DN`s mechanism. In addition, numerous genes and targets have not been discovered.

Bioinformatics analysis surfaces with the emergence of massive gene data, and is a powerful methodology that can be used in discovering disease-causing genes, biomarkers, and therapeutic targets through global analysis. For example, using bioinformatics analysis methods, Gurudeeban Selvaraj, et al. [16] identified that mitogen-activated protein kinase 1 and aurora kinase are the hub nodes and identified candidate drugs in lung adenocarcinoma. However, the application of bioinformatics analysis methods in kidney diseases has just started. Cui Y, et al. [17] identified new biomarkers and therapeutic targets for immunoglobulin A nephropathy and hypertensive nephropathy. To date, corresponding studies in the field of DN, particularly of the kidney tubule, remain scarce.

In this study, we integrated the data of several different platforms about kidney tubule in DN from the Gene Expression Omnibus (GEO) dataset through bioinformatics methods and identified Differentially Expressed Genes (DEGs). We then conducted Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses and filtered hub genes through the network between the GO terms and DEGs. Finally, we performed in vivo and in vitro experiments to verify the novel hub genes.

Materials and Methods

Microarray data

The microarray expression data of GSE30529, GSE99325, GSE35487, and GSE37455 [18-21] were downloaded from GEO datasets (http://www.ncbi.nlm.nih.gov/geo/) in National Center for Biotechnology Information (NCBI). GSE30529 was obtained from the GPL571 platform: [HG-U133A_2], GSE99325 was obtained from the GPL19109 platform: [HG-U133_Plus_2] and GPL19184 platform: [HG-U133A],GSE35487 was obtained from the GPL96 platform: [HG-U133A], and GSE37455 was obtained from the GPL11670 platform: [Hs133P_Hs_ENTREZG.cdf] and GPL14663 platform: [Affy_HGU133A_CDF_ENTREZG_10]. Fifty-four samples, consisting of 28 diabetic human kidney tubule samples and 26 control human kidney tubule samples, were acquired for further gene profile analysis.

Data preprocessing

The raw data of GSE30529, GSE99325, GSE35487, and GSE37455 in CEL files were converted into a probe expression matrix by the R Affy package [22]. Background correction, normalization, and log-transformation were subsequently conducted by the ComBat function and Bayesian analysis of the surrogate variable analysis package in R [23] for the removal of the interval differences.

Identification of DEGs

Liner models for microarray data (Limma) in R [24] was used in identifying DEGs. Adjusted P values of <0.05 and log FC of ≥1 were set as the cutoff criteria, and the results were illustrated with a volcano plot. Bidirectional hierarchical clustering analysis was performed on the first 30 up-regulated and 30 down-regulated DEGs with the pheatmap package (version 1.0.8) in R [25] for the visualization of the differences between the DEGs of the DN and normal samples.

GO enrichment analysis

GO analysis [26] is a powerful bioinformatics tool for annotating the biological functions behind a large list of genes. GO mainly contains three categories: Cellular Component (CC), Molecular Function (MF), and Biological Process (BP). GO analysis was performed on the basis of the DEGs by using the gseGO function in clusterProfiler (version 3.5) [27], and the adjusted P value of < 0.05 was set as the cutoff criterion. The top 10 GO terms of each category ranked by an adjusted P value were visualized through bubble plot by the GO plot package in R [28]. The connections between the most significant GO terms in BP and participating genes were also visualized using the UpSet plot package in R [29].

KEGG pathway enrichment analysis

KEGG [30] is a pathway-related database that provides the information about genomes, biological pathways, diseases, and chemical substances. The gseKEGG function in clusterProfiler was used in the enrichment analysis of KEGG, which was performed on the basis of all the genes obtained from the primary data, and the analysis results were visualized with dot plot by the enrichplot package in R.

Hub gene analysis

The network between DEGs and the top five GO terms in each category drawn by GOplot package in R can reveal the connection between genes and GO terms and the fold change of each gene expressed by color depth. The DEGs that had considerably large fold change and considerably small P value and were enriched in the top GO terms were considered hub genes. Therefore, the genes that were considered hub genes were evident from the network. Only the top DEGs from the top five GO terms in BP were selected and further subjected to functional analysis because of the importance of BP in disease and our limited energy. The other DEGs were also valuable. We browsed the detailed introductions and functions of each hub gene in the GeneCards website (https://www.genecards.org), which is a comprehensive and authoritative compendium of annotative information about human genes [31]. We then reviewed the latest literature to determine whether these genes are involved in the mechanism of DN. Furthermore, the DEGs were validated by subsequent in vitro and in vivo experiments.

Cell culture

Human renal tubular epithelial cells (Human Kidney 2, HK2) were bought from the American Type Culture Collection (Manassas, VA, USA) and cultured in Dulbecco’s modified Eagle medium/F12 (Hyclone, NY, USA) with additional 10% fetal bovine serum (Hyclone, NY, USA) in a humidified incubator at 37 °C under 5% CO2. All of the cells were cultured in a complete medium with the same glucose concentration at 80% confluence after synchronization. The cells were lysed at 0, 12, 24, and 48 h for subsequent experiments.

qRT-PCR(quantitative Real-Time Polymerase Chain Reaction)

Cells or grand tissues were lysed with TRIzol-LS (Ambion), and total mRNA was prepared with RNeasy Plus Mini Kit (BioTeke RP1202) according to the manufacturer’s instructions. Genome removal reaction was performed using DNA Eraser and PrimeScript RT Reagent Kit(RR047A) was used for the acquisition of cDNA through reverse transcription. A qRT-PCR system (PIKORed 96, ThermoFisher) was subsequently used for PCR analysis for 45 cycles under the following conditions: 95 °C for 35 s,55 °C for 30 s, and 72 °C for 30 s. β-Actin was selected as internal control, and the primer sequences used for PCR are listed in Table 1. The Thermo Scientific PikoReal software (Thermo) was used in analyzing the Cycle Threshold (CT) value of each test sample in the PCR process. We calculated the relative mRNA expression level through 2-△CT, representing the expression fold of genes in comparison with β-actin.

Animal model construction

Male Sprague-Dawley rats (8 weeks old, 200–250 g) were purchased from the Dashuo Experimental Animal Center (Chengdu, China). All animals were fed under the same conditions for 7 days before the experiments. A total of 24 rats were injected with a single dose of streptozotocin (65 mg/kg, intraperitoneally, Sigma-Aldrich, Saint Louis, Missouri, USA), and six rats, as control (0-week), were injected with the same dose of normal saline. The diabetes model was confirmed by detecting blood glucose levels of >16.7 mmol/L from the random samples for 3 days. Then, 24 diabetes model rats were randomly and evenly divided into four groups: 4-weeks, 8-weeks, 12-weeks, and 16-weeks. Finally, all the rats were euthanized at the corresponding time, and kidney tissues were obtained. The animal studies were followed the guidelines issued by the National Institute of Health Guide for the Care and Use of Laboratory Animals. The protocol was approved by the Animal Care Committee of West China Hospital, Sichuan University, China.

Immunohistochemistry

Kidney tissue sections were deparaffinized, hydrated, and then washed with Phosphate-Buffered Saline (PBS). Endogenous peroxidase activity was quenched by incubating the sections in methanol solution containing 3% H2O2. The sections were then incubated overnight with primary antibodies at 4 °C and then washed with PBS. Secondary antibody was added and then reacted at room temperature for 30 min. The slides were washed with PBS, then incubated with diaminobenzidine tetrahydrochloride, and finally counterstained with hematoxylin. The positive region was observed under a positive fluorescent microscope (Olympus, Japan) and photographed under 400× magnification. Image-Pro Plus software (version 6.0) was used in calculating the positive region proportion.

Statistical analysis

Linear model fitting and Bayesian test in R (version 3.5.1) were used in identifying DEGs, Fisher`s exact test in R (version 3.5.1) was used in GO and KEGG enrichment analysis, and false discovery rate of <0.05 was considered to be statistically significant. With SPSS 22.0, t-test was performed on the final data of qRT-PCR and immunohistochemistry semi-quantitative analysis, and p value of <0.05 was considered to be statistically significant.

Results

Data preprocessing and identification of DEGs

After background correction, normalization, and log-transformation, the relative expression levels of the 54 samples were obtained (Figure 1a). The straight middle line and the uniform distribution indicated the high quality and comparability of the data in our study. On the basis of the criteria that we initially set, 382 DEGs were identified in diabetic human kidney tubule samples compared with the control. The volcanic plot of DEGs (Figure 1b) shows the criteria and results of DEGs. Most of the DEGs were up-regulated. The heatmap (Figure 1c), result of hierarchical clustering analysis displayed the gene expression levels of the top 30 DEGs in each sample. Table 2 shows the 15 up-regulated and 15 down-regulated DEGs with the largest log (FC) values. Log (FC), average expression level, adjusted P value, and expression pattern are also indicated in the table.

Table 2: 15 up-regulated and 15 down-regulated DEGs with the largest log (FC).
Genes	Log (FC)	Ave. Expr.	t	adj. P. Val	Regulation type
LTF	3.851674	8.239303	9.877539	1.87E-10	UP
CXCL6	3.582553	5.562603	10.42128	4.42E-11	UP
IGLC1	3.175743	8.846615	8.153457	1.99E-08	UP
LYZ	3.075445	7.610041	8.503614	7.12E-09	UP
NNMT	3.037901	8.221809	8.237743	1.57E-08	UP
MMP7	3.016535	9.368351	10.8129	1.66E-11	UP
HBB	2.762029	8.361878	6.56908	1.65E-06	UP
C3	2.737839	8.34637	8.841068	2.35E-09	UP
PROM1	2.671797	8.815896	9.519631	3.45E-10	UP
JCHAIN	2.439187	6.692386	6.92729	5.81E-07	UP
CLU	2.165134	9.310968	9.933482	1.87E-10	UP
TNC	2.119204	6.820108	7.541175	1.09E-07	UP
SLPI	2.070932	8.274606	6.25378	4.24E-06	UP
LUM	2.051951	8.740485	7.843996	4.49E-08	UP
WFDC2	2.048266	9.609095	9.649419	2.69E-10	UP
DUSP1	-1.6361	9.832772	-6.68297	1.21E-06	DOWN
FGF1	-1.76107	8.137033	-6.32632	3.48E-06	DOWN
EGF	-1.78691	8.003713	-7.31151	2.02E-07	DOWN
ATF3	-1.79111	7.337323	-6.17756	5.34E-06	DOWN
KLK1	-1.87585	7.592763	-9.70802	2.48E-10	DOWN
APOH	-1.89797	8.730592	-5.19884	8.21E-05	DOWN
HRG	-1.91606	8.575164	-7.80396	5.09E-08	DOWN
NDNF	-1.93427	6.950657	-7.0479	4.28E-07	DOWN
GADD45B	-1.96202	8.982207	-9.79122	2.14E-10	DOWN
FOSB	-2.08378	6.206358	-7.01905	4.46E-07	DOWN
CYP27B1	-2.122	8.481384	-5.97277	9.17E-06	DOWN
NPHS2	-2.34965	8.115874	-7.32695	1.96E-07	DOWN
G6PC	-2.35643	7.977318	-6.72197	1.07E-06	DOWN
CLIC5	-2.47136	8.20554	-7.9992	2.94E-08	DOWN
ALB	-2.61411	9.723223	-4.09657	0.001597	DOWN

Enrichment analysis

A total of 286 GO terms were found to be significantly enriched in the DEGs, including 201 in BP, 44 in CC, and 41 in MF, and the top 10 GO terms of each category ranked by adjusted P value, are shown in Figure 2a. The list of GO terms illustrated that the “extracellular structure organization,” and “ECM organization,” in BP, and the “ECM,” “proteinaceous ECM,” and “ECM component,” in CC were significantly enriched. Basing on this finding, we speculate that change in ECM might lead to tubulointerstitial fibrosis and plays an important role in the mechanism of DN. Thus, the DEGs contained by all of these GO terms may play a key role in the pathogenesis of DN. Several other GO terms related to immune response and endopeptidase also possibly contribute to the pathogenesis of DN. The connection between the top 10 GO terms in BP and the participating genes is shown in Figure 2b. Most of the DEGs were associated with multiple GO terms. Even else, one DEG simultaneously participated in seven GO terms, so this special DEG was the hub gene and the potential therapy target.

A total of 100 KEGG pathways, consisting 65 activated and 35 suppressed pathways, were enriched considerably and the top 12 of each regulation type are shown in Figure 2c and Table 3. Most of the enriched KEGGs were related to energy metabolism and contained carbohydrates, fats, and amino acids, such as “citrate cycle,” “butanoate metabolism,” and “tryptophan metabolism,” and all of these KEGGs were suppressed. The inhibited energy metabolism probably leads to the occurrence and development of DN, but the specific mechanism is unknown. Certain pathways related to immune response, such as “phaGO termsome,” “natural killer cell mediated cytotoxicity,” and “chemokine signaling pathway,” were also enriched remarkably. This result showed the important role of immunity in DN.

Table 3: Top 12 KEGG pathways of each regulation type.
ID	Description	set Size	Enrichment Score	P value	regulation type
hsa04145	Phagosome	132	0.650342069	0.001512859	activated
hsa05145	Toxoplasmosis	102	0.602901482	0.00152439	activated
hsa04650	Natural killer cell mediated cytotoxicity	111	0.586993255	0.001519757	activated
hsa05164	Influenza A	153	0.577493882	0.001432665	activated
hsa04062	Chemokine signaling pathway	165	0.539579877	0.001457726	activated
hsa05169	Epstein-Barr virus infection	189	0.538719536	0.001440922	activated
hsa05152	Tuberculosis	161	0.524261236	0.001459854	activated
hsa05168	Herpes simplex virus 1 infection	292	0.501573113	0.001344086	activated
hsa04510	Focal adhesion	184	0.500858233	0.001457726	activated
hsa04060	Cytokine-cytokine receptor interaction	245	0.461272885	0.001420455	activated
hsa05166	Human T-cell leukemia virus 1 infection	200	0.45875226	0.001461988	activated
hsa05165	Human papillomavirus infection	296	0.428441867	0.001342282	activated
hsa00010	Glycolysis / Gluconeogenesis	60	-0.534709879	0.002421308	suppressed
hsa04978	Mineral absorption	41	-0.673759723	0.002403846	suppressed
hsa00640	Propanoate metabolism	28	-0.696872101	0.002247191	suppressed
hsa00630	Glyoxylate and dicarboxylate metabolism	24	-0.728965202	0.002277904	suppressed
hsa00260	Glycine, serine and threonine metabolism	33	-0.733962383	0.002298851	suppressed
hsa04964	Proximal tubule bicarbonate reclamation	22	-0.735854726	0.002267574	suppressed
hsa00020	Citrate cycle (TCA cycle)	29	-0.743246668	0.002242152	suppressed
hsa00330	Arginine and proline metabolism	37	-0.743544096	0.002375297	suppressed
hsa00410	beta-Alanine metabolism	28	-0.752008847	0.002247191	suppressed
hsa00280	Valine, leucine and isoleucine degradation	42	-0.777282314	0.002352941	suppressed
hsa00380	Tryptophan metabolism	34	-0.790509707	0.002331002	suppressed
hsa00650	Butanoate metabolism	23	-0.841822462	0.002272727	suppressed

Hub-genes analysis

As shown in Figure 3, GO terms were closely linked with one another through multiple hub genes. Only the top two genes in GO terms (up-regulated and down regulated) and |log(FC)|>2, were set as the screening criteria. Thus, DEGs, including Matrix MetalloPeptidase 7 (MMP7), TeNascin C (TNC), LUMican (LUM), LacTotransFerrin (LTF), ImmunoGlobulin Lambda Constant 1 (IGLC1), lysozyme (LYZ), Chemokine (C-X-C motif) ligand 6 (CXCL6), CYtochrome P450 family 27 subfamily B member 1 (CYP27B1), and FOS, were selected for further analysis. Only CYP27B1 and FOS were down-regulated. The functions and roles of ALBumin (ALB) and Complement component 3 (C3) had been clarified; thus, ALB and C3 were not included in the subsequent analysis despite their significant enrichment and large fold change. By reviewing the latest literature, we found that MMP7, LUM, CXCL6, CYP27B1, and FOS were associated with DN. However, TNC, LTF, IGLC1, and LYZ were not reported. This result proved that our analysis results were credible and could provide new research directions. The detailed functions of these hub-genes are listed in Table 4. All of the hub-genes were subjected to subsequent in vitro validation experiments. TNC, one of the hub genes, is novel for DN and encodes an ECM protein with a spatially and temporally restricted tissue distribution. The result of GO analysis showed that ECM was closely related to DN. Thus, TNC attracted our attention and interest and was subjected to animal verification experiment.

Table 4: Function analysis of hub genes.
gene	function	relationship with DN	reference
MMP7	This gene is involved in the breakdown of extracellular matrix in normal physiological processes and disease processes, such as arthritis and metastasis.	may play a role in fibronectin accumulation in the diabetic kidney [32]	RefSeq, Jan 2016 10.1080/0886022X.2016.1257433
TNC	It is implicated in guidance of migrating neurons as well as axons during development, synaptic plasticity, and neuronal regeneration.	no associational report	RefSeq, Jul 2011
LUM	The protein encoded moiety binds collagen fibrils and the highly charged hydrophilic glycosaminoglycans regulate interfibrillar spacings.	may involve in diabetic tubulointerstitial injury [33].	RefSeq, Jul 2008; 10.1002/jcp.28313
LTF	Antimicrobial, antiviral, antifungal and antiparasitic activity has been found for this protein and its peptides encoded by this gene.	no associational report	RefSeq, Sep 2014
IGLC1	Secreted immunoglobulins mediate the effector phase of humoral immunity	no associational report	RefSeq, May 2015
LYZ	The protein encoded by this gene has antibacterial activity against a number of bacterial species.	no associational report	RefSeq, Oct 2014
CXCL6	In addition to its chemotactic and angiogenic properties, it has strong antibacterial activity against Gram-positive and Gram-negative bacteria.	Promotes renal interstitial fibrosis in DN by Activating JAK/STAT3 signaling pathway [34].	RefSeq, Nov 2019; 10.3389/fphar.2019.00224
CYP27B1	The cytochrome P450 proteins are involved in drug metabolism and synthesis of cholesterol, steroids and other lipids, and binds to the vitamin D receptor and regulates calcium metabolism.	May provide protection from DN by altering vitamin D metabolism in type II diabetic mouse glomeruli [35].	RefSeq, Jul 2008; 10.1038/sj.ki.5001624
FOS	The encoded protein can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1.the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation.	May be potential therapeutic targets for mesangial cells associated with DN [36].	RefSeq, Jul 2008; 10.3892/etm.2019.7524

qRT-PCR

We validated the nine hub genes expression levels in the HK2 cells in vitro. qRT-PCR indicated that the mRNA levels of TNC, LUM, LTF, IGLC1, LYZ, and CYP27B1 were up-regulated after being stimulated by high glucose. Simultaneously, FOS was down-regulated, but its change had no statistical difference. In addition, the change in MMP7 and CXCL6 were not substantial. Changes in all of the other gene expression levels except those of CYP27B1, MMP7, and CXCL6 were consistent with the bioinformatics analysis results, and the accordance rate was approximately 66.7%. The expression levels represented by 2-∆CT of all the genes and the time are shown in Figure 4. Moreover, the result demonstrated that the expression levels of these genes stimulated by high glucose were time dependent. The expression levels of CYP27B1 and TNC were increased in all time points and had the maximum expression values at 48 h. The expression levels of IGLC1 and LUM were increased only at 48 and 72 h and were inhibited at 24 h. Furthermore, LYZ had increased expression at 48 h only.

Immunohistochemistry

We obtained 20 photos of satisfactory quality. Figure 5a shows the average integrated optical density at each time point, which represents the expression of the TNC gene. The rats were established as a DN model. The expression of TNC in their kidney tissues gradually increased with time, and the difference was statistically significant compared with the 0-week group. Certain representative images at each time point are shown in Figure 5b. The result of the animal validation experiment showed that the expression of TNC, which is involved in the final formation of renal fibrosis gradually increased with the duration of DN increases. In addition, the expression levels of TNC were consistent with the results of the bioinformatics analysis and in vitro validation experiment.

Discussion

It is currently known that glomerular endothelial cells, glomerular mesangial cells, podocytes and renal tubular epithelial cells of the kidney are the targets of hyperglycemia injury. Glomerular sclerosis, thickening of the glomerular basement membrane, glomerular hypertrophy, mesangial cell expansion, podocyte loss, renal cell hypertrophy and tubular interstitial fibrosis are the main pathological changes that occur during DN [37], and which eventually leads to progressive albuminuria. Because the pathogenesis of DN is extremely complicated, it has not yet been elucidated. Early studies found that advanced glycosylation end products, polyol pathway activation, protein kinase activity, abnormal lipid metabolism and hemodynamic changes are related to the occurrence and development of DN. However, these scattered studies cannot provide an overall impression of the pathogenesis of DN, and the biomarkers for early diagnosis and effective treatment are still lacking nowadays. At the same time, there are also a large number of pathogenic genes and potential treatment targets have not been found. Therefore, the pathogenesis of diabetic nephropathy needs to be further studied.

GEO is a public dataset that contains a large amount of array- and sequence-based data. Each raw data can provide scientific and large-scale information for the comprehensive implementation of bioinformatics analysis. Bioinformatics analysis is a powerful methodology, that is objectively, advanced, and globally. And the results of it are reliable. Our study is the first to research the tubule cells of DN through bioinformatics analysis. By obtaining each sample from DN individually, changes in disease-causing genes were directly determined. In addition, we increased the sample size, acquired and integrated all relevant data, and found a large number of important DEGs. This study provides useful resources for in-depth research. Furthermore, the results of our bioinformatics analysis have been validated in vivo and in vitro using appropriate and scientific methods, making our findings credible and reproducible.

In order to further analyze the function of DEGs and explore the molecular mechanism of DN, we performed the functional enrichment analysis of GO and KEGG based on DEGs. A total of 286 GO terms were found to be significantly enriched, and “Extracellular structure organization” is the most significant GO term filtered by enrichment analysis. Several GO terms related to ECM, such as “ECM organization,” “ECM,” “proteinaceous EMC,” and “ECM component,” were also enriched significantly. This result indicated that ECM plays an important role in the occurrence and development of DN. A published study also showed that the excessive synthesis and decreased breakdown of the ECM lead to glomerulosclerosis and tubulointerstitial fibrosis, which further lead to the development of ESRD [38]. However, the mechanism between ECM and glomerulosclerosis is extremely intricate. Meng X.M, et al. [39] proposed that TGF-β is the master regulator of fibrosis. Moreover, the MMP family and TIMP1 are also associated with fibrosis [13]. This finding further confirms that our study is consistent with previous research.

What`s more, several other GO terms related to immune response and endopeptidase were also enriched significantly, such as “humoral immune response”, “acute inflammatory response”, “response to interferon-gamma” and “negative regulation of endopeptidase activity”. Early studies have already made it clear that immune inflammation is the most important factor that promotes podocyte damage and albuminuria in DN, and IL-6, IL-8 and TNF-a are all significantly increased in DN [40]. After IL-6 binds to IL-6R, it binds to the gp130 signal transduction chain, which can lead to autophosphorylation and changes in conformational structure of Janus kinase, thereby inducing intracellular signals and activating transcription factors, signal transducers and transcription activators -3, and eventually lead to the growth and proliferation of mesangial cells [41]. In addition, more and more evidences also indicate that the complement system has a pathogenic role in the development of diabetic nephropathy, and it is believed that inhibiting specific components of the complement system may be an effective treatment strategy for the treatment of diabetic nephropathy [42]. These studies have further confirmed that our research is consistent with previous studies.

As the result of enrichment analysis of KEGG pathways, “citrate cycle”, “butanoate metabolism” and “tryptophan metabolism” and several other significantly enriched KEGG are related to the metabolism of glucose, fat and amino acid, and these pathways are inhibited. Mitochondria are the place of energy metabolism and mitochondrial dysfunction can lead to an increase in reactive oxygen species and aggravate the damage of oxidative stress to vascular endothelial cells and podocytes [43]. Manjula Darshi, et al. through the study of DN metabolomics also found that mitochondrial function and fatty acid oxidation play a key role in the occurrence and development of DN [44]. Therefore, we infer that the inhibition of energy metabolism may be involved in the occurrence and development of DN, but the more specific mechanisms need to be further studied, and it also provides us with new research directions.

Through our previous data analysis, a number of key genes were screened out and verified in HK2 cells stimulated by high glucose. Among them, TNC, LTF, IGLC1 and LYZ are currently related and there is no literature report whether they are related to DN. LTF, encoding lactoferrin, has the regulation of iron absorption, the regulation of immune response, as well as antimicrobial, antiviral, antifungal and antiparasitic activities [45]. In this study, the results of bioinformatics analysis and in vitro experiment have shown that the high expression of LTF in the renal tubule tissue of patients with diabetic nephropathy may be involved in the damage to the kidney along with other inflammation and immune-related pathways. IGLC1, which encodes the constant region of the immunoglobulin light chain, and participates in the effect phase of humoral immunity. In our verification experiment, we see that IGLC1 is significantly highly expressed at 48 hours and 72 hours, and on the contrary is low at 24 hours. As we all know, blood system diseases such as multiple myeloma can cause renal function decline due to the deposition of monoclonal globulin or light chain in the kidney. In DN, is it possible that the high expression of IGLC1 increases the load of light chain on the kidney and aggravate kidney damage? This aspect has once again provided us with new research directions. LYZ, which encodes human lysozyme, has antibacterial activity in many kinds of bacteria, but its function lacks specificity, and there are few related researches. Although there is no clear evidence to proves that these genes are related to DN, they are all related to immunity, which plays a key role in the pathogenesis of DN, and is expected to become a new research direction.

TNC is a homohexamer that contains multiple epidermal growth factor-like and fibronectin type-III domains linked by disulfides and can present several molecular forms through alternative splicing and protein modifications. This multimodal structure allows tenascins to interact with a large number of highly diverse ligands (provided by RefSeq, July 2011). Another notable feature of TNC is its tightly controlled pattern of gene expression. Few TNCs are found in adults except in stem cell niches, tendons, inflammation and trauma sites, and the stroma of solid tumors [46,47]. In addition, TNC has an exceedingly wide range of functions, and the adhesion-regulating ECM protein is probably well-known because of its ability to link to fibronectin and change cell proliferation and signaling mediated through fibronectin and integrins [48]. Finally, TNC is associated with certain diseases and has the opportunity to be used in diagnosis and treatment. For example, the detection of high levels of TNC in body fluids may have a diagnostic value for diseases, including heart disease, rheumatoid arthritis, and cancer [49,50]. Moreover, TNC may be a good target for adjuvant chemotherapy in breast cancer [51]. In our study, the gene expression analysis of human renal tubule cells in the in vitro and vivo experiments indicated that TNC was highly expressed in the DN group. Previous research showed that a TNC-enriched microenvironment is associated with kidney fibro genesis [52]. Therefore, TNC is involved in the pathogenesis of DN, and its possible pathogenic mechanism increases the synthesis of the ECM by adhesion-regulating ECM protein and leads to glomerulosclerosis and tubulointerstitial fibrosis.

However, the samples that we have analyzed are from patients with DN at different stages because of the limited samples, which may not reflect the progression of the disease. This limitation may be the deficiency of this study, but our validation experiment supplements and enhances the credibility of the analysis results.

Conclusions

This study is the first to use bioinformatics analysis for identifying several novel key genes in tubule cells in DN. TNC, a novel gene of DN, plays an important role in the renal fibrosis of DN, and is expected to be a new therapeutic target. The results provide a theoretical basis and data resources for further research. This study also provides clues for finding new biomarkers and therapeuti targets.

Acknowledgement

Supported by the Foundation of Science and Technology Department of Sichuan Province (grant number 2018SZ0378 and 2018SZ0175).

References

Copyright

© 2020 Zhang L, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

View Similar Articles

Share your thoughts and experiences

Article Alerts

Subscribe to our articles alerts and stay tuned.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Quick Enquiry

Table 1: Primer sequences used for PCR.
Genes	forward	reverse
β-actin	5′-GAAGATCAAGATCATTGCTCC-3′	5′-TACTCCTGCTTGCTGATCCA-3′
MMP7	5′-GGGAATCAGCCTAGCAAACTGGATGT-3′	5′-GCTCTAATGCCTTGTGGATCACGGAC-3′
TNC	5′-TGAACGCCACCCTGCCAGAAGAGAA-3′	5′-GGCTGACTCCAGATCCACCGAACACT-3′
LUM	5′-TCCGTCCTGACAGAGTTCACAGCATA-3′	5′-AGCTCATCACAGTACATGGCACTTGG-3′
LTF	5′-GTGCCTGGCTGAGAATGCTGGAGAC-3′	5′-GGCGTTCCACCTTCTCCATCCGAGAC-3′
IGCL1	5′-ATGAAGGGAGCACCGTGGAGAAGACA-3′	5′-GGGAGAAGGGCTGGATGACTTGGGAT-3′
LYZ	5′-GGGAATCAGCCTAGCAAACTGGATGT-3′	5′-GCTCTAATGCCTTGTGGATCACGGAC-3′
CXCL6	5′-GCTGACAGAGCTGCGTTGCACTTG -3′	5′-CCGTTCTTCAGGGAGGCTACCACTTC-3′
CYP27B1	5′-AGTCCAGACAGCACTCCACTCAGAGA-3′	5′-TTAGCACTTCCTTGACCACCGCCTTC-3′
FOS	5′-AGACCGAGATTGCCAACCTGCTGAAG-3′	5′-GCTCCATGCTGCTGATGCTCTTGACA-3′

Global Journal of Obesity, Diabetes and Metabolic Syndrome

Novel key gene tenascin C related to extracellular matrix accumulation in diabetic nephropathy kidney tubules: Results of integrative bioinformatics analysis