Web Analytics
Loading...
Loading...
  1. How can PlacentaCellEnrich be used to calculate cell-specific gene enrichment?
  2. How can PlacentaCellEnrich be used to calculate cell-specific gene enrichment for input gene set from other model organisms?
  3. How expression details of genes in the datasets be obtained?

Steps to calculate cell-specific gene enrichment

  1. Click the "Placenta Cell Gene Enrichment" tab to go to the cell-specific gene enrichment page.



  2. Enter a gene list by either pasting it in the "Input Gene List" text area or by uploading the file (.txt/.csv extension). Each gene ID in the gene list should be in a new line.



  3. Choose the appropriate options provided on the page, for example, Organism (of the input gene set), Gene ID, Dataset, Type of Cell-Specific Genes, and Expression Threshold.

  4. Sample List 1 contains top 1000 genes based on expression from differentiated Extravilious trophpblast cells (EVTs) (Okae et al. 2018). We would expect PlacentaCellEnrich to show cell-specific enrichment of EVTs. We used the Suryawanshi et al. data for this analysis.

  5. Click on the submit button. A bar chart will be generated, showing the enrichment of cell-specific genes in your list. Above the bar chart, you will also find the genes that could not be mapped to the genes in the dataset.



  6. The bar charts are interactive and the user can hover over the bar chart to see the cell type, the number of cell-specific genes in the input data, fold change, and the enrichment scores (-Log10(P-Adjusted)).



  7. By clicking on the bar of the bar chart, the user can visualize the expression of those cell-specific genes in a heatmap. The user can also hover over the heatmap to see details of genes in a particular cell.



  8. There is also an option ("Histogram Plot Options") to plot the bar charts using fold-change values. For that the user has to select the fold-change option and then click on the submit button.



  9. The enrichment results can also be downloaded using the "Enrichment Results" button. It will download both the enrichment results and expression values of cell-specific genes of the selected cell.



Steps to calculate cell-specific enrichment using orthologs in model organisms

  1. Click on the "Placenta Cell Gene Enrichment" tab to go to the cell-specific gene enrichment page.

  2. Enter a mouse gene list by either pasting it in the "Input Gene List" text area or by uploading the file. Each gene ID in the gene list should be in a new line.

  3. Choose the "Mouse" option as the organism and select either of the human datasets as the dataset. Also, appropriately select the other options provided on the page.

  4. Click on the submit button. A bar chart will be generated, showing the enrichment of cell-specific genes in your list. Above the bar chart, you will also find the genes that could not mapped to the genes in the dataset.



Note: The cell-specific gene enrichment uses 1 to 1 orthologs between human and other organims downloaded from the Ensembl Version 100 database.


Steps to obtain gene expression details for cell-specific genes

  1. Click on the "Placenta Cell Specific Genes" tab.

  2. Choose the Organism and Dataset. The user can enter and search the gene name in the "Gene Name" text box (Auto-fill feature).
  3. Click on the submit button. A bar chart will be generated, showing the expression values of the gene across the cells along with its cell specificity information.

  4. The bar chart is interactive and the user can hover over the bar chart to see the details including cell type, expression values, and cell specificity.

  5. Users can also view the details in a table by clicking the "Gene Expression Table" tab.


Help

There are some general FAQs listed below explaining how the tool works, and datasets used for the cell-specific gene enrichment. In case of any other questions please contact geetu@iastate.edu .


  1. What are the different cell-specific gene groups used for cell-specific gene enrichment?
  2. How to cite PlacentaCellEnrich in your research?
  3. What are the different datasets used for cell-specific gene enrichment in PlacentaCellEnrich?
  4. Can we use background genes for carrying out cell-specific gene enrichment?
  5. Why are a lot of genes in the input data not found?
  6. Why is No.of.Cells always "N.A." for Suryawanshi et al. data?
  7. What is in the downloaded files?
  8. How should I interpret my results?
  9. How come the enrichment plot does not show any results?

Defining Cell-Specific Genes

Cell-specific genes are defined using the algorithm from the HPA (Uhlén et al. 2015), and can be grouped as follows:

  1. Not Expressed: Genes with an expression level less than an expression threshold specified by user across all the cells.

  2. Cell Enriched: Genes with an expression level greater than or equal to the expression threshold that also have at least five-fold higher expression levels in a particular cell compared to all other cells.

  3. Group Enriched: Genes with an expression level greater than or equal to the expression threshold that also have at least five-fold higher expression levels in a group of 2-7 cells compared to all other cells, and that are not considered Cell Enriched.

  4. Cell Enhanced: Genes with an expression level greater than or equal to the expression threshold that also have at least five-fold higher expression levels in a particular cell compared to the average levels in all other cells, and that are not considered Cell Enriched or Group Enriched.

  5. Expressed in all: Genes with an expression level greater than or equal to the expression threshold across all of the cell that are not in any of the above 4 groups.

  6. Mixed: Genes that are not assigned to any of the above 5 groups.

Genes from the Cell Enriched , Group Enriched , and Cell Enhanced groups are classified as Cell-specific genes.


Citation

Jain A and Tuteja G (2020). PlacentaCellEnrich: A tool to characterize gene sets using placenta cell-specific gene enrichment analysis. Placenta , doi.org/10.1016/j.placenta.2020.10.029 .

Datasets

PlacentaCellEnrich defines cell-specific genes using single-cell RNA-Seq (scRNA-Seq) data from two placental studies. The datasets used in the tool are:

  1. Suryawanshi H et al.: scRNA-Seq data across 22 placental and decidual cells

  2. Vento-Tormo R et al.: scRNA-Seq data across 32 placental and decidual cells


Background Genes

PlacentaCellEnrich enables users to provide background genes for carrying out cell-specific gene enrichment. In this case, instead of using all the genes in the dataset, a background gene set is being used to carry out the enrichment analysis. It should be noted that the background gene set must have all the genes of the input gene set. The p-value is calculated as:

$$P(X \gt k) = \sum\limits_{i=k+1}^n \frac{{{K_b}\choose{i}} {{N_b-K_b}\choose{n-i}}}{{{N_b}\choose{n}}}$$

Where, \(N_{b}\) is the total number of background genes, \(K_{b}\) is the total number of cell-specific genes for a cell in background genes, n is the number of genes in the input gene set, k is the number of cell-specific genes in the input gene set. The p-values are corrected for multiple hypothesis testing using the Benjamini & Hochberg correction. If the background gene set is not provided all the genes will be used as background.


Why are a lot of genes in the input data not found?

The user should first ensure that they selected the appropriate Gene ID, and organism, and entered the genes one per line. It is also important to note that we are only analyzing protein-coding genes, due to which non-coding genes are not found in the datasets. Additionally, when genes from a non-human species are being used as input, if an orthologous gene is not identified, it will not be found in the datasets.

Why is No.of.Cells always "N.A." for Suryawanshi et al. data?

Suryawanshi et al. study doesn't provide the number of cells of the identified placenta and decidua cell clusters. Due to this, we are showing "N.A." for the number of cells for Suryawanshi et al. data.

Columns in the CellSpecificGeneEnrichment.tsv file

  1. Cell: Cell type.

  2. -Log10PValue: -log10 transformed adjusted P-Value.

  3. #Cells: Total number of cells representing the cell type in the dataset.

  4. Cell-Specific Genes: Total number of cell-specific genes enriched in the input gene set.

  5. Fold-change: Fold-change of the cell-specific genes in the input gene set.

CellSpecificGeneExpression.tsv file

  1. Gene: Cell-specific gene name.

  2. Cell: Cell type.

  3. Log2(TPM): Log2 transformed TPM like expression value.

  4. Group: Cell-specific gene group.


How should I interpret my results?

  1. The recommended threshold value of the adjusted p-values and fold-change for selecting the enriched cells is 0.01 and 2 respectively.

  2. It is also recommended that the users should look at cell-specific gene enrichment from all the sources and have the highest confidence in results that are consistent across datasets.


How come the enrichment plot does not show any results?

If a bar plot is shown with no bars, that indicates that there is no enrichment for any groups of cell-type specific genes. You can try adjusting your gene set size, or parameters in PlacentaCellEnrich (expression threshold, type of cell specific expression, or try including a background list). If you still do not see any enrichment but are interested in groups that your genes belong to, you can use the Fold-change plot option instead of the P-Adjusted plot option, keeping in mind that results are not significant. You can also download details of which group your genes belong to by clicking the "Enrichment Results" button.

Contact Details


Tuteja Lab ( geetu@iastate.edu )

2106 Molecular Biology Building

2437 Pammel Drive

Department of Genetics, Development and Cell Biology

Iowa State University

Ames, IA 50011