Content of review 1, reviewed on August 22, 2014

I find this to be an interesting and important data set. The marmoset is of growing importance and the improved annotation of its genome (as well as of other non-human/non-mouse genomes) is sorely needed. This data set is a major step forward not only in this specific instance, but also in the scientific pipeline of constantly improving and reconsidering genomic sequence as it becomes available.

Minor essential revisions:

  1. In the abstract, under Findings, the first sentence needs to be reworded so that it is clear that there are five tissues in total across two marmosets. It currently reads as if all five tissues were collected from both marmosets.

  2. Regarding the transcriptome assembly section: my reading suggests that this approach would identify marmoset sequences with human orthologs, but that it may miss "marmoset-specific" sequences. In particular, I am thinking of genes that have become pseudogenes in the human lineage (the olfactory receptors, for example). I do not think that this is a problem, but it should be made clear what the effects of these choices are and that this data set represents a more conservative approach aimed at minimizing false positives.

  3. In the comparison to Ensembl annotations section, it is probably worth mentioning the 4555 genes NOT found. Is this because they are not real, or because they are expressed only in tissues not covered in this study?

Discretionary revisions:

  1. I'd be interested to know if there are any chromosomal biases, specifically related to the sex chromosomes.

  2. It may also be interesting to see a table showing if transcripts were unique to one of the tissues or found in multiple tissues.

Level of interest: An article of importance in its field

Quality of written English: Acceptable

Statistical review: No, the manuscript does not need to be seen by a statistician.

Declaration of competing interests: I declare that I have no competing interests

Source

    © 2014 the Reviewer (CC-BY 4.0).

References

    D., M. M., Dongren, R., S., G. J., M., G. R., C., L. A., N., M. E., A., F. J., Jr., N. R. B. 2014. De novo assembly of the common marmoset transcriptome from NextGen mRNA sequences. GigaScience.