progenetix
progenetix.blogspot.com is the News and History site for www.progenetix.net.
Tuesday, July 07, 2009
Article access
For all people with database registration, access to all the PDFs (>1500) has been enabled. Authentication is required, and this is considered as personal information sharing among collaborators ...
Sunday, July 05, 2009
GEO data sets
We have started to re-evaluate public aCGH data sets, available as raw or Log2 data from NCBI's GEO. Since the data there exists in a heterogeneous (MIAME-compatible) mess of data formats, GEO data will be made accessible either as locus-mapped (normalized) probe data, or as segmentation output files. For a start, we have posted some data sets from Affymetrix genome array experiments, for which the raw data was re-analyzed using the aroma.affymetrix package with custom plotting routines. Please browse the aCGH publications for the corresponding icon.Friday, March 20, 2009
Progenetix@GoogleGroups
I have created a group site at GoogleGroups:
http://groups.google.com/group/progenetix/
Comments, site updates etc. will now be discussed there; please subscribe!
And the current tally is:
http://groups.google.com/group/progenetix/
Comments, site updates etc. will now be discussed there; please subscribe!
And the current tally is:
- 21256 cases from 777 publications
- 17409 CGH
- 2674 array CGH
Wednesday, January 28, 2009
Mitelman database publication update
As reference for tumor karyotype analyses using traditional cytogenetic techniques, the literature database has been updated with the current content of the Mitelman database. The listing references now to 3207 publications from that source, representing 58197 cases. The vast majority of these cases are not included in the Progenetix database, but can be visualized through the article links.
Change in ProgenetiXML format
For more flexible analysis options, the ProgenetiXML format now has separate fields for the Golden Path mapped status information for CGH, aCGH and banding data.
Tuesday, October 28, 2008
> 20,000 individual oncogenomic profiles
By incorporation of some large aCGH studies, today the Progenetix database jumped the 20k mark and contains now 20122 cases from 682 publications:
Again, above numbers include some overlaps.
- 17154 of 31614 found CGH experiments
- 1926 of 10814 found aCGH experiments
- 1787 metaphase banding data sets
Again, above numbers include some overlaps.
Friday, September 19, 2008
19013 cases ...
The database has reached >19000 cases today:
Also, the website tools (e.g. aCGH to ProgenetiXML) have seen some overhaul, making them more robust and introducing new features (high-resolution ideograms, segments file input etc.).
Enjoy!
- 16542 CGH
- 1503 aCGH
- 1757 banding/MFISH/SKY (mostly cell lines)
Also, the website tools (e.g. aCGH to ProgenetiXML) have seen some overhaul, making them more robust and introducing new features (high-resolution ideograms, segments file input etc.).
Enjoy!
Wednesday, June 25, 2008
Improved search options
Search options have been improved, with free text search now used in "Search Entities, Loci and Analysis Groups". As example:
lymphoma, NHL, lymph node, leukemia, hemato
will return all ICD entities, ICD loci and analysis groups with one of the comma-separated values in the title. From here, you can display each single subset by clicking on the ideogram button, or select several of the subsets for further filtering & analysis (registered users only).
lymphoma, NHL, lymph node, leukemia, hemato
will return all ICD entities, ICD loci and analysis groups with one of the comma-separated values in the title. From here, you can display each single subset by clicking on the ideogram button, or select several of the subsets for further filtering & analysis (registered users only).
Monday, May 19, 2008
Jumping 18k
Thanks to some data submissions, and notably today's articles from Itziar Salaverria Frigola (lab of Elias Campo), Progenetix has finally jumped the 18000 cases mark. The lack of ost of the array CGH data is still somewhat puzzling; either case specific data is not shown at all, or "raw data" is dumped in GEO | ArrayExpress | SMD - where nobody but groups with dedicated bioinformaticians will get it out. Probably, I will have to work on that ...
Enjoy the data, especially the expanded new B-NHL content!
Enjoy the data, especially the expanded new B-NHL content!
Friday, May 02, 2008
Search update
The future is here (according to last post): For registered users, the pre selected publications can now be used to further limit the selection & analyze the cases ...
Feedback, please.
Feedback, please.
Tuesday, April 29, 2008
Search implementation
Instead of fixed listings for publications containing CGH/aCGH/banding analyses, a search engine for articles has been implemented. Options are:
Please comment on the features. Future extensions probably will include specific case search based on the pre-selected publications ...
- Autor names
- words from the title
- techniques
- array parameters for aCGH
Please comment on the features. Future extensions probably will include specific case search based on the pre-selected publications ...
Friday, April 25, 2008
Minor graphics changes
The imbalance histograms have been scaled to increase readability. Also, some broken links have been fixed.
Tuesday, April 22, 2008
Speed
The active database mining has been sped up (by introducing an indexing system). For registered users, when using:
Tools & Source => Source data selection, download and analysis => Use Progenetix Database
the options for data selection will load almost instantaneously.
Enjoy!
Tools & Source => Source data selection, download and analysis => Use Progenetix Database
the options for data selection will load almost instantaneously.
Enjoy!
Wednesday, April 02, 2008
ProgenetiXML format enhancement: Golden Path
I have updated the ProgenetiXML format. The main improvement is the addition of a Golden Path aberration annotation:
<GPANNOTATION>chr7:0-158628139:1::chr8:0-146274826:1::chr10:70300000-135413628:-1::chr12:0-132449811:1::chr13:16000000-114142980:-1</GPANNOTATION>
The GP-annotation represents a transcription of the chromosomal band status. For data derived from aCGH experiments, this may lead to an expansion of imbalances, since originally every band containing a full or partial gain/loss is considered as imbalanced. This may be changed in the future, with direct usage of the original GP array data, where available.
Additional changes:
Enjoy,
Michael.
<GPANNOTATION>chr7:0-158628139:1::chr8:0-146274826:1::chr10:70300000-135413628:-1::chr12:0-132449811:1::chr13:16000000-114142980:-1</GPANNOTATION>
The GP-annotation represents a transcription of the chromosomal band status. For data derived from aCGH experiments, this may lead to an expansion of imbalances, since originally every band containing a full or partial gain/loss is considered as imbalanced. This may be changed in the future, with direct usage of the original GP array data, where available.
Additional changes:
- removal of extraneous XML tags, for the cases lacking data in those entries
- addition of a full data matrix (862 bands status and all annotations) as download option prior to visualization
Enjoy,
Michael.
Wednesday, February 13, 2008
improved color scheme
since pure red-green works bad for some people, the histo and ideogram colors have been slightly changed (yellowish red, blueish green). i am happy about further suggestions!
new server
sometime late january, the site has been moved to a new server (Mac Pro, Dual 2.8GHz, 8GB RAM). this should make the site more user-friendly ...
but: do not use deep links or bookmarks - stick to www.progenetix.net!
and, please, do report problems.
but: do not use deep links or bookmarks - stick to www.progenetix.net!
and, please, do report problems.
Thursday, December 20, 2007
ISCN parser modifications
Thanks to a review of selected cases by Turid Knutsen, a number of changes have been implemented in the karyotype parser portion of the ISCN2matrix engine. This concerns mostly the handling of ambiguous annotations, but also some previous omissions of specific patterns.
The changes are active in the online parser, and will be propagated to the data set soon.
The changes are active in the online parser, and will be propagated to the data set soon.
Wednesday, December 12, 2007
data review and site overhaul
The site has had a layout overhaul (the changes in code are more than skin deep, though). On the data side, the emphasis is mostly on review and adding clinical information. As a ballpark estimate, gender and age information should be available for about half the cases - but this may take some months. Even followup / survival data will become accessible for a proportion of cases. Of course, this data is dependent on the published results, and / or the willingness of the authors to dig deeply into their old lab records ...
On the minus side, the case number just dropped slightly, to now 16998. This was due to one doubled publication entry, and one pure SKY data set that was removed (still accessible through NCBI).
As general policy, metaphase banding (incl. MFISH/SKY data) is only presented if it had been directly added to Progenetix (e.g. the DSMZ cell lines, or as part of some specific data review project). Otherwise, only cases with composite analysis (e.g. CGH and banding) are kept for now.
On the minus side, the case number just dropped slightly, to now 16998. This was due to one doubled publication entry, and one pure SKY data set that was removed (still accessible through NCBI).
As general policy, metaphase banding (incl. MFISH/SKY data) is only presented if it had been directly added to Progenetix (e.g. the DSMZ cell lines, or as part of some specific data review project). Otherwise, only cases with composite analysis (e.g. CGH and banding) are kept for now.
Monday, November 19, 2007
Bug fix
The latest editions had a bug, where for the publication pages the histograms and numeric values were showing "0" aberrations, for every band. Ideograms were not affected. This has been fixed on 20071119.
Thursday, November 15, 2007
Layout changes and data update
The site underwent some layout and option changes. Most interestingly, the publication information now links to the Mitelman database if the publication is included there. This is mostly the case for the articles listed in the "bandingtracker", and indicated by the
icon. The data itself is not stored on the Progenetix server, but rather accessed from the CGAP server.
icon. The data itself is not stored on the Progenetix server, but rather accessed from the CGAP server.Current Progenetix data content:
- 17023 experiments from 667 publications
- CGH: 15163 experiments, 571 articles
- array CGH: 827 experiments, 30 articles
- banding/SKY: 1017 experiments, 72 articles

