<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Gene Ontology analysis with Python and Bioconductor</title>
	<atom:link href="http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/feed/" rel="self" type="application/rss+xml" />
	<link>http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/</link>
	<description></description>
	<lastBuildDate>Tue, 04 Jun 2013 12:18:00 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Assorted Links of Bioinformatics &#171; Abner&#8217;s Postgraduate Days</title>
		<link>http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-543</link>
		<dc:creator><![CDATA[Assorted Links of Bioinformatics &#171; Abner&#8217;s Postgraduate Days]]></dc:creator>
		<pubDate>Sat, 24 Mar 2012 16:11:58 +0000</pubDate>
		<guid isPermaLink="false">http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-543</guid>
		<description><![CDATA[[...] Gene Ontology analysis with Python and Bioconductor « Blue Collar Bioinformatics [...]]]></description>
		<content:encoded><![CDATA[<p>[...] Gene Ontology analysis with Python and Bioconductor « Blue Collar Bioinformatics [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Zhidong</title>
		<link>http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-190</link>
		<dc:creator><![CDATA[Zhidong]]></dc:creator>
		<pubDate>Wed, 04 Aug 2010 14:29:38 +0000</pubDate>
		<guid isPermaLink="false">http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-190</guid>
		<description><![CDATA[Thank you, Brad.]]></description>
		<content:encoded><![CDATA[<p>Thank you, Brad.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brad Chapman</title>
		<link>http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-189</link>
		<dc:creator><![CDATA[Brad Chapman]]></dc:creator>
		<pubDate>Wed, 04 Aug 2010 11:03:09 +0000</pubDate>
		<guid isPermaLink="false">http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-189</guid>
		<description><![CDATA[Zhidong;
The purpose of the p-value is to select the enriched genes from the background. If you have some other method of identifying the enriched genes, assign them proxy p-values that differentiate them from background. The topDiffGenes function should then be modified to select out the targets you are interested in, and the rest of the analysis can proceed as normal. Hope this helps.]]></description>
		<content:encoded><![CDATA[<p>Zhidong;<br />
The purpose of the p-value is to select the enriched genes from the background. If you have some other method of identifying the enriched genes, assign them proxy p-values that differentiate them from background. The topDiffGenes function should then be modified to select out the targets you are interested in, and the rest of the analysis can proceed as normal. Hope this helps.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Zhidong</title>
		<link>http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-188</link>
		<dc:creator><![CDATA[Zhidong]]></dc:creator>
		<pubDate>Wed, 04 Aug 2010 00:55:17 +0000</pubDate>
		<guid isPermaLink="false">http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-188</guid>
		<description><![CDATA[Dear Brad,
If I have the predicted target genes of miRNAs with no P values of difference expression, how could I do the functional enrichment analysis of these target genes?]]></description>
		<content:encoded><![CDATA[<p>Dear Brad,<br />
If I have the predicted target genes of miRNAs with no P values of difference expression, how could I do the functional enrichment analysis of these target genes?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Robert</title>
		<link>http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-187</link>
		<dc:creator><![CDATA[Robert]]></dc:creator>
		<pubDate>Thu, 29 Jul 2010 18:01:44 +0000</pubDate>
		<guid isPermaLink="false">http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-187</guid>
		<description><![CDATA[Hi Brad,

Nice example of GO analysis. However, you can save yourself a bit of work at the end. In each organism specific database in Bioconductor, there is a wrapper to return the mapping of all genes (by Entrez ID) to that particular GO term, thereby saving you the work of traversing the tree.

For example, with the &quot;org.Rn.eg.db&quot; database, you can use: org.Rn.egGO2ALLEGS to return all genes annotated to your terms of interest.]]></description>
		<content:encoded><![CDATA[<p>Hi Brad,</p>
<p>Nice example of GO analysis. However, you can save yourself a bit of work at the end. In each organism specific database in Bioconductor, there is a wrapper to return the mapping of all genes (by Entrez ID) to that particular GO term, thereby saving you the work of traversing the tree.</p>
<p>For example, with the &#8220;org.Rn.eg.db&#8221; database, you can use: org.Rn.egGO2ALLEGS to return all genes annotated to your terms of interest.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Zhidong</title>
		<link>http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-186</link>
		<dc:creator><![CDATA[Zhidong]]></dc:creator>
		<pubDate>Fri, 16 Jul 2010 14:34:45 +0000</pubDate>
		<guid isPermaLink="false">http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-186</guid>
		<description><![CDATA[Hi Brad,

That is a naive problem for I know little about python and now I have run it successfully following your reply. Thank you very much.]]></description>
		<content:encoded><![CDATA[<p>Hi Brad,</p>
<p>That is a naive problem for I know little about python and now I have run it successfully following your reply. Thank you very much.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brad Chapman</title>
		<link>http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-185</link>
		<dc:creator><![CDATA[Brad Chapman]]></dc:creator>
		<pubDate>Thu, 15 Jul 2010 15:20:27 +0000</pubDate>
		<guid isPermaLink="false">http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-185</guid>
		<description><![CDATA[Zhidong;
You&#039;re getting this message because none of the terms in your GO mapping file (ATH_GO_GOSLIM.txt) match the example file (diffexp_example-diffs.csv). You&#039;ll need to supply a real list of terms for the first argument and be sure the names match what is in your GO file. You&#039;ll also need to modify the parse_go_map_file function so you can pull out the Arabidopsis gene names (1st column) and GO terms (6th column).

I updated the code so the error message in this case is more useful and to make it work on the latest rpy2 version. Hope this helps get things working for you.]]></description>
		<content:encoded><![CDATA[<p>Zhidong;<br />
You&#8217;re getting this message because none of the terms in your GO mapping file (ATH_GO_GOSLIM.txt) match the example file (diffexp_example-diffs.csv). You&#8217;ll need to supply a real list of terms for the first argument and be sure the names match what is in your GO file. You&#8217;ll also need to modify the parse_go_map_file function so you can pull out the Arabidopsis gene names (1st column) and GO terms (6th column).</p>
<p>I updated the code so the error message in this case is more useful and to make it work on the latest rpy2 version. Hope this helps get things working for you.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Zhidong</title>
		<link>http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-184</link>
		<dc:creator><![CDATA[Zhidong]]></dc:creator>
		<pubDate>Thu, 15 Jul 2010 09:20:30 +0000</pubDate>
		<guid isPermaLink="false">http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-184</guid>
		<description><![CDATA[Dear Brad,

Thank you for your great post.

I have try your script and examples but failed(Please see it as following). Could you tell me how to fix it? Thank you in advance.

$python diffexp_go_analysis.py diffexp_example-diffs.csv ATH_GO_GOSLIM.txt 

Loading required package: graph
Loading required package: Biobase

Welcome to Bioconductor

  Vignettes contain introductory material. To view, type
  &#039;openVignette()&#039;. To cite Bioconductor, see
  &#039;citation(&quot;Biobase&quot;)&#039; and for packages &#039;citation(pkgname)&#039;.

Loading required package: GO.db
Loading required package: AnnotationDbi
Loading required package: DBI
Loading required package: SparseM
Package SparseM (0.85) loaded.
           To cite, see citation(&quot;SparseM&quot;)


Attaching package: &#039;SparseM&#039;

The following object(s) are masked from &#039;package:base&#039;:

    backsolve


groupGOTerms:   GOBPTerm, GOMFTerm, GOCCTerm environments built.

Building most specific GOs .....Error in split.default(geneID[goodGO], allGO[goodGO]) : 
  first argument must be a vector
In addition: Warning message:
In is.na(gene2GO) : is.na() applied to non-(list or vector) of type &#039;NULL&#039;
Traceback (most recent call last):
  File &quot;diffexp_go_analysis.py&quot;, line 130, in 
    main(sys.argv[1], sys.argv[2])
  File &quot;diffexp_go_analysis.py&quot;, line 25, in main
    gene_pval, go_pval, topgo_method)
  File &quot;diffexp_go_analysis.py&quot;, line 81, in run_topGO
    go_data = robjects.r.new(&quot;topGOdata&quot;, **params)
  File &quot;/usr/lib64/python2.6/site-packages/rpy2/robjects/__init__.py&quot;, line 423, in __call__
    res = super(RFunction, self).__call__(*new_args, **new_kwargs)
rinterface.RRuntimeError: Error in split.default(geneID[goodGO], allGO[goodGO]) : 
  first argument must be a vector]]></description>
		<content:encoded><![CDATA[<p>Dear Brad,</p>
<p>Thank you for your great post.</p>
<p>I have try your script and examples but failed(Please see it as following). Could you tell me how to fix it? Thank you in advance.</p>
<p>$python diffexp_go_analysis.py diffexp_example-diffs.csv ATH_GO_GOSLIM.txt </p>
<p>Loading required package: graph<br />
Loading required package: Biobase</p>
<p>Welcome to Bioconductor</p>
<p>  Vignettes contain introductory material. To view, type<br />
  &#8216;openVignette()&#8217;. To cite Bioconductor, see<br />
  &#8216;citation(&#8220;Biobase&#8221;)&#8217; and for packages &#8216;citation(pkgname)&#8217;.</p>
<p>Loading required package: GO.db<br />
Loading required package: AnnotationDbi<br />
Loading required package: DBI<br />
Loading required package: SparseM<br />
Package SparseM (0.85) loaded.<br />
           To cite, see citation(&#8220;SparseM&#8221;)</p>
<p>Attaching package: &#8216;SparseM&#8217;</p>
<p>The following object(s) are masked from &#8216;package:base&#8217;:</p>
<p>    backsolve</p>
<p>groupGOTerms:   GOBPTerm, GOMFTerm, GOCCTerm environments built.</p>
<p>Building most specific GOs &#8230;..Error in split.default(geneID[goodGO], allGO[goodGO]) :<br />
  first argument must be a vector<br />
In addition: Warning message:<br />
In is.na(gene2GO) : is.na() applied to non-(list or vector) of type &#8216;NULL&#8217;<br />
Traceback (most recent call last):<br />
  File &#8220;diffexp_go_analysis.py&#8221;, line 130, in<br />
    main(sys.argv[1], sys.argv[2])<br />
  File &#8220;diffexp_go_analysis.py&#8221;, line 25, in main<br />
    gene_pval, go_pval, topgo_method)<br />
  File &#8220;diffexp_go_analysis.py&#8221;, line 81, in run_topGO<br />
    go_data = robjects.r.new(&#8220;topGOdata&#8221;, **params)<br />
  File &#8220;/usr/lib64/python2.6/site-packages/rpy2/robjects/__init__.py&#8221;, line 423, in __call__<br />
    res = super(RFunction, self).__call__(*new_args, **new_kwargs)<br />
rinterface.RRuntimeError: Error in split.default(geneID[goodGO], allGO[goodGO]) :<br />
  first argument must be a vector</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Frederico Arnoldi</title>
		<link>http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-173</link>
		<dc:creator><![CDATA[Frederico Arnoldi]]></dc:creator>
		<pubDate>Tue, 11 May 2010 20:27:40 +0000</pubDate>
		<guid isPermaLink="false">http://bcbio.wordpress.com/2009/10/18/gene-ontology-analysis-with-python-and-bioconductor/#comment-173</guid>
		<description><![CDATA[Dear Brad Chapman,
Python and R is fantastic combination: the pleasantness of Python together with the amazing power of R and it&#039;s packages. I&#039;ve been using this combination for bioinformatics too, but very probably with fewer familiarity than you. Sometimes I ask myself how robust would be implementing some task with Python/Rpy instead of directly in R. I am sure your posts make Python&#039;s reputation stronger. Thanks for sharing your codes.]]></description>
		<content:encoded><![CDATA[<p>Dear Brad Chapman,<br />
Python and R is fantastic combination: the pleasantness of Python together with the amazing power of R and it&#8217;s packages. I&#8217;ve been using this combination for bioinformatics too, but very probably with fewer familiarity than you. Sometimes I ask myself how robust would be implementing some task with Python/Rpy instead of directly in R. I am sure your posts make Python&#8217;s reputation stronger. Thanks for sharing your codes.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
