<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://sciencex2.org" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>data mining</title>
 <link>http://sciencex2.org/en/taxonomy/term/575</link>
 <description>The taxonomy view with a depth of 0.</description>
 <language>en</language>
<item>
 <title>Data Mining in the War on Terror:  Useful and Worth the Risk?</title>
 <link>http://sciencex2.org/en/node/52864</link>
 <description>&lt;h3 class=&quot;field-label&quot;&gt;Description&lt;/h3&gt;
&lt;div class=&quot;content&quot;&gt;
   &lt;p&gt;In 2002, the Defense Advanced Research Project Agency established the Information Awareness Office (IAO) to unify projects using data mining technologies to counter threats to national security, in particular terrorism.&amp;nbsp; One of the largest of these efforts initially named the Total Information Awareness (TIA) Program, later changed to the Terrorism Information Awareness Program, was focused on integrating various public and private database.&amp;nbsp; Public awareness and concern over the program was raised by William Safire who wrote that TIA had a $200 million dollar budget to create computer dossiers on 300 million americans.&amp;nbsp; After public outcry over the program Congress moved in 2004 to prohibit further funding of TIA.[1]&lt;/p&gt;
&lt;p&gt;Deeper analysis of TIA shows that it was never completely defunded and perhaps most troubling various technology components of the effort continue to be developed under the less publicly viewable budget of the National Security Agency (NSA). [2] Regardless of if TIA technologies continue to be developed or not it is clear that the United States Government is engaging in data mining on a large scale effort.&amp;nbsp; &lt;/p&gt;
&lt;p&gt;Perhaps most interesting in this regards is a 2004 study conducted by the General Accounting Office that surveyed 128 federal agencies and departments and found that 52 of them were planning to launch nearly 200 new data mining projects.&amp;nbsp; Of the reported projects 122 involved personal information with at least 54 projects focused on integrating data from the private sector.&amp;nbsp; Clear application drivers for these efforts included analyzing intelligence to detect terrorist activities and identifying criminal activities.[3]&lt;/p&gt;
&lt;p&gt;Data mining programs that automatically search across personal and private sector data have been sold by proponents as being a necessary evil in the war against terror, but are they effective?&amp;nbsp; A new study conducted by the National Research Council on behalf of the Department of Homeland Security concludes unequivocally no and argues that not only will the technology not work but the inevitable mistakes made by such an effort are inherently un-American and an unnecessary risk in the war on terror.[4]&amp;nbsp; &lt;/p&gt;
&lt;p&gt;Most interestingly, the report also recommends that &amp;quot;All U.S. agencies with counterterrorism programs that collect or &amp;quot;mine&amp;quot; personal data -- such as phone records or Web sites visited -- should be required to evaluate the programs&#039; effectiveness, lawfulness, and impacts on privacy.&amp;quot;[5]&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;div class=&quot;og_rss_groups&quot;&gt;&lt;ul class=&quot;links&quot;&gt;&lt;li class=&quot;first last og_links&quot;&gt;&lt;a href=&quot;/en/node/13855&quot; class=&quot;og_links&quot;&gt;Computer &amp;amp; Information Science&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;&lt;/div&gt;

&lt;div class=&quot;field field-type-text field-field-source&quot;&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;Source&lt;/h3&gt;
  &lt;div class=&quot;field-items&quot;&gt;
      &lt;div class=&quot;field-item&quot;&gt;&lt;p&gt;[1]&quot;Information Awareness Office&quot;, Wikipedia, &lt;a href=&quot;http://en.wikipedia.org/wiki/Information_Awareness_Office&quot; title=&quot;http://en.wikipedia.org/wiki/Information_Awareness_Office&quot;&gt;http://en.wikipedia.org/wiki/Information_Awareness_Office&lt;/a&gt;&lt;br /&gt;
[2]&quot;TIA Lives On&quot;, Shane Harris, National Journal, Feb 23, 2006, &lt;a href=&quot;http://www.nationaljournal.com/about/njweekly/stories/2006/0223nj1.htm&quot; title=&quot;http://www.nationaljournal.com/about/njweekly/stories/2006/0223nj1.htm&quot;&gt;http://www.nationaljournal.com/about/njweekly/stories/2006/0223nj1.htm&lt;/a&gt;&lt;br /&gt;
[3]&quot;Data Mining:  Federal Efforts Cover a Wide Range of Uses&quot;, United States General Accounting Office, May 2004, pg10-12&lt;br /&gt;
[4]&quot;Data-Mining for Terrorists Not &#039;Feasible,&#039; DHS-Funded Study Finds&quot;, Ryan Singel, October 7, 2008, WIRED, &lt;a href=&quot;http://blog.wired.com/27bstroke6/2008/10/data-mining-for.html&quot; title=&quot;http://blog.wired.com/27bstroke6/2008/10/data-mining-for.html&quot;&gt;http://blog.wired.com/27bstroke6/2008/10/data-mining-for.html&lt;/a&gt;&lt;br /&gt;
[5]Protecting Individual Privacy in the Struggle Against Terrorists:&lt;br /&gt;
A Framework for Program Assessment,National Research Council, October 7, 2008.&lt;/p&gt;
&lt;/div&gt;
  &lt;/div&gt;
&lt;/div&gt;
</description>
 <comments>http://sciencex2.org/en/node/52864#comments</comments>
 <category domain="http://sciencex2.org/en/taxonomy/term/575">data mining</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/3302">national research council</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/3301">terorrism</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/3300">tia</category>
 <group domain="http://sciencex2.org/en/node/13855">Computer &amp;amp; Information Science</group>
 <pubDate>Thu, 16 Oct 2008 18:52:09 -0700</pubDate>
 <dc:creator>Jerry Sheehan</dc:creator>
 <guid isPermaLink="false">52864 at http://sciencex2.org</guid>
</item>
<item>
 <title> Database tools for science -opportunity for collaboration on standards</title>
 <link>http://sciencex2.org/en/node/41463</link>
 <description>&lt;h3 class=&quot;field-label&quot;&gt;Description&lt;/h3&gt;
&lt;div class=&quot;content&quot;&gt;
   &lt;p&gt;In the recent Nature issue (Nature 455, 28-29 (4 September 2008) | doi:10.1038/455028a; Published online 3 September 2008) a very interesting special section on &amp;quot;big data&amp;quot; is presented: (&lt;a href=&quot;http://www.nature.com/news/specials/bigdata/index.html&quot; title=&quot;http://www.nature.com/news/specials/bigdata/index.html&quot;&gt;http://www.nature.com/news/specials/bigdata/index.html&lt;/a&gt;).&lt;br /&gt;
This especially caught my attention because of project I&#039;m working on, and we are doing exactly what is discussed in the following article regarding generation of data and providing a sophisticated archive with lots of functionality as a part of the project. Questions arise as to curation, access, and desired functions. It is expected that this is becoming commonplace as a way to work will continue to evolve in the scientific community, as will the need for standards: &lt;a href=&quot;http://www.nature.com/nature/journal/v455/n7209/full/455028a.html&quot; title=&quot;http://www.nature.com/nature/journal/v455/n7209/full/455028a.html&quot;&gt;http://www.nature.com/nature/journal/v455/n7209/full/455028a.html&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;It is an excellent opportunity for scientific institutions to take the lead.&lt;/p&gt;
&lt;p&gt;Excerpts:&lt;br /&gt;
&amp;quot;Community standards for data description and exchange are crucial. These facilitate data reuse by making it easier to import, export, compare, combine and understand data. Standards also eliminate the need for each data creator to develop unique descriptive practices. They open the door to development of disciplinary repositories for specific classes of data and specialized software management tools.&amp;quot;&lt;br /&gt;
&amp;quot;The time is right for scientists to take stock of the institutionalized data services that are available or under development, to understand how these institutions are governed and financed, and to make choices about the best strategies for their disciplines. Can a discipline-oriented solution work? If a university-based system seems more practical, what can be done to expedite the move to university consortia strategies? As the volume of data, and the need to manage it grows, disciplinary consensus leadership will be very powerful factors in addressing the challenges ahead.&amp;quot;&lt;/p&gt;
&lt;div class=&quot;og_rss_groups&quot;&gt;&lt;ul class=&quot;links&quot;&gt;&lt;li class=&quot;first last og_links&quot;&gt;&lt;a href=&quot;/en/node/13859&quot; class=&quot;og_links&quot;&gt;Structure, Tools, and Platforms of Science&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;&lt;/div&gt;

&lt;div class=&quot;field field-type-text field-field-source&quot;&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;Source&lt;/h3&gt;
  &lt;div class=&quot;field-items&quot;&gt;
      &lt;div class=&quot;field-item&quot;&gt;&lt;p&gt;&lt;a href=&quot;http://www.nature.com/news/specials/bigdata/index.html&quot; title=&quot;http://www.nature.com/news/specials/bigdata/index.html&quot;&gt;http://www.nature.com/news/specials/bigdata/index.html&lt;/a&gt;&lt;br /&gt;
&lt;a href=&quot;http://www.nature.com/nature/journal/v455/n7209/full/455028a.html&quot; title=&quot;http://www.nature.com/nature/journal/v455/n7209/full/455028a.html&quot;&gt;http://www.nature.com/nature/journal/v455/n7209/full/455028a.html&lt;/a&gt;&lt;/p&gt;
&lt;/div&gt;
  &lt;/div&gt;
&lt;/div&gt;
</description>
 <comments>http://sciencex2.org/en/node/41463#comments</comments>
 <category domain="http://sciencex2.org/en/taxonomy/term/584">collaboration</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/575">data mining</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/780">data overload</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/988">research tools</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/301">scientific infrastructure</category>
 <group domain="http://sciencex2.org/en/node/13859">Structure, Tools, and Platforms of Science</group>
 <pubDate>Mon, 08 Sep 2008 13:49:51 -0700</pubDate>
 <dc:creator>Patricia Larenas</dc:creator>
 <guid isPermaLink="false">41463 at http://sciencex2.org</guid>
</item>
<item>
 <title>Google Contributes Funding to NASA Space Science Technology + Mission</title>
 <link>http://sciencex2.org/en/node/24128</link>
 <description>&lt;h3 class=&quot;field-label&quot;&gt;Description&lt;/h3&gt;
&lt;div class=&quot;content&quot;&gt;
   &lt;p&gt;Google has contributed funding to the Transiting Exoplanet Survey Satellite (TESS), a NASA spacecraft (smallsat) in cooperation with MIT and the Harvard Smithsonian Center that could potentially be launched in 2012.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Google, the Internet search powerhouse that in recent years has expanded to include mapping of the stars as well as the surfaces of the moon and Mars and which has an ongoing collaboration with NASA&#039;s Ames Research Center, provided a small seed grant to fund development of the wide-field digital cameras needed for the satellite. Because of the huge amount of data that will be generated by the satellite, Google has an interest in working on the development of ways of sifting through that data to find useful information.&lt;/p&gt;
&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;It&#039;s interesting that private companies seem to be increasingly funding large science projects.... (N.B., I&#039;ve noticed this mostly in the physical sciences...)&lt;/p&gt;
&lt;div class=&quot;og_rss_groups&quot;&gt;&lt;ul class=&quot;links&quot;&gt;&lt;li class=&quot;first last og_links&quot;&gt;&lt;a href=&quot;/en/node/13863&quot; class=&quot;og_links&quot;&gt;Engineering &amp;amp; Design&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;&lt;/div&gt;

&lt;div class=&quot;field field-type-text field-field-source&quot;&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;Source&lt;/h3&gt;
  &lt;div class=&quot;field-items&quot;&gt;
      &lt;div class=&quot;field-item&quot;&gt;&lt;p&gt;&lt;a href=&quot;http://web.mit.edu/newsoffice/2008/google-planets-tt0319.html&quot; title=&quot;http://web.mit.edu/newsoffice/2008/google-planets-tt0319.html&quot;&gt;http://web.mit.edu/newsoffice/2008/google-planets-tt0319.html&lt;/a&gt;&lt;/p&gt;
&lt;/div&gt;
  &lt;/div&gt;
&lt;/div&gt;
</description>
 <comments>http://sciencex2.org/en/node/24128#comments</comments>
 <category domain="http://sciencex2.org/en/taxonomy/term/589">Astronomy</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/518">astrophysics</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/575">data mining</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/780">data overload</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/1314">exoplanets</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/1190">Funding</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/569">google</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/1127">NASA</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/2250">NASA Ames</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/5">physics</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/701">Planetary Science</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/6">space</category>
 <group domain="http://sciencex2.org/en/node/3660">Physics &amp;amp; Space Science</group>
 <group domain="http://sciencex2.org/en/node/13863">Engineering &amp;amp; Design</group>
 <pubDate>Tue, 10 Jun 2008 00:21:21 -0700</pubDate>
 <dc:creator>Matt Daniels</dc:creator>
 <guid isPermaLink="false">24128 at http://sciencex2.org</guid>
</item>
<item>
 <title>Web companies race to social and open information streams</title>
 <link>http://sciencex2.org/en/node/343</link>
 <description>&lt;div class=&quot;field field-type-text field-field-description&quot;&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;Description&lt;/h3&gt;
  &lt;div class=&quot;field-items&quot;&gt;
      &lt;div class=&quot;field-item&quot;&gt;&lt;p&gt;Many large web companies are racing towards a similar goal: owning people&#039;s online attention streams by providing the most benefit to those that use their services.  For an example of this direction of new products, a San Francisco company, Radar Networks, recently unveiled a new service at the Web 2.0 Summit, Twine, which will assimilate and make sense of the information in users&#039; lives if they feed it with bookmarks and the content of emails. &lt;/p&gt;
&lt;p&gt;At the same time, we can see a push towards letting users choose if they want to open their digital identity and network.  A new initiative from Google will add a social component to more of their services.  Similar to Facebook, it looks like Google will open up a new set of APIs to let developers access the services.&lt;/p&gt;
&lt;blockquote&gt;&lt;p&gt;Some details about Maka-Maka have already leaked out, particularly how Google plans to use the feed engine that powers Google Reader (known internally as Reactor) to create “activity streams” for other applications akin to Facebook’s news and mini feeds.&lt;/p&gt;
&lt;p&gt;Google is planning to “out open” Facebook with a new set of APIs that developers can use to build apps for its social network Orkut, iGoogle, and eventually other applications as well.&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;A New York Times article describes Google, Myspace, LinkedIn and some of the most popular social networking sites convening on a commons standard for social network developers, OpenSocial:&lt;/p&gt;
&lt;blockquote&gt;&lt;p&gt;MySpace and Bebo, two of the world’s largest social networking sites, on Thursday joined a Google-led alliance that is promoting a common set of standards for software developers to write programs for social networks....&lt;/p&gt;
&lt;p&gt;The open standards could create a boom of innovation around social networks as applications reach more users than ever and encourage developers to create more Internet tools.&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;Data mining on a large scale has been common practice for companies and the government for years now, but these new ambitious data aggregation and data mining projects will potentially open their results to the public.  At the least, each individual user will be benefitted with better recommendations by the addition of data by others.&lt;/p&gt;
&lt;p&gt;Radar Networks: Twine &lt;a href=&quot;http://www.twine.com/&quot; title=&quot;http://www.twine.com/&quot;&gt;http://www.twine.com/&lt;/a&gt;&lt;br /&gt;
Google&#039;s Response to Facebook &lt;a href=&quot;http://www.techcrunch.com/2007/10/29/googles-response-to-facebook-maka-maka/&quot; title=&quot;http://www.techcrunch.com/2007/10/29/googles-response-to-facebook-maka-maka/&quot;&gt;http://www.techcrunch.com/2007/10/29/googles-response-to-facebook-maka-maka/&lt;/a&gt;&lt;br /&gt;
MySpace Joins Google Alliance&lt;br /&gt;
&lt;a href=&quot;http://www.nytimes.com/2007/11/02/technology/02google.html?_r=2&amp;amp;oref=slogin&amp;amp;oref=slogin&quot; title=&quot;http://www.nytimes.com/2007/11/02/technology/02google.html?_r=2&amp;amp;oref=slogin&amp;amp;oref=slogin&quot;&gt;http://www.nytimes.com/2007/11/02/technology/02google.html?_r=2&amp;amp;oref=slogin&amp;amp;oref=slogin&lt;/a&gt;&lt;br /&gt;
Plaxo: More on social network portability &lt;a href=&quot;http://blog.plaxo.com/archives/2007/08/more_on_social.html&quot; title=&quot;http://blog.plaxo.com/archives/2007/08/more_on_social.html&quot;&gt;http://blog.plaxo.com/archives/2007/08/more_on_social.html&lt;/a&gt;&lt;br /&gt;
Twine, Freebase and Powerset &lt;a href=&quot;http://mikelove.wordpress.com/2007/10/21/twine-freebase-and-powerset/&quot; title=&quot;http://mikelove.wordpress.com/2007/10/21/twine-freebase-and-powerset/&quot;&gt;http://mikelove.wordpress.com/2007/10/21/twine-freebase-and-powerset/&lt;/a&gt;&lt;/p&gt;
&lt;/div&gt;
  &lt;/div&gt;
&lt;/div&gt;

&lt;div class=&quot;field field-type-nodereference field-field-signal-1&quot;&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;Signals&lt;/h3&gt;
  &lt;div class=&quot;field-items&quot;&gt;
          &lt;div class=&quot;field-item&quot;&gt;&lt;/div&gt;
      &lt;/div&gt;
&lt;/div&gt;
</description>
 <comments>http://sciencex2.org/en/node/343#comments</comments>
 <category domain="http://sciencex2.org/en/taxonomy/term/574">API</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/575">data mining</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/569">google</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/572">open</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/571">social graph</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/282">social networks</category>
 <category domain="http://sciencex2.org/en/taxonomy/term/284">social software</category>
 <pubDate>Tue, 30 Oct 2007 14:05:30 -0700</pubDate>
 <dc:creator>Mike Love</dc:creator>
 <guid isPermaLink="false">343 at http://sciencex2.org</guid>
</item>
</channel>
</rss>
