Big Data Rookie<br />
By sonabinu (http://www.blogger.com/profile/15920933670808502387); feed last updated 2024-02-08<br />
<br />
<b>Resources for Learning more on Hadoop</b> (posted 2013-01-19)<br />
I recently attended a Hadoop Users Group meetup. Here are some of the suggested materials for learning more about Hadoop and its ecosystem. <br />
<br />
<a href="http://www.cloudera.com/content/cloudera/en/resources/library/training/cloudera-essentials-for-apache-hadoop-the-motivation-for-hadoop.html" target="_blank">Essentials for Apache Hadoop</a> - Register and watch six recorded webinars from Cloudera on Hadoop.<br />
<br />
<a href="http://developer.yahoo.com/hadoop/tutorial/" target="_blank">Yahoo! Hadoop Tutorial</a> - A series of tutorials on how to use the Hadoop distributed data processing environment.<br />
<br />
<a href="http://shop.oreilly.com/product/9780596521981.do" target="_blank">Hadoop: The Definitive Guide</a> - The Hadoop Bible<br />
<br />
<a href="http://search.oreilly.com/?i=1;q=Hadoop;q1=Books;x=0;x1=t1;y=0&act=fc_contenttype_Books" target="_blank">O’Reilly Ecosystem books</a> - Hive, Pig, HBase, Cassandra, others<br />
<br />
<a href="http://www.amazon.com/Hadoop-Action-Chuck-Lam/dp/1935182196" target="_blank">Hadoop in Action</a> and <a href="http://www.amazon.com/Hadoop-Practice-Alex-Holmes/dp/1617290238/ref=sr_1_1?s=books&ie=UTF8&qid=1358660514&sr=1-1&keywords=hadoop+in+practice" target="_blank">Hadoop in Practice</a> - Example-based books. Very pragmatic; they get you up to speed quickly.<br />
<br />
Other links:<br />
<br />
Cloudera Training & Distributions<br />
<a href="http://www.cloudera.com/resources/">http://www.cloudera.com/resources/</a><br />
<a href="https://ccp.cloudera.com/display/SUPPORT/Downloads">https://ccp.cloudera.com/display/SUPPORT/Downloads</a><br />
Hortonworks Training & Distributions<br />
<a href="http://hortonworks.com/community/">http://hortonworks.com/community/</a><br />
<a href="http://hortonworks.com/download/">http://hortonworks.com/download/</a><br />
Hadoop World 2010, 2011, 2012 - Slides and video<br />
<a href="http://www.hadoopworld.com/">http://www.hadoopworld.com/</a><br />
Cloudera Essentials Series – 1 to 6 (Audio) <a href="http://www.cloudera.com/search/?q=essentials">http://www.cloudera.com/search/?q=essentials</a><br />
Apache Hadoop – Petabytes and Terawatts<br />
<a href="http://www.youtube.com/watch?v=SS27F-hYWfU">http://www.youtube.com/watch?v=SS27F-hYWfU</a><br />
History of Hadoop<br />
<a href="http://www.wired.com/wiredenterprise/2011/10/how-yahoo-spawned-hadoop/">http://www.wired.com/wiredenterprise/2011/10/how-yahoo-spawned-hadoop/</a><br />
Adam Bosworth Interview from 2005 (source of some quotes in this presentation)<br />
<a href="http://itc.conversationsnetwork.org/shows/detail571.html">http://itc.conversationsnetwork.org/shows/detail571.html</a><br />
An Intro to Hadoop – Mark Fei<br />
<a href="http://cdn.oreillystatic.com/en/assets/1/event/85/An%20Introduction%20to%20Hadoop%20Presentation.pdf">http://cdn.oreillystatic.com/en/assets/1/event/85/An%20Introduction%20to%20Hadoop%20Presentation.pdf</a><br />
YARN/MRv2 Information (Next-Generation Hadoop)<br />
<a href="http://blog.cloudera.com/blog/2012/02/mapreduce-2-0-in-hadoop-0-23/">http://blog.cloudera.com/blog/2012/02/mapreduce-2-0-in-hadoop-0-23/</a><br />
<a href="http://hadoop.apache.org/docs/r0.23.0/index.html">http://hadoop.apache.org/docs/r0.23.0/index.html</a><br />
Brad Hedlund - Understanding Hadoop Clusters and the Network<br />
<a href="http://bradhedlund.com/2011/09/10/understanding-hadoop-clusters-and-the-network/#download">http://bradhedlund.com/2011/09/10/understanding-hadoop-clusters-and-the-network/#download</a><br />
<br />
<br />
<b>Key research papers behind the growth in Big Data tools</b> (posted 2012-12-05)<br />
Many big data tools, including Hadoop, owe their beginnings to research papers published by Google and Amazon. These papers are a good place to start when trying to understand the technologies and tools that drive big data analytics. Below are descriptions of each technology and links to the research papers.<br />
<br />
<b><a href="http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/archive/bigtable-osdi06.pdf" target="_blank"><span style="color: #a64d79;">BigTable</span></a> </b>- Bigtable is a distributed storage system for managing structured data that is designed to scale to a very large size: petabytes of data across thousands of commodity servers. Many projects at Google store data in Bigtable, including web indexing, Google Earth, and Google Finance.<br />
<br />
<span style="color: #a64d79;"><b>MapReduce</b></span> - MapReduce is a programming model and an associated implementation for processing and generating large data sets. Google has implemented hundreds of special-purpose computations that process large amounts of raw data using MapReduce. Inspired by the map and reduce primitives present in Lisp and many other functional languages, Google introduced MapReduce. Users specify a map function that processes a key/value pair to generate a set of intermediate key/value pairs, and a reduce function that merges all intermediate values associated with the same intermediate key.<br />
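To make the map and reduce roles concrete, here is a minimal single-process sketch of the classic word-count example. It is only an illustration of the programming model, not Google's or Hadoop's implementation: the names map_fn, reduce_fn and run_mapreduce and the two sample documents are my own assumptions, and a real framework would spread the same steps across many machines.

```python
from collections import defaultdict


def map_fn(doc_id, text):
    """Map: emit an intermediate (word, 1) pair for every word in one document."""
    for word in text.lower().split():
        yield word, 1


def reduce_fn(word, counts):
    """Reduce: merge all intermediate values that share the same key."""
    return word, sum(counts)


def run_mapreduce(records, map_fn, reduce_fn):
    """Hypothetical single-process stand-in for the framework: map, group by key, reduce."""
    grouped = defaultdict(list)
    for key, value in records:
        for out_key, out_value in map_fn(key, value):
            grouped[out_key].append(out_value)
    return dict(reduce_fn(k, v) for k, v in sorted(grouped.items()))


# Made-up input: (document id, document text) pairs.
docs = [("doc1", "big data big tools"), ("doc2", "data tools for big data")]
word_counts = run_mapreduce(docs, map_fn, reduce_fn)
print(word_counts)
```

The grouping step in the middle is the part the framework normally does for you; the user only writes the two small functions.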
<br />
<a href="http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/archive/gfs-sosp2003.pdf" target="_blank"><span style="color: #a64d79;"><b>Google File System</b></span></a> - Google designed and implemented the Google File System (GFS) to meet the rapidly growing demands of Google’s data processing needs.<br />
<br />
<b style="background-color: white;"><a href="http://www.read.seas.harvard.edu/~kohler/class/cs239-w08/decandia07dynamo.pdf" target="_blank"><span style="color: #a64d79;">Dynamo</span></a><span style="color: #a64d79;"> </span></b>- Dynamo is a highly available key-value storage system that some of Amazon’s core services use to provide an “always-on” experience.<br />
<br />
<br />
<b>Few thoughts on visualizing Big Data</b> (posted 2012-11-12)<br />
Big data is about volume, velocity and data in a variety of formats. How do you communicate meaning from petabytes of data? If you had only one page or slide to capture the attention of your audience, which medium would you choose? I would say a visual, specifically an image that visualizes the data. Data visualization can be the key to exposing something new about the underlying patterns and relationships contained in thousands or even millions of rows of data. It is increasingly becoming the language preferred by customers and collaborators for understanding the data you present: dynamic images communicate better than long lists of numbers.<br />
<br />
My first positively awe-inspiring moment on the power of visual data came when viewing Hans Rosling's presentation (available on <a href="http://www.ted.com/talks/hans_rosling_shows_the_best_stats_you_ve_ever_seen.html" target="_blank">TED</a>) on global trends in health and economics. Another dynamic visual example is the <a href="http://vimeo.com/19088241" target="_blank">History of the World in 100 seconds</a>, created by parsing an XML dump of all Wikipedia articles and pulling out 424,000 articles and 35,000 references to events with coordinates. <i><b>How do art, story-telling and information come together to create a dynamic visual?</b></i> Below are links to some interesting articles exploring this theme -<br />
<br />
<br />
<a href="http://www.forbes.com/sites/sap/2012/11/08/big-data-a-picture-is-worth-a-thousand-words/" target="_blank">Big Data : A Picture is Worth a Thousand Words</a> : Visualize big data to make better decisions. Visualization provides data in a format that’s easy for business users to digest and use.<br />
<br />
<a href="http://www.forbes.com/sites/naomirobbins/2012/06/13/conflicting-advice-on-data-visualization/" target="_blank">Conflicting Advice on Data Visualization</a> : Data visualization is often thought of as a simple communication tool. Is there room for artistic expression and what design features and labeling methodology should you consider while creating a visual image of data?<br />
<br />
<a href="http://www.forbes.com/sites/oreillymedia/2012/02/22/data-visualizations-are-more-than-just-pictures/" target="_blank">Data Visualizations Are More Than Just Pictures</a> : When visualization is done right, it can reveal so much. Data visualizations are a kind of bidirectional encoding that lets ideas and information be transported from the database into your brain. Software and automation help you iterate on data quickly and experiment with it to find the signal within the noise. Interactive visualizations add a new dimension to complex data sets, enhancing the audience's ability to understand a company's business. How will you ensure that the graphs or visuals are not incomplete or misleading representations of the knowledge your company holds? Should you hire a professional who understands principles of design and visual communication?<br />
<br />
<a href="http://online.wsj.com/article/SB10001424052702303299604577323743008395060.html" target="_blank"> Making Data Beautiful </a>: Making data visually beautiful so that it becomes a pleasure for us to absorb.<br />
<br />
The New York Times has a team dedicated to data visualization and information design. Here is a <a href="http://www.smallmeans.com/new-york-times-infographics/" target="_blank">link</a> to a page with some interesting graphics from nytimes.com. The Times also has some <a href="http://learning.blogs.nytimes.com/2010/08/23/teaching-with-infographics-places-to-start/" target="_blank"> interesting examples</a> of how graphics can be used in the classroom and a list of places to start learning about infographics. There is a <a href="http://learning.blogs.nytimes.com/2011/04/08/data-visualized-more-on-teaching-with-infographics/" target="_blank">sequel</a> to this article with more information on teaching how to create and interpret infographics.<br />
<br />
<br />
<b>Cloudera's Impala</b> (posted 2012-11-02)<br />
Cloudera is the best-known Hadoop vendor around. Last week Cloudera announced its latest offering, Project Impala.<br />
<br />
Project Impala is a parallel real-time query engine that can run atop the raw Hadoop Distributed File System (HDFS) or the HBase tabular overlay for HDFS that makes it look somewhat like a relational database.
<br />
<br />
Impala does not work through Hadoop MapReduce.
Impala uses a SQL-like syntax and allows you to query data, whether stored in HDFS or Apache HBase – including SELECT, JOIN, and aggregate functions – in real time.<br />
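To give a feel for the kind of query this means, here is a small sketch of a SELECT with a JOIN and aggregate functions. It is an assumption-heavy stand-in: it runs against an in-memory SQLite database through Python's standard sqlite3 module, not against Impala, HDFS or HBase, and the customers and orders tables and their rows are made up for illustration. Impala's own dialect and setup may differ in the details.

```python
import sqlite3

# In-memory SQLite database used purely as a stand-in for a SQL engine.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, region TEXT);
    CREATE TABLE orders (customer_id INTEGER, amount REAL);
    INSERT INTO customers VALUES (1, 'east'), (2, 'west'), (3, 'east');
    INSERT INTO orders VALUES (1, 10.0), (1, 5.0), (2, 7.5), (3, 2.5);
""")

# SELECT + JOIN + aggregate functions: order count and total amount per region.
query = """
    SELECT c.region, COUNT(*) AS order_count, SUM(o.amount) AS total_amount
    FROM orders o
    JOIN customers c ON o.customer_id = c.id
    GROUP BY c.region
    ORDER BY c.region
"""
rows = conn.execute(query).fetchall()
print(rows)
conn.close()
```

The point is only the shape of the query: analysts who already know SQL can ask this sort of question without writing a MapReduce job.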
<br />
Here is a list of articles and opinions on what this will mean for Hadoop users:
<br />
<ul>
<li>
<a href="http://gigaom.com/data/cloudera-makes-sql-a-first-class-citizen-in-hadoop/">Cloudera makes SQL a first-class citizen in Hadoop</a>
</li>
<li>
<a href="http://www.dbms2.com/2012/10/24/cloudera-impala-hadoop/">Quick notes on Impala</a>
</li>
<li>
<a href="http://www.zdnet.com/clouderas-impala-brings-hadoop-to-sql-and-bi-7000006413/">Cloudera’s Impala brings Hadoop to SQL and BI</a>
</li>
<li>
<a href="http://bits.blogs.nytimes.com/2012/10/24/big-data-in-more-hands/">Big Data in More Hands</a>
</li>
<li>
<a href="http://www.informationweek.com/software/information-management/cloudera-debuts-real-time-hadoop-query/240009673">Cloudera Debuts Real-Time Hadoop Query</a>
</li>
<li>
<a href="http://www.cmswire.com/cms/information-management/cloudera-brings-realtime-sqllike-experience-to-hadoop-stratany2012-017965.php">Cloudera Brings Real-Time, SQL-like Experience to Hadoop</a>
</li>
<li>
<a href="http://www.marketwatch.com/story/cloudera-announces-game-changing-real-time-query-on-hadoop-and-leads-a-new-era-of-data-management-2012-10-24">Cloudera Announces Game-Changing, Real-Time Query on Hadoop and Leads a New Era of Data Management</a>
</li>
<li>
<a href="http://blog.cloudera.com/blog/2012/10/cloudera-impala-real-time-queries-in-apache-hadoop-for-real/">Cloudera Impala: Real-Time Queries in Apache Hadoop, For Real</a>
</li>
</ul>
<br />
<br />
<b>A place for the latest Hadoop news</b> (posted 2012-11-01)<br />
For a novice in big data, Hadoop is a moment of truth. Learning what Hadoop does stirred my imagination; it was the first time I realized that it is possible to bring together a variety of data types and find patterns without going down the conventional RDBMS path. Here is a <a href="http://www.cmswire.com/news/topic/hadoop">link</a> to the latest news on Hadoop.<br />
<br />
<br />
<b>Wow ... Microsoft</b> (posted 2012-10-30)<br />
There are very few recent examples where Microsoft makes you dream of the possibilities of software. This <a href="http://www.nytimes.com/2012/10/30/technology/microsoft-renews-relevance-with-machine-learning-technology.html?hpw">article</a> in the nytimes.com is a great example of the payoff from investment in R&D. Microsoft is likely to create a new revenue stream with its entry into the big data scene.<br />
<br />
<br />
<b>HBR on Big Data</b> (posted 2012-10-29)<br />
Harvard Business Review has a 'big' segment on Big Data. 
This <a href="http://hbr.org/special-collections/insight/big-data">link</a> leads to a rich collection of articles documenting experiences from retailers and social networking sites, management perspectives, emerging careers in the field and a host of opinions on the impact of analytics on decision making.<br />
<br />
<br />
<b>A 'go to' for the latest tech news</b> (posted 2012-10-22)<br />
I always wonder which is the best place to see what is buzzing in technology. Traditional sources like nytimes.com, wsj.com and forbes.com all cover tech in detail, but most news makes it to these papers only after it has been news for a while. Where can we hear about something when it is just a concept or an idea that is popular among techies? I stumbled across <a href="http://news.ycombinator.com/news">Hacker News</a> while following something mentioned in an article on Forbes.com. So far, it has proved to be a reliable source.<br />
<br />
<br />
<b>Who is a Data Scientist?</b> (posted 2012-10-08)<br />
A data scientist is central to extracting facts and figures from large volumes of data. Apart from being able to find patterns in large data sets, the data scientist should be able to mine for the most important and business-focused parts and present them to business users at all levels of the enterprise. They are part geek, part story-teller and part graphic illustrator, as they deal with algorithms, use narratives to explain their findings and make visual/graphic illustrations to communicate it all. 
Charles Roe tackles this question in an <a href="http://www.dataversity.net/so-you-want-to-be-a-data-scientist/?mkt_tok=3RkMMJWWfF9wsRonvq3KZKXonjHpfsX56%2B8vUKG1lMI%2F0ER3fOvrPUfGjI4GT8R0dvycMRAVFZl5nRtRFvOddZJF%2BfpTDUSgXTX8hbI%3D">article</a> published at <a href="http://www.dataversity.net/">Dataversity</a>. Ray Rivera (Director, Solutions Management, Workforce Planning and Analytics, SAP) has written a Forbes <a href="http://www.forbes.com/sites/sap/2012/10/05/what-makes-analytics-wizards-so-good-they-do-everything-backwards/">guest article</a> on this very subject. He chooses to call data scientists 'analytical wizards'!<br />
<br />
<br />
<b>A look at the possibilities of R</b> (posted 2012-10-01)<br />
I came across an <a href="http://www.nytimes.com/2009/01/07/technology/business-computing/07program.html?pagewanted=all&_r=0">article</a>, written about three years back, talking about the possibilities of R. It is well written and touches on many of the striking features of R that make it attractive to users.
<br />
<br />
<b>Marriage of Cloud Computing and Big Data Analytics</b> (posted 2012-08-28)<br />
The links between cloud computing and big data analytics are becoming increasingly strong. Without the storage capacity and cheap computing power offered by the cloud, it would be virtually impossible for many companies to engage in the business of analyzing large volumes of data. This New York Times <a href="http://www.nytimes.com/2012/08/28/technology/active-in-cloud-amazon-reshapes-computing.html?_r=1&hp">article</a> explores how Amazon Web Services provides cloud services to companies across the globe, changing conventional business models as well as traditional company structures and resource utilization. A Forbes <a href="http://www.forbes.com/sites/joemckendrick/2012/08/27/the-8-most-important-skills-needed-for-cloud-computing-today/">article</a> explores another side of cloud computing: the changing skill set needed to be successful when using cloud computing power.
The two articles indicate close ties between cloud and analytics, and it is clear that this relationship is likely to grow even tighter in the future.<br />
<br />
<br />
<b>Storing Data on DNA</b> (posted 2012-08-16)<br />
Today I read two posts on how huge volumes of data can be stored outside of conventional devices like computer chips, drives and discs. One option that a Wall Street Journal article discusses is DNA. A research report in the journal <i>Science</i> reports that a group of Harvard researchers translated the English text of an upcoming book on genomic engineering into actual DNA. The article can be read <a href="http://www.blogger.com/the%20group%20translated%20the%20English%20text%20of%20a%20coming%20book%20on%20genomic%20engineering%20into%20actual%20DNA">here</a>. Another <a href="http://www.forbes.com/sites/netapp/2012/08/15/big-data-needs-big-storage-where-to-keep-gigabytes-terabytes-and-petabytes-of-data/">article</a> in Forbes examines the issue of data storage and suggests storing data in bacteria and diamonds. These ideas are in preliminary stages and commercial applications are likely to be a long way off, but one thing is certain: the explosive growth of digital data is driving the development of alternative storage solutions.<br />
<br />
<br />
<b>Data and analysis, the job engine of the future?</b> (posted 2012-07-19)<br />
The growth of cloud computing and storage capacity has given rise to new capabilities in data analytics. 
<a href="http://www.forbes.com/sites/netapp/2012/07/11/nearly-14-million-new-jobs-by-2015-the-cloud-has-a-silver-lining-in-a-stormy-economy/">This</a> article in Forbes explores world-wide job growth in these areas.<br />
<br />
<br />
<b>Big Data on Campus</b> (posted 2012-07-18)<br />
A nytimes.com article examines how big data is being used by higher learning institutions to shape how students choose courses and classes. It sounded a bit too Orwellian for my taste. It is worth reading as a futuristic look into how big data and analytics shape our choices. Click <a href="http://www.nytimes.com/2012/07/22/education/edlife/colleges-awakening-to-the-opportunities-of-data-mining.html?pagewanted=all">here</a> for the link.<br />
<br />
<br />
(posted 2012-07-05)<br />
I came across an academic paper written a couple of years back. It uses machine learning algorithms to analyse consumer credit risk and predict the probability of default. The paper comes from the MIT Sloan School of Management. What I take away from it is the range of possibilities opened up by applying the tools of big data analytics to broad issues like credit risk that have a bearing on the economy at large. 
Click <a href="http://www.argentumlux.org/documents/CRisk_final.pdf">here</a> for the link.<br />
<br />
<br />
<b>A real time application of a tracking tool</b> (posted 2012-06-19)<br />
Check out <a href="http://www.forbes.com/sites/ericaswallow/2012/06/19/remarkable-hire/">this</a> article in forbes.com on how search results are used to evaluate how well a candidate knows what s/he claims to be a core skill. There is a long way to go before this becomes a general application tool, but it is sufficient to get one's imagination fired up about the possibilities it holds.<br />
<br />
<br />
<b>IT knows you</b> (posted 2012-06-16)<br />
<a href="http://www.nytimes.com/2012/06/17/technology/acxiom-the-quiet-giant-of-consumer-database-marketing.html?_r=1&hpw">Here</a> is an article from the nytimes.com on how IT knows you.<br />
<br />
<br />
(posted 2012-06-11)<br />
A general white paper on big data from DataStax. 
Go <a href="http://www.datastax.com/wp-content/uploads/2011/10/WP-DataStax-BigData.pdf">here</a>.<br />
<br />
<br />
(posted 2012-05-31)<br />
Here is an online <a href="http://otexts.com/fpp/">book</a> that could help those learning to forecast using R.<br />
<br />
<br />
(posted 2012-05-02)<br />
R is an important open source tool for analytics. Here is a <a href="http://pairach.com/2012/02/26/r-tutorials-from-universities-around-the-world/">link</a> to R tutorials from a blog that recently came to my attention!<br />
<br />
<br />
<b>A conversation on Big Data</b> (posted 2012-04-30)<br />
Sometimes you want to hear from the companies that are actually innovating in the big data space. You want to know that they are not just talking possibilities but are talking about applications they use. Click <a href="http://te11.techonomy.com/corporation/big-data-panning-gold-information-stream">here</a> to hear such a conversation. The panelists include experts from leading companies like Symantec and Google, and also others like Baynote and Collective(i). The video is over an hour long but exciting, and worth listening to.<br />
<br />
<br />
<b>Making data human using visuals</b> (posted 2012-04-25)<br />
Imagine using data visualisation tools to put data into a human context. 
Here is a <a href="http://www.ted.com/talks/jer_thorp_make_data_more_human.html">link</a> that explores making data human.<br />
<br />
<br />
<b>Answers to drowning in data</b> (posted 2012-04-25)<br />
When you go online, do you sometimes feel that you are drowning in a sea of data, trying to find that exact piece of information? I do. Is there an easy way to extract quality content, as opposed to what a search engine wants you to read (a click that will maximise their revenue)? Could sophisticated algorithms be the answer, with their capability to understand context and semantic relationships to match web information with specific customer information needs? Click <a href="http://www.forbes.com/sites/ciocentral/2012/04/24/the-web-is-much-bigger-and-smaller-than-you-think/2/">here</a> for a Forbes.com article that examines just this question.<br />
<br />
<br />
(posted 2012-04-23)<br />
I just read an interesting piece on reviewing whether big data is a strategic fit for an organisation. Go <a href="http://blogs.hbr.org/cs/2012/04/how_to_avoid_the_big_data_gotc_1.html?awid=5527246119642008869-3271">here</a> for more.<br />
<br />
<br />
<b>Data visualization</b> (posted 2012-04-14)<br />
Data visualization tools are key to communicating findings, and this can be very effective when dealing with volumes and volumes of data. Forbes has a slideshow featuring some interesting data visualizations ranging from charts used by Florence Nightingale to more contemporary examples. 
Check them out <a href="http://www.forbes.com/pictures/ejig45efge/the-best-data-visualizations-of-all-time-6/">here</a>.