Εμφάνιση αναρτήσεων με ετικέτα data. Εμφάνιση όλων των αναρτήσεων
Εμφάνιση αναρτήσεων με ετικέτα data. Εμφάνιση όλων των αναρτήσεων

Δευτέρα 20 Φεβρουαρίου 2012

Big Data - The next big thing to come... or is it already here?


This is the trailer from a movie called "Moneyball", which is based on the true story of how a low budget baseball team was formed from players who were picked according to a statistical analysis of many players. This is an example of "big data" application. Big data is the systematic analysis of enormous amounts of information and extraction of knowledge (or wisdom) from this an analysis.

Data are being recorded more than ever. Internet in all forms, mobile phones, sensors, are recording continuously and in digital - standardized format all kinds of data. Heart beat rate of runners, traffic in the cities, searches in the web, weather, news, emotions (through social media), location of smartphones, and many other details that there is no point to list now. Recoding is not producing any knowledge. This amount of information is not even manageable by humans alone. This is the playground of computerized processors! Humans are inventing algorithms that analyze with the help of computers the available data. Of course within the limits of our current imagination.

Data are stored though and are available for ever, with the new technologies of the "cloud". They will be there for generations to come. Computers are starting to learn! Because simply... they have unlimited time to look into and combine all this information. And they are way faster than humans in analyzing huge amounts of raw data. And when a new algorithm is available they can look back at all the stored data with the new "eye" made available and enrich their knowledge. So the possibilities are only expanding.

This is a trend that cannot be reversed. Simply because our lives are improving through this procedure. Some examples:
  • Less traffic or plane accidents
  • New health procedures, like epidemic analysis and reactions (Personal Monitoring Tech: Invasion of the Body Trackers)
  • More convenience in everyday life (for example, location services and personalized information on demand and in real time)
  • Longer weather forecasts
  • Superhuman fast stock market reactions and complex financial services
  • Better pricing and distribution of products at retail (super markets for example can plan their stock according to weather forecast, or customer behavior analysis!)
  • Urban crime control through analysis of events in correlation to various parameters like historical arrest patterns, paydays, sporting events, rainfall and holidays!
And for all these you do not need to conduct a survey as in the past. Most likely all the data anyone will ever need are there already. You just have to think of the best way to "dig" in them and extract the knowledge you need.

Of course, as written in a very enlightening New York Times article: "Data is tamed and understood using computer and mathematical models. These models, like metaphors in literature, are explanatory simplifications. They are useful for understanding, but they have their limits. A model might spot a correlation and draw a statistical inference that is unfair or discriminatory, based on online searches, affecting the products, bank loans and health insurance a person is offered, privacy advocates warn." But even this kind of problems can be solved much faster than in the past. Remember... data are there for ever and they only get more and more!

The main issue that must be addresses and resolved is access to the available data. I believe that recording is not bad for anyone, as long as the access to the records are open to anyone! This is the only way that true wisdom can be extracted from all this information. This the "Open Data" movement which is affecting all of us!  The Guardian has been encouraging this movement for some time now.  This a society issue and in my opinion it should be ranking very high in our priorities for the future. I would even call it a political issue!


Of course, there’s a major shortage of analytical talent. According to the McKinsey Global Institute (MGI), this is already an issue in the private sector. MGI predicts that in the US alone there will be an annual shortage of graduates in deep analytical fields of 140,000 to 190,000 by 2018. A very good article on this subject is from Linda Rosencrance of the Spotfire Blogging Team "The Promise and Challenge of Using Big Data to Address World Problems".

Big data in the end is all about wisdom and evolution! And obviously this cannot be left to the hands of few or only in the processing power of computers!

Δευτέρα 6 Φεβρουαρίου 2012

Connected Wearable Computers!

Modern smart-phones have more processing power than the best desktop computers of the '90s. On top of this, they are always connected through GSM or WiFi networks and most of them include, camera, accelerometer and a GPS device. All in one small package that fits in every pocket!

There are two major limitation for the current smartphone technology: Power supply (or batteries), and display "real estate" or size of screen. These two, are also inter-dependant, since larger screens consume more energy. A large battery makes the device bulky and heavy, a large screen makes a smart-phone un-practical as well...

The power supply of a smartphone with all the fancy features described above, allows usually a few hours of usage of all the features. Internet connection and GPS consume a lot and a typical smartphone battery could sustain non stop operation for not more than 4-6 hours. Combined with the larger screen sizes, that enable the user to have access to the information (e.g GPS location on an internet retrieved Google Map) available to him by his phone, can empty a battery very fast.

Lets look at some other devices - apparel that most modern people carry with them everyday. The wrist watch is an example. There are many models of watches that include various sensors, like barometer, thermometer, accelerometer, digital compass or even GPS sensor. The battery of a wrist watch lasts way longer than any of the smart phones. Their screens are very low on energy consumption but also very specific in their use. This new generation of "smart" watches also has some connectivity features, that enable the user to transmit the stored data to a web-service or even retrieve data from remote sensors, like heart rate monitor or power meters in the occasion of a sports watch (like the one at the side picture).

Also many are wearing glasses and sunglasses. This "low" tech apparel is always in front of our eyes, protecting them from harmful light rays, particles or correcting our vision or even for pure aesthetics. Recently we have seen some new products in this segment, starting from the "heavier" sports goggles, where the user can connect wireless to the smartphone and use the goggles as earphones and microphone, or even embed a micro projector to create a Head Mounted Display (HMD) screen.
 
There are many more examples of new products that are exploiting the unprecedented computing power that each one of us has in his pocket with a smart-phone. There are also many sensors that have been introduced, that enhance the capabilities of smart-phones:
  • Sports activities sensors (heart rate, power, stride / cadence, etc)
  • Health status (heart rate, blood sugar level, etc - Personal Monitoring Tech: Invasion of the Body Trackers)
  • Car monitoring services (connectivity to diagnostics - OBDII)
  • Connected external mini cameras (Contour+ is such an example)
  • and many more...

We should not neglect to take into consideration the market demand for new features which could be served very well by an "expanded" smart-phone. A whole new range of media and marketing possibilities will open-up, like:
  • Real life social games
  • Augmented reality applications
  • Big data statistics and marketing (massive amounts of information coming from recording at a scale never possible in the past)
  • Health wellness applications
  • Safety on the road
  • Public transportation information
  • Urban life management
  • etc...

With all these things in mind, I believe that the solution to the two major limitations of smart-phone technology, can be found in the modification of the device in the direction of peripheral connectivity and delegation of tasks. The smart-phone will be the main unit, providing connectivity and processing power, while peripheral devices will be deployed according to the needs, and will be supplying the data needed.

Connected wearable computers with standard communication protocols should be able to keep their screens off for power efficiency, while on demand could be presenting information, like caller id and SMS - micro blogging messages at the screen of the connected wrist watch. Eye-wear could work as microphone and headphone and save even more energy for the smart-phone. At an advanced stage special glasses could work as head mounted displays, feeding the user with information according to his needs. It is very important to have a standard communication protocol with all these devices, because today there are at least 3-4 major protocols available making the development and marketing of such applications very complicated. Freeing some of the processing power of the main unit with this delegation of tasks, would allow development of more demanding applications as well, like voice recognition (iPhone Siri is such an example) that would make interaction with the information so much easier.

Consumers would benefit a lot as well! Imagine if you could upgrade only the accessory you need, instead of buying a whole new device (current smart-phones) every time there is a better photo-camera, or more sensitive GPS or more powerful processor for the phone! If you break one thing, you replace only this specific component and not the whole device. It is also a bit more environment friendly set up.

Trying to fit everything in a box is working only at the initial steps of a new technology. From some point on modularity has to be introduced in order to become more efficient, flexible and release the full potential of the new technology. The market will open up as well to so many more companies with new products that will complement the pocket hidden smart-phone.

Σάββατο 18 Ιουνίου 2011

Collective or social intelligence

Wordle: Social Intelligence




The value of any media is directly proportional to the culture of those who use it. Following this logic I present the path followed by the Social Network Services and the internet in general. Understanding of the direction of evolution for new media will help in the realistic evaluation and use by the community. Also it will help overcome prejudice and cultrural barriers that exist. If there is a shared vision of what we want to achieve then the effectiveness of the medi will be optimal.

Flood of Information

Web sites - Social Media - Augmented reality - Location based services - Blogs - Facebook - Twitter - Internet of things. Terms and concepts which lead to a certain outcome: Plenty of information about people, places, objects and personal views on any matter possible in human mind. A vast universe of information, occasionaly useful, that can result to knowledge or even better wisdom, as long as we become aware of high quality data at the right time. But this immense stream of data is flowing around us and goes untapped, though thankfully recorded and registered. To turn this around and exploit the full potential of the available data, these must be recorded, formatted, categorized so that they become understandable and accessible (in terms of volume and quality), and then should be made easily available to anyone interested! In brief I would say that recorded data should be attributed a meaning (or more!).


 Current technology has given us the means of recording (PCs, networks, open space, GPS, cameras, mobile phones, RFiD) and with the architecture of CLOUD there is ample storage space and computer-processing power. For the past few years has begun with demonic pace, that exceeds our imagination, the recording of information of both the natural world and the spiritual. Tools have also been developed, that can relay this information to all interested parties at any given time. Actually this trend was initiated back in the late '90s when platforms like "Autonomy" made their appearance. It is obvious to see the strength of this trend, just by following the number of doctorates - thesis that have been "invested" on this direction of the information technology market around the world.

Nowadays, the rate of growth of available information is beyond the capability of monitoring and retrieval not only for the average person, but even for specialists or teams of specialists! This rate will certainly continue to accelerate and inflate. So this inflated wealth of info will only grow as there is now a steady flow of data even from objects / devices (internet of things). Under these circumstances, the stored information does not create knowledge at the moment as fast as one would imagine.

Knowledge from the information

Obviously the facts mentioned above highlight the importance of procedures, techniques - visualization and dissemination of knowledge (not simply information) that can be derived from all available data. The average individual in the contemporary society and on an everyday base, should be able to take advantage of the available knowledge stemming from the data. The "nervous system" to transfer the information recorded by the "sensors" is available already. Where "nervous system" put the communication networks and where "sensors" put persons and devices that transmit information. The "recording memory" also exists and I am reffering to the central storage of information, as well as individual - personal capabilities.


The technology should provide us the capability to retrieve, visualize and to some extent automate the transmission of knowledge. We should be able to "see" in front of us solutions, the moment we need them. Solutions that have been used in the past or solutions that may arise from a combination of stored relevant information.


This requires an analysis of the meaning of the information stored so that the neccessary links are established. The meaning can be derived and assigned from some measurable physical characteristics or data analysis. 
  • The simple first step is to georeferenced information. So interested parties can access information in the geographical scope. This is already starting to happen.
  • A second step is the symantic categorization which is achieved by keywords or by using special tags - markings (see # tags to Twitter for example). These two steps are underway in recent years.
  • Most difficult step is classifying attributes relating to identification based on shape, color, material and size, or even smell. This means that objects or persons are automatically recognized by their appearance! This will enable automatic connections (eg, automatic grouping of all clothing or all vehicles or all edible mushrooms!)
  • Indirect evaluation of information can be derived from the use they had by other interested parties in the past. Every time a piece of information is being used for a purpose, the nature of this use should be recorded and added to the attributes.
  • Another level could be their "lifeline" or evolution of use on a timeline. I am referring to the activity or value timeline as it is recorded through search engines, social networking or sensor recording. First hints of use of this "life-cycle" values are the various trend-reporting services that have been presented by engines like Google or Twitter, etc.
Then we come to the visualization of the relevant information. The presentation must be based on the associated "meanings". It should also be as straightforward as possible. Without requiring neither search procedures on sources (which can be practically infinite) nor evaluation of data (which should be automatic). This way location, quality and identity of information are combined and the user sees only relevant - useful entries, probably ranked according to some criteria.

It is obvious that the amount of available information can make visualization complicated (especially if it includes historical factors). Hence we get to the necessity to come up with new technological solutions and visualization methods that will make the flow of data comprehensible. Existing example of such user interfaces, include overlapping levels, projected on real life scenery, or graphic representation of streams on timeline, etc. But we need to design new interfaces to facilitate handy browsing for users.

In this evolution we should include new software and applications that link the meaning from many sources and create a "flow" of information and trends. Also, new hardware is needed (transparent organic displays, or maybe even body implants). A new generation of graphics is an obvious solution. The advantage of graphic representation is that they are compatible with the existing culture and educational system for most users.

The ethics of knowledge
If we focus on the last two points (mentioned above) of the procedures that assign meaning to stored information, we can instantly understand that they are subjective and dependent on the user behavior. In a world with a culture of collective wisdom and knowledge, each person using the information and their derivatives, have a responsibility towards the community for the proper ethical use. By this I do not imply in any way manipulation of free will and cultural variety, but rather a methodology of thinking and linking it with information. Actions based on the available information must have a rational thinking behind them. Or contribution to the knowledge base must be done with a sense of social responsibility and not recklessly.

The danger of creating very conservative societies and manipulating the way of thinking of many people, is real though. This can happen by the formulation of knowledge that will be applied by systems, as well as from the extensive analysis of data and situations. As knowledge is derived by systems and methods, this incorporates certain degree of predictions and evaluations. This by nature is conservative, since any change i the status quo, should be acceptable thus predicted by the system itself! There are obvious conservative political repercussions. In his account, this faith in the technologically augmented system model becomes a reason to defend the status quo. This problem constitutes a whole subject by itself and can be a very serious discussion on cybernetics. A relevant articles can be seen here: The Limits of Big Data - ReadWriteCloud as well as a BBC documentary: All Watched Over by Machines of Loving Grace


The "classes" of the knowledge society


This process of collective knowledge will lead to some new "classes" of people and systems.
  • Authors / creators = those who produce wisdom in the first level. I am referring to the "birth" of knowledge through a process of analysis, study, awareness and reporting
  • Repeaters = those who forward, promote and enrich to some degree the knowledge which is produced by systems or "creators". People or brands or companies with influence and an effective distribution network.
  • Consumers = Readers and users of knowledge as produced and broadcasted.
Although, the class of authors, is particularly valuable, all the "classes" add value to information, even through simple usage. Even the statistical analysis of their activity or the evaluation of it, will add more levels to the knowledge base. So, even the simple "consumer" can and should be aware of his role and importance in this chain of "knowledge building".

Our culture has already moved on from information society to knowledge. The power now is in the "owner" of knowledge as it is derived from publicly available information. And of course this leads to effective application of knowledge to everyday or even more complicated decisions and procedures.



Further reading: Birth of the global mind - FT.com