Welcome to the Real (IT) World!

The future is Machine Learning, not programs or processes.

I was once again voted the most outspoken BPM blogger on earth. Hey, why not the universe? But thanks for the mention. I won't, however, waste my time or yours on BPM predictions for next year; instead I'll discuss the medium-term future of process management: machine learning.

Machine learning used to be called artificial intelligence and became popular when IBM's Deep Blue won against chess world champion Kasparov in 1997. The victory wasn't, however, a success of AI, but a result of Joel Benjamin's game strategy and an ensuing Deep Blue bug that rattled Kasparov and ultimately handed IBM the match. And IBM did it again in 2011 with Watson, which won the popular Jeopardy TV game show in a similar publicity stunt that has little relevance for machine learning. Jeopardy uses a single semantic structure for its questions, so finding and structuring answers from a huge Internet database was not that hard. Watson had the hardest time with short questions that yielded too many probable answers. It used no intelligence whatsoever in providing answers.

1997: Kasparov is beaten by IBM's Deep Blue.

Why then do I say that the future of BPM is in this obscure AI arena? First, machine learning is not about beating humans at some task. And second, I see machine learning as ideal for work that can't simply be automated. My well-documented opposition in this blog to orthodox BPM is caused by the BPM expert illusion that a business and the economy are designed, rational structures of money and things that can be automated. In reality they are social interactions of people that can never be encoded in diagrams or algorithms. I have shown that machine learning can augment the human ability to understand complex situations and improve decision making and knowledge sharing.

To do so, I turned my attention away from writing programs and processes many years ago. My research is into how humans learn and how that understanding can be used to create software that improves how people do business, without the intent to replace them. For me the most important discovery was that humans do not use logic to make decisions but emotions (Damasio, 1993). Using logic requires correct data and perfect control, both of which are unattainable. Humans therefore developed the ability to come to decisions under uncertainty (Gigerenzer) using a wide variety of decision biases (Kahneman, Tversky). These are not fallacies, however, but very practical, simplified decision mechanisms. Many books on the subject focus on how wrong these biases can be in some situations. They ignore that logic-driven decisions are equally wrong in all other situations because the underlying data are wrong. Logic is only better in theory or in the closed shop of the laboratory.

While machine learning collects past information to learn, the past never predicts the future. We can only identify general probability. Take a roulette wheel, for example: a long history of spins tells you the odds of each number, but nothing about the next spin. Yet people are fascinated by predictions (see astrology) and managers want certainty, which is unattainable. Therefore they like buzzword-driven hypes that suggest that if we collect and analyze more data about more things, we will get better predictions to precode decisions for the future. It is mirrored in the 'process mining' approach that assumes that collecting more data on more business processes makes them more predictable. That is in fact a misrepresentation of what ML can do.
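To make the point concrete, here is a minimal sketch (an illustrative simulation, nothing more): the history of spins pins down the general probabilities rather precisely, yet remains useless for predicting the next spin.

```python
# Illustrative simulation: past spins reveal the general probability of each
# outcome, but never predict the next individual spin.
import random
from collections import Counter

random.seed(42)
spins = [random.randint(0, 36) for _ in range(100_000)]  # European wheel: 0-36

counts = Counter(spins)
# The observed frequency converges toward the true probability 1/37 ~ 0.027 ...
print(f"Observed frequency of 17: {counts[17] / len(spins):.4f}")
# ... yet no amount of history changes the odds of the next spin.
print(f"Next spin: {random.randint(0, 36)}")
```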

Algorithms can be used to automate mechanical functionality in real time in a factory, airplane or car. Take Google's self-driving car, military drones or robotic assembly lines. Let's not forget that they don't take decisions on where to go and what to do when. They are surroundings-aware robots that follow a human-given directive. That is not intelligent. Thus one can't automate a business process or any complex human interaction the same way.

True innovation in process management won’t be delivered by rigid-process-minded ignoramuses, who fall for the illusion that correct decisions can be encoded. It will arrive in the arena of machine learning that will help humans to understand information to make better decisions.

What is machine learning and what is it not? And how far are we?

Fifty years ago, Marvin Minsky had a vision that computers would fairly soon be as intelligent as humans or more so. He proposed the use of software called neural networks that mimicked human brains. However, a human brain does not work by itself, but is a complex construct of evolved brain matter, substantial inherited innate functionality and learned experiences, many of which are only available through our bodily existence. Our experience of self is not a piece of code, but a biological function of our short-term memory and the connection to our body through the oldest part of the brain, the medulla, which sits atop our spinal cord. Without our hormonal drives, human intelligence and decision-making would not even develop. What makes us human are our bio-chemical emotions: the ability to feel fear, love and compassion. That is accepted science. Therefore a purely spiritual entity (our soul?) or logical function without body and body chemistry can't feel either, and thus won't possess human-like intelligence. It won't be able to take human-like decisions. But machine learning can provide benefits without the need for human-like intelligence. So in the last few years all large software companies have jumped on the machine learning bandwagon.

While IBM has no more to offer than publicity stunts, Facebook has no other interest than to utilize the private information of its 700 million users to make money. Facebook is using buzzword-speak 'deep learning' to identify emotional content in text to improve its ad targeting through big-data mining and prediction. Supposedly some of this is already used to reduce the Facebook news feed to an acceptable amount. My take? It isn't working, much like Netflix movie suggestions or the product recommendations of Amazon. Why? Statistical distribution can't judge my current emotional state!

But ML isn't a ruse. It is real. Minsky's neural networks are still being explored and improved for voice and image recognition and semantic, contextual mapping. These are important, if low-end, capabilities of the human brain for pattern recognition. Japanese, Canadian, and Stanford University researchers, for example, developed software that classified the sounds it was hearing into a few vowel categories correctly more than 80 percent of the time. Face recognition is also extremely accurate today. Image classification is successfully used to recognize malignant forms of tumors for cancer treatment. In fact, voice recognition in Apple dictation in both iOS and OS X is extremely good at understanding spoken sentences. The hidden lesson is that Apple uses both a dictionary and a grammar library to correct the voice recognition. I have written much of this post using Apple dictation. The important progress in this area is the recognition of NEW common image features at a much higher success rate than humans can achieve. But in all these approaches it is the human input that decides whether the patterns are relevant or not. Man cooperates with machine; machine does not replace human intelligence.

So what is Google up to in machine learning?

The most publicized and successful Google venture in this domain is the self-driving car. It is a great example of how real-time data sensors in combination with a human-created world map (Google Maps, obviously) allow a machine to interact safely and practically in a complex environment. Don't forget that the car is not controlled by a BPM flow diagram, but is totally event and context driven. So much for BPM and the 'Internet of Things' …

I dare to put Ray Kurzweil's work at Google in the same category as IBM's, with similar illusions as Minsky's. I have known Ray Kurzweil since the days he created the K250 synthesizer/sampler in 1984. It was the first successful attempt to emulate the complex sound of a grand piano. I was the proud owner of one in my musician days. It was inspired by a bet between Ray Kurzweil and Stevie Wonder over whether a synthesizer could sound like a real piano. It was awe-inspiring technology at the time, and it too led to predictions that performing musicians would become obsolete. Obviously, that did not happen.

Kurzweil joined Google in 2013 to lead a project aimed at creating software capable of understanding text questions as well as humans can. The goal is to ask a question just as you would ask another person and receive a fully reasoned answer, not just a list of links. Clearly this is reminiscent of Siri and Wolfram Alpha, and both have been at it for a while.

Kurzweil's theory is that all functions in the neocortex, the plastic (meaning freely forming) six layers of neuron networks that form the seat of reasoning and abstract thought, are based on a hierarchy of pattern recognition. That is not a new theory at all but a pretty well established one. It has led to a technique known as hierarchical hidden Markov models, which has been in use in speech recognition and other areas for over ten years. Very useful, but its limitations are well known. Kurzweil however proposes that his approach will deliver human-like intelligence if the processor can provide 100 trillion operations per second. A human brain is, however, not just a neocortex! And more processing power is not going to solve that problem.
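For the technically curious, here is a minimal sketch of the flat (non-hierarchical) hidden Markov model forward pass that such speech recognizers build on; the states and all probabilities are invented for illustration.

```python
# A minimal (flat) hidden Markov model forward pass. All numbers are invented.
import numpy as np

start = np.array([0.6, 0.4])            # P(initial hidden state): [vowel, consonant]
trans = np.array([[0.3, 0.7],           # P(next state | current state)
                  [0.6, 0.4]])
emit = np.array([[0.8, 0.2],            # P(observation | state); obs 0 = "voiced", 1 = "unvoiced"
                 [0.1, 0.9]])

def sequence_likelihood(observations):
    """Sum over all hidden state paths (the forward algorithm)."""
    alpha = start * emit[:, observations[0]]
    for obs in observations[1:]:
        alpha = (alpha @ trans) * emit[:, obs]
    return alpha.sum()

print(sequence_likelihood([0, 1, 1, 0]))  # likelihood of one observation sequence
```

Hierarchical HMMs stack such models so that one state can emit a whole sub-sequence. Useful, but note that it is still pattern statistics, not reasoning.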

In machine learning less is always more!

Google isn't thus betting all its money on Ray Kurzweil but recently spent $400 million to acquire a company called DeepMind that attempts to mimic some properties of the human brain's short-term memory. It too uses a neural network that identifies patterns as it stores memories and can later retrieve them to recognize texts that are analogies of the ones in memory. Here the less-is-more approach is used. DeepMind builds on the 1950s experiments of American cognitive psychologist George Miller, who concluded that human working memory stores information in the form of "chunks" and that it can hold approximately seven of them. Each chunk can represent anything from a simple number to an abstract concept pointing to a recognized pattern. In cognitive science, the ability to understand the components of a sentence and store them in working memory is called variable binding. The additional external memory enables the neural network to store recognized sentences and retrieve them for later expansion. This makes it possible to refer to the content of one sentence as a single term, or chunk, in another one.

Alex Graves, Greg Wayne, and Ivo Danihelka at London-based DeepMind call their machine a 'Neural Turing Machine' because of the combination of neural networks with an additional short-term memory (as described by Turing). While this is a great approach, it lacks the ability for human interaction and training, which I see as the key aspect for practical use. But variable binding is a key functionality for intelligent reasoning.
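To illustrate chunks and variable binding, here is a toy sketch (my own illustration, not DeepMind's differentiable implementation): parsed sentences are stored as named chunks in an external memory, so a later sentence can refer to an earlier one as a single term.

```python
# Toy 'working memory': sentences are stored as named chunks, and a chunk name
# used inside another sentence is later expanded back into its content.
from collections import OrderedDict

CAPACITY = 7                      # Miller's roughly seven chunks
memory = OrderedDict()            # the external short-term memory

def store(name, triple):
    """Bind a name (variable) to a sentence; evict the oldest chunk when full."""
    if len(memory) >= CAPACITY:
        memory.popitem(last=False)
    memory[name] = triple

def expand(term):
    """Recursively replace chunk names by their stored content."""
    if term in memory:
        return tuple(expand(t) for t in memory[term])
    return term

store("fact1", ("Mary", "picked up", "the ball"))
store("fact2", ("fact1", "happened in", "the garden"))  # refers to fact1 as one chunk

print(expand("fact2"))
# (('Mary', 'picked up', 'the ball'), 'happened in', 'the garden')
```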

Human collaboration and human-computer cooperation

The future is using computing as a tool to improve human capabilities, not to replace them, which is why BPM in large corporations is my pet peeve. To illustrate the point I have been making for over a decade, I recommend watching this interesting TED Talk by Shyam Sankar on human-computer collaboration.

Sankar talks about J.C.R. Licklider's vision of human-computer symbiosis: enabling man and machine to cooperate in making decisions without depending on predetermined programs. Like me, Licklider proposed that humans would set the goals, formulate the hypotheses, determine the criteria, and perform the evaluations, while computers would handle all operations at scale, such as computation and volume processing. Computers do not replace human creativity, intuition and decision-making.

So what are the aspects of machine learning that are available and usable and do not suggest Orwellian scare scenarios? Well, machine learning technology is unfortunately perfectly suited to, and broadly used for, surveillance, but let's focus on the positive for the moment. Image and voice recognition and follow-on classification are areas where we have reached the stage of everyday practical use. We have been using image- and text-based document classification in Papyrus for 15 years. Machine learning is used in Papyrus for character recognition, text extraction, document structure, sentiment analysis, and for case context patterns related to user actions, with the so-called UTA or User-Trained Agent. Pattern recognition for case management that uses the kind of human training and cooperation that Sankar suggests was patented by me in 2007.

What the UTA (User-Trained Agent) does, and what my patent describes, is that we do not look for patterns that predict that something will happen again. In human-computer collaboration it is the repeated human reaction to a particular process pattern that is interesting, and therefore one can make others aware of the likelihood that such an action is a good one. This ML function does not just find patterns; it analyzes how humans react to patterns. Users can also react to a recommended action by rejecting it. As I do not prescribe the process rigidly but require that goals to be achieved are defined, it is now possible to automatically map a chain of user actions to goal achievement and let a user judge how fast or efficient that process is.
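A toy sketch of the principle (invented names and data, not the actual Papyrus implementation): the agent records which action users take for a given case-context pattern, lets users reject recommendations, and only suggests actions with a positive track record.

```python
# Hypothetical User-Trained-Agent-style recommender: it learns from how humans
# react to a context pattern, including their rejections of its suggestions.
from collections import defaultdict

scores = defaultdict(lambda: defaultdict(int))  # context pattern -> action -> score

def observe(context, action):
    """A user performed this action in this context: strengthen the link."""
    scores[context][action] += 1

def feedback(context, action, accepted):
    """Users may accept or reject a recommendation."""
    scores[context][action] += 1 if accepted else -2

def recommend(context):
    """Suggest the best-scoring action, if any has a positive score."""
    known = scores.get(context)
    if not known:
        return None
    action, score = max(known.items(), key=lambda kv: kv[1])
    return action if score > 0 else None

ctx = ("claim", "amount > 10000", "customer: gold")   # a simplified context pattern
observe(ctx, "escalate to senior adjuster")
observe(ctx, "escalate to senior adjuster")
observe(ctx, "request documents")
print(recommend(ctx))   # -> escalate to senior adjuster
```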

But how practical is such machine learning for simplifying process management for the business user? Does it require AI experts, big-data scientists and huge machines? Absolutely not, as it too uses the less-is-more approach. Recognized patterns are automatically compacted into their simplest, smallest form and irrelevant information is truncated. But in 2007 it still used IT data structures and not business terminology. Using an ontology to describe processes in business language enables human-to-human collaboration and run-time process creation, and simplifies human-computer cooperation.

Papyrus thus uses a simplified form of 'variable binding' for process descriptions by means of an ontology. Such an ontology definition entry always has a subject, predicate and object, just like the DeepMind short-term memory. Now the UTA can identify process patterns using the ontology terms. The first neural-network-based User-Trained Agent version in 2007 could not explain why it suggested an action for a process pattern. Using the ontology to identify the pattern similarities in different work processes (really cases), one can tell in business terms why an action is recommended.
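A simplified sketch of that idea (my illustration, not the product code): when process context is expressed as subject-predicate-object triples, the overlap between two cases can be computed and, more importantly, read back in business terms.

```python
# Two cases described as ontology triples (subject, predicate, object).
case_a = {("customer", "submitted", "complaint"),
          ("complaint", "concerns", "invoice"),
          ("customer", "has status", "gold")}

case_b = {("customer", "submitted", "complaint"),
          ("complaint", "concerns", "delivery"),
          ("customer", "has status", "gold")}

shared = case_a & case_b
print(f"Pattern similarity: {len(shared) / len(case_a | case_b):.2f}")  # Jaccard overlap
for s, p, o in sorted(shared):
    print(f"Recommended because in both cases: {s} {p} {o}")
```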

Business analysts create a business ontology at design time that non-technical business users then use at run-time to create their processes and content. The technical effort is mostly related to creating interfaces to existing IT systems to read and write business data. At run-time, users collaborate in business terminology, and as they perform their work they create the process patterns that can be reused. These can be stored explicitly as templates, or the User-Trained Agent will pick up the most common actions performed by users in certain contexts.

Conclusion: We are just at the starting point of using machine learning for process management. IT and business management are mostly not ready for such advanced approaches because they lack the understanding of the underlying technology. We see it as our goal to dramatically reduce the friction, as Sankar calls it, between the human and the machine learning computer. Using these technologies has to become intuitive and natural. The ultimate benefit is an increase in the quality of the workforce and thus in customer service.

The Interdependence of Technology and Culture

Are computers making us smarter or dumbing us down? However we feel about it, technology will progress, and we need to decide for ourselves how we will interact with it. Yet before we can do that, we need to understand how technology is created and how it impacts our lives. We need to understand how human decisions are impacted, and what happens when they are replaced by computerized ones. Only then can we make the right choices.

Development of technology depends on uncovering an aspect of nature, from how atoms form molecules, via how bacteria live in symbiosis with the human body, to how humans interact in a social network. It is interesting that technology does not grow and spread equally around the world, but there are hotspots and dead spots. It is both opportunity and culture that shape technological progress.

In The Death and Life of Great American Cities, Jane Jacobs argues that the one core aspect of culture that drives progress is diversity of people and ideas. It fosters a creative environment that bubbles with innovation. Let's not forget that this applies to any social grouping, including businesses. The creative work of uncovering technological principles requires an environment of tolerance, prosperity, and opportunities for a diversity of ideas to mix. So quite apparently culture has an influence on technology. Ever since a stone axe was used to shape the first wooden wheel, new technology has been a combination of previous technologies, much like biological organisms are combinations of genes. My most-used example is that of the Apple App Store ecosystem, setting free the immense creativity pool of its worldwide developer network. It is those applications and content that put Apple in front of Microsoft, not the Mac or iPhone. As technology progresses, markets of humans using technology increasingly resemble biological ecosystems, with their building blocks being mixed and matched in new and unpredictable ways.

As the number of basic technologies expands, the number of permutations and recombinations increases exponentially. That's why technological advancement is always accelerating. New technology enables us to uncover even more phenomena, and that allows us to build even more technology.

When people say that they are afraid of technology, what they are really saying is that they are afraid of people using technology in the wrong way. As a general guideline we should only do things we fully understand, and clearly there is always the military and political aspect to consider. Technology is as well suited to surveillance as it is to control. Too often technology has been, and is being, used to kill more people more efficiently. War and peace are both dependent on technology.

And here comes the surprising loopback: Technology more than anything else changes the culture of the social group using it.

Morality and compassion are principal signs of maturity.

So why are people opposed to technology if they apply common sense? A two-year-old is not afraid of a smartphone; he or she is rather magically drawn towards it and learns within the shortest time how to interact with it. Einstein suggested that common sense is no more than all the prejudice collected by the age of eighteen. Is it just the fear of change or of unknown consequences? I propose that politicians are only too happy to use this fear to enable more government control over technology. Orthodox media just love fear-mongering, especially towards their archenemy, the Internet.

Does that cause us to be prejudiced one way or the other about technology as we mature? Human maturity is mostly about realizing one's own weaknesses and turning them into strengths. Fear protects us but can also stifle us. While the ability to forgo immediate pleasures for postponed gains is a sign of maturity, there is little benefit in turning such behavior into rigid rules. A lot of it has to do with experience and the related emotional adjustment of one's decisions, also known as the value of failure. If you build airplanes, some of them will crash! Using analogies in a future projection enables a mature person to make large adjustments from small failures. A child that is not overprotected learns through its own experience that the height of a fall is proportional to the pain, and for most that is sufficient not to test the limits of that projection. If not, it is called natural selection.

Mature people understand intuitively that collaboration is more productive than antagonistic behavior. Competitiveness too can be a great driver, up to a point. So maturity is not just about increasing positive and reducing negative behaviors but about finding a balance while following our drives. Perseverance, patience, endurance, and tolerance all go to waste without compassion, which is part of the one key human aspect: MORALITY. So what we are really worried about is a lack of the morality and compassion that would stop others from using technology against us. But we do not have to invoke images of Orwell's 1984; we simply have to look at how many businesses are run today and how employees are treated there.

Many would agree that a society has a higher morality if it has more laws and rules to follow. I propose that the opposite is the case. If people were moral, there would be no need for rules and laws. If people know what they are doing, there is no need to protect them from their own folly with regulation such as speed limits. And let's not forget that it is the law that makes the criminal and the rule that makes the transgressor. Many believe that rules and regulations save time and money because they prevent failures. If that were true, the negative side effect would be a total lack of learning within the organization, as there would be no failures and no growth of experience.

Prohibition as the ultimate regulation, for both alcohol and drugs, led to an empty space in which only criminals – the Mafia then and drug cartels now – define how drugs are produced and consumed. I have been suggesting the deregulation of drugs for two decades, and I am no longer alone. The more you regulate, the more you give up control, because the action will take place outside the space you think you are controlling. If you put too strict a process management in place, collaboration will bypass it with email and office tools.

In society and in business, rules and the necessary enforcement use up more resources and time than they actually save. An effect called 'rule beating' is well known to systems thinkers: any defined rule will cause people to try to bypass it. The most efficient way to perform any kind of work is when the people involved know what they are doing and rules are kept to the absolute minimum necessary. 'Knowing what to do' is not the same as abstract knowledge that can be encoded in rules. 'Knowing what to do' requires personal experience, transposed by 'pattern-matching' to the current context.

Giving people freedom is not anarchy! It drives diversity and learning.

While our understanding of human intelligence is limited, recent years have brought the realization that human decision-making – the essence of free will – is controlled by our emotional center and is a bio-chemical function, not controlled by rational thinking. That has led some to say that there is no free will in humans: brain scans show activity for a movement half a second before the person says he has taken the decision. But how would we really know? Some use that discovery to argue that human decision-making is flawed. Nothing could be more wrong. The human mind has evolved to deal with decision-making under uncertainty. Boolean logic requires a context of certainty that does not exist in reality.

Free will is also related to the concept of spontaneity, which is, to the surprise of many, a key concept of physics, especially quantum physics. Just as in quantum physics, spontaneity does not happen in an empty context; it is that context that enables, and is probably connected to, the spontaneous action. Is it then still truly spontaneous, or free will? That is a purely philosophical discussion, depending on what free will is supposed to mean.

We do know that the human brain has a very plastic biology of layers upon layers. The limbic system is the emotional control mechanism for memory and decision-making and produces emotions such as anger and fear as well as our core drives. The cortex, however, stores a virtual representation of the real world as patterns. It uses it to continuously predict the future, based on the past and present. Each recognized pattern can feed higher orders of patterns throughout the six layers of the cerebral cortex.

The human brain is thus a bio-chemical pattern-matching device with a huge number of pre-encoded recognition clusters that allow us to sense our surroundings and deduce feelings and recognitions about them. As the world is simply not predictable in detail, and we are unable to know and measure accurately enough, we always decide under uncertainty, and therefore an emotionally driven pattern-matching concept works best. Pattern matching is easy to train and easy to cluster into hierarchies, much better than complex semantic networks that suffer from unresolvable interpretation issues. It is always the interpreter that assigns meaning, not the message creator. One can agree on common ontologies and taxonomies, but it will still be impossible to build logic networks that show or explain free-will decision-making. And that understanding brings us back to maturity and morality! Feeling within us (through biochemistry) the same emotions as others, given similar patterns, is what allows us to be compassionate and thus moral. Logic can never be moral.

Company culture will change with technology!

One can use technology to do both — empower or restrict. Both will have a strong influence on a business and it will over time change the company culture, for better or for worse. So be careful what you wish for.

It is humans and their free will, driven by the emotional context, that make a business moral and mature. A balanced scorecard or a well-defined process doesn't. Company culture can't be described in a few slogans either. There is no way that any kind of business or process maturity can be found in silly and far-fetched templates or boilerplates, models or frameworks, business or otherwise. Why should a business be more mature because it shapes its activity into predefined processes? It may be more compliant if it describes everything in rules, but not more moral. Lacking diversity, it will certainly be much less innovative and thus less competitive. And yes, one can therefore use the analogy of human maturity for business maturity, mostly because it is the maturity of the humans who make up the business that defines the maturity of the business, not the achievement of some abstract level of perfectly defined process logic. What utter nonsense!

Yes, technology will cause new challenges and further problems. There will be failures. Human creativity will once again use technology to solve them, not a methodology or legislation that restricts and demands safety and conformity. There is no need to fear technology as long as enough humans have the freedom to choose in a democratic environment. Technology that empowers will free employees' minds and unlock creativity and innovation. The same free minds will mostly use their freedom to do the moral thing.

No matter what your opinion is on the subject, the evolution of technology is tightly linked to our own. The use of technology is also tightly linked to the future of your business! Competitive advantage is not achieved through control, so get hold of that fear and use technology to empower.
