Nothing shocking. GUIs for machine learning have existed for decades, for example Weka and RapidMiner. I don't see "drag and drop" in the screenshots, however; it's just awful gadget-journalism lingo, common for this site with its reviews of gaming mice.
ETL-style dataflow pipelines are more natural for such tasks than imperative programming; it's not just "for people who can't code". RapidMiner is dataflow-based too.
Actual links, if you want to avoid the gadget news website:
https://docs.microsoft.com/en-us/azure/machine-learning/serv...
https://docs.microsoft.com/en-us/azure/machine-learning/serv...
So, I clicked on the link to understand how this was different from their existing drag-and-drop machine learning tool, and...
> This tool, the Azure Machine Learning visual interface, looks suspiciously like the existing Azure ML Studio, Microsoft’s first stab at building a visual machine learning tool. Indeed, the two services look identical. The company never really pushed this service, though, and almost seemed to have forgotten about it despite the fact that it always seemed like a really useful tool for getting started with machine learning.
> Microsoft says this new version combines the best of Azure ML Studio with the Azure Machine Learning service. In practice, this means that while the interface is almost identical, the Azure Machine Learning visual interface extends what was possible with ML Studio by running on top of the Azure Machine Learning service and adding that service's security, deployment and life cycle management capabilities.
The answer is "not much."
The problem with this approach is that so much of machine learning depends on the datasets you choose to give it. If people need their hand held through setting up a basic neural network, I foresee a lot of garbage in, garbage out.
This sentiment gets expressed every time programming is made more accessible.
It always turns out that the difficulty of "setting up a basic [hello world application]" is entirely unrelated to the essential complexity of the problem space, and attracting a broader range of new users is later viewed as a valuable advance.
Looks closer to the open source R tool from Revolution Analytics, which they acquired a while back. Either way, node/graph-based ML workflow tools aren’t new. SAS has had a pretty good one for well over a decade, although it definitely shows its age these days.
I want to build, train and deploy an ML model in production.
I do know how to program.
I still want this tool. Badly. So badly. Just because I can program doesn't mean I want to use programming to solve every problem. I want simple tools that do the complex things for me for the 90% of cases where they are good enough.
Let me spend my time on the 10% of problems that simple tools can't solve!
There was a post yesterday about how it is desirable for Fortran to be preserved, as opposed to porting Fortran code to C++, because scientists want to focus on expressing the science and math, not on the low-level details of passing arrays around and performing safe array access (which Fortran compilers are stricter about and will emit errors for).
I haven't used this tool, but it seems reasonable that the people it might be useful for (whether Microsoft PR recognizes this or not) are scientists who have an idea for an algorithm but don't want to spend too much time thinking about how to write and deploy Python/C++ code to their clusters.
Sure, it may not seem like much effort to many programmers, but as the Fortran discussion stressed, just because you can code and think logically doesn't mean you're a programmer.
People with domain and subject-matter knowledge are usually not coders, so it makes perfect sense to give them a tool that eases the transition into using this tech. Make the tools accessible enough to cater to 80% of use cases and you have already won the battle.
Funnily enough, when I saw a demo of this, it was by a programmer, and it was one of his main tasks at his job. I had the same impression, but he actually made quite a sound argument for it. He wasn't a machine learning expert (yet), and it allowed him and his team to quickly construct models and see what the best fit was. He was quite efficient at using it, and showed us how to build and run models in under an hour. There was no code they had to maintain, which was ultimately a lot less work for them.
I think a lot of biologists (such as myself) would find drag and drop ML very useful.
I’m not interested in deploying a service or anything, but it would be a great way to take a first pass at analyzing some of the huge and pretty complex datasets that we generate, like metagenomic DNA sequences of microbiotas paired with health-related information that could also be fed into the model.
Even just narrowing down a list of potential targets would be pretty darn useful.
In the analytics departments of enterprise-sized companies and organisations sit a lot of mathematicians, economists and statisticians. These are the people who are going to use ML to change the world, because business intelligence, prediction and analytics live in these departments.
Not a lot of these people can program.
I know a lot of programmers who want to work with ML, but in my experience, very few programmers are good enough at math or statistics to do so, and even fewer have the business skills to actually translate their results to management in non-tech based organisations.
I’m sure a lot of programmers will make excellent data scientists, but I’m not entirely convinced why I would bring ML to my programmers rather than to my people who have degrees in applied statistics.
I work in the public sector. One of the reasons ML hasn’t found its golden use case yet is largely that no one has figured out how to use ML in a way that is better than the decades’ worth of data-related work we have already done, and part of the reason for this is that the companies who sell ML are programmers. They know how to use ML to identify things, but none of them, not even IBM, seem to know how to use ML for something they can actually get us to buy. And lord knows both sides of the table have tried; I mean, even our political leadership has heard of the ML hype and wants us to use it. So I’m rather hopeful these tools for non-programmers will bring ML into the hands of people who will know what to use it for.
I think there's value in these services for marketers, or targeters. I see a lot of potential clients who know only Excel/basic SQL and have a large list of customers/orders, or voters and donors. If they could upload it and get back a scored list of top targets or leads, that would add value. They just need to upload that list back to FB or wherever.
I don't see what the market would be for implementing this in production; like you say, to make it live in production you need to know how to code. Perhaps, though, I could see Segment offering a similar tool, which I guess would be 'in production' without code.
I think that’s true regarding where value will be produced, but Microsoft can sell way more subscriptions if it’s point-and-click and presumably wrong. I will say, though, that they recently added suggested chart types in Excel and they don’t seem half bad.
Nice. Microsoft has been doing some great work the last couple of years with ML; especially for beginners it's great to get started with their tooling. I always liked the UI and workflow parts in Azure ML.
I still use their free Jupyter Notebooks service also.
By the way, I can see massive adoption of this in consulting. Often we develop a ‘quick and dirty’ model to test variables or do a high-level regression / predictive model. Having this without the need for Python code is extremely useful, especially when you work on short-term, high-burn projects and just need an 80/20 answer.
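The "80/20 answer" being described is often nothing more than an ordinary least-squares fit. A minimal sketch of what such a tool automates, in plain Python with made-up numbers (this is an illustration, not any vendor's API):

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = a + b*x: the '80/20' first-pass model."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    # Slope: covariance of x and y over variance of x.
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    # Intercept chosen so the line passes through the mean point.
    return my - b * mx, b

# Hypothetical numbers: ad spend vs. sales.
a, b = fit_line([1, 2, 3, 4], [2.1, 3.9, 6.2, 7.8])
print(f"sales ~ {a:.2f} + {b:.2f} * spend")
```

A drag-and-drop tool wraps exactly this kind of step (plus data loading and validation) behind a box you wire up on a canvas.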
I wonder who their target market is. ML requires a solid math background and the ability to customize every detail of the process, which drag-and-drop tools never fully provide. I understand that it's always beneficial to have more user-friendly tools, but I still think ML experts - at least those who don't just copy-paste code snippets from SO - would prefer the more professional R and Python packages.
Maybe Microsoft is aiming at teaching ML to beginners, which would still be detrimental if they get used to just that.
I had this exact same thought when I read the headline. It seems like MS and others are viewing ML as a similar opportunity to Big Data/BI ten years ago. You saw the "democratization of data" as people with little technical skill could suddenly create analytics dashboards within tools like Tableau.
In my opinion, it's far too easy to make a critical mistake during the design and implementation of ML for it to safely follow this same path. And what's more, if you mess up an analytics dashboard, it's usually fairly obvious. In ML, there are MANY ways to mess up a model and no easy way to tell.
If someone doesn't have the technical experience behind creating these models, I would not trust any output they give me from using one of these tools. And if they do have the experience, they would certainly not be choosing to use one of these tools either.
A lot of ML problems are already solved fairly well and people want to use them in their products. For example, say you wanted to make a smartphone app that had some kind of image recognition. You aren't trying to invent a new machine learning algorithm. This tool would be very convenient for making an app like that.
I was taking a look at H2O.ai's AutoML dashboard yesterday, which they call Driverless AI (https://www.h2o.ai/products/h2o-driverless-ai/). It's broader in scope, includes an interpretability feature, and seems a little less white-box-ey than what I could see from MS. Plus, great looks. I haven't tried it first-hand, but I did take a mental note.
I think an ideal generalized ML service is more like: you give it a CSV, then add another row with missing column(s), and it guesses what should be in those column(s), along with some human-readable explanation of how it got there.
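That interaction can be sketched in a few lines. Here is a hypothetical toy version that "guesses" a missing value by copying it from the nearest complete row and explains itself; everything here (function names, the 1-nearest-neighbour choice) is made up for illustration, not any real service's behaviour:

```python
import csv
import io
import math

def guess_missing(rows, target):
    """Toy 'fill in the blank column' service.

    rows: list of dicts from csv.DictReader; rows where rows[target] == ""
    get a guess. All non-target fields are assumed numeric.
    Returns {row_index: (guess, explanation)}.
    """
    train = [r for r in rows if r[target] != ""]
    feats = [k for k in rows[0] if k != target]

    def dist(a, b):
        return math.sqrt(sum((float(a[k]) - float(b[k])) ** 2 for k in feats))

    guesses = {}
    for i, r in enumerate(rows):
        if r[target] == "":
            nearest = min(train, key=lambda t: dist(r, t))
            guesses[i] = (
                nearest[target],
                f"copied from the closest complete row with {feats}="
                f"{[nearest[k] for k in feats]}",
            )
    return guesses

data = list(csv.DictReader(io.StringIO(
    "sepal,petal,species\n"
    "5.0,1.4,setosa\n"
    "6.7,4.7,versicolor\n"
    "5.1,1.5,\n")))
print(guess_missing(data, "species"))
```

A real service would swap the nearest-neighbour lookup for a trained model, but the interface (CSV in, guesses plus explanations out) is the point.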
So it's like Scratch for ML? You 'draw' a program with an unwieldy graphical interface, and when it inevitably becomes a visual mess of complexity, what can you do?
Machine learning pipelines are at their core a sequence of data transformations. Having a concrete visualization of those transformations may make developing ML pipelines easier, and also allow for clearer communication between developers (and their managers, and potentially future auditors) as to what components they're working on and how they fit together.
Also, by using input/output blocks (as opposed to generic functions that can potentially access global state), the data dependencies of the different components are made explicit, which can make tooling around them easier. (I don't know how strongly this is enforced in this implementation.)
Granted, of course, one's mileage may vary.
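A minimal sketch of what such input/output blocks might look like (all names hypothetical): because each block declares what it reads and writes up front, a tool can check the data dependencies before anything runs, which is exactly what generic functions touching global state can't offer.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Block:
    name: str
    inputs: list    # names of datasets this block reads
    outputs: list   # names of datasets this block produces
    fn: Callable    # maps a dict of its inputs to a dict of its outputs

def run_pipeline(blocks, data):
    """Run blocks in order, refusing to run a block whose inputs don't exist yet."""
    available = set(data)
    for b in blocks:
        missing = [i for i in b.inputs if i not in available]
        if missing:
            raise ValueError(f"block {b.name!r} needs {missing}")
        # The block only ever sees the inputs it declared.
        data.update(b.fn({k: data[k] for k in b.inputs}))
        available.update(b.outputs)
    return data

# A hypothetical three-block pipeline: drop nulls -> rescale -> summarize.
blocks = [
    Block("clean", ["raw"], ["clean"],
          lambda d: {"clean": [x for x in d["raw"] if x is not None]}),
    Block("scale", ["clean"], ["scaled"],
          lambda d: {"scaled": [x / max(d["clean"]) for x in d["clean"]]}),
    Block("summarize", ["scaled"], ["mean"],
          lambda d: {"mean": sum(d["scaled"]) / len(d["scaled"])}),
]
result = run_pipeline(blocks, {"raw": [2.0, None, 4.0, 8.0]})
print(result["mean"])
```

The dependency check is also what lets a visual editor draw the graph and flag a disconnected wire before you hit "run".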
https://en.wikipedia.org/wiki/SPSS_Modeler
I remember using this in NINETEEN NINETY FIVE (it was called Clementine then).
Interestingly, one of the leads (Rob Milne) sold up (to IBM; a forced sale, I guess, due to a cash squeeze and no investors) and went to Everest, where he got to the bottom of the Hillary Step, had a massive heart attack and died.
Makes ya thunk.
streetcat1 | 6 years ago:
In general, the holy grail is an autonomous ML system with no input from the user but a dataset schema.
pazimzadeh | 6 years ago:
https://lobe.ai/
If not, it looks like they have parallel projects working on basically the same thing. I’ve been waiting for Microsoft to open Lobe to the public.
dlkf | 6 years ago:
1. They want to build, train, and deploy a machine learning model into production, presumably as a microservice, part of a web application, etc.
2. They don't know how to program.
I honestly can't imagine a less useful product than drag-and-drop ML.
vonnik | 6 years ago:
"Anyone smart enough to use this isn't dumb enough to need it."
Those are the words that come to mind every time I see stuff like this. Clickers want solutions, not complicated toolkits in a GUI.
gtt | 6 years ago:
I've seen many people in ML who don't quite understand what they are doing, randomly trying different things until it "works".
heavenlyblue | 6 years ago:
Really? How many machine learning "specialists" actually write custom code for the matrix operations done by the ML engines?
marshray | 6 years ago:
> at least those who don't just copy-paste code snippets from SO
This is like the late 1990s, listening to "real coders" complaining about web developers using JavaScript, all over again.
cbHXBY1D | 6 years ago:
https://docs.microsoft.com/en-us/azure/machine-learning/serv...