Talk:Deep learning/Archive 1

From Wikipedia, the free encyclopedia

Article created

I did about two years of work involving Deep Belief Networks. Noticing that there was no article for deep architectures in Wikipedia I made this one and hope to add more soon! Renklauf (talk) Wed Jul 20 06:29:12 UTC 2011

Please do! I greatly enjoy this topic, and I think it's the best way to Strong AI! Danuthaiduc (talk) 10:00, 10 August 2011 (UTC)
Yeah, but let's follow NPOV and refrain from asserting that this stuff really works. You can't use one researcher's claims about his own work as proof that his ideas are valid (see Reproducibility of results). --Uncle Ed (talk) 15:48, 27 May 2012 (UTC)
There is zero technical merit to this article. I suggest merging this with the article on neural networks until "definitions" of "deep" neural networks like this one are seen as laughably pathetic as they are: "A deep neural network (DNN) is defined to be an artificial neural network with at least one hidden layer of units between the input and output layers..." For crying out loud, people, the addition of at least one hidden lay was the whole idea behind neural networks as revived during the 1980s -- and this after the article's introduction says "ANNs fell out of favor in practical machine learning and simpler models such as support vector machines (SVMs) became the popular choice of the field in the 1990s and 2000s." That is to say, Deep Learning Neural Networks (as defined in this article) fell out of favor prior to the Deep Learning Neural Networks advancing the field beyond Deep Learning Neural Networks. Jim Bowery (talk) — Preceding undated comment added 20:09, 9 May 2014 (UTC)
I'm afraid I agree with Jim Bowery -- I'm not sure that deep learning is really a separate field -- it sounds like another term for an existing set of paradigms. I would support a merge. ---- CharlesGillingham (talk) 22:54, 27 September 2014 (UTC)
I also agree. As this article currently stands I don't see the difference between Deep Learning and Neural Networks. If there really IS a difference it should be made clear in the article. If there isn't a difference then this article should be merged with an article on Neural Nets. BTW, I also think it's not good style to have comments like this at the top of the article: "Alternatively, "deep learning" has been characterized as "just a buzzword for neural nets" Is it "just a buzzword for neural nets" or isn't it? That seems to me something the article should take a stand on and not just be weasely as it is now. --MadScientistX11 (talk) 18:29, 21 October 2014 (UTC)
As far as I can tell, "deep learning" refers commonly to the current neural net revival, which started when researchers figured out how to train nets with more than (say) three hidden layers, ca. 2006. But since then, as you can see in this article, anything and everything neural net-related has been sold as "deep learning", including single-layer networks. QVVERTYVS (hm?) 19:11, 21 October 2014 (UTC)
The difference between deep learning and neural networks is pretty clear in the literature and in this article. A neural network (in machine learning) is a a particular kind of architecture or mathematical model for transforming inputs to outputs. Deep learning is the name of a class of algorithms, based on representational methods and unsupervised pre-training of layers, for training models with deep architectures, that is models with multiple layers of computational units between input and output. Don't mistake a training algorithm for an architecture. There are many different ways to train artificial neural networks: see Artificial neural network#Learning algorithms for some examples. Some of theses learning algorithms, such as backpropagation and expectation-maximization, have their own standalone articles.
Deep learning as a training algorithm is more than notable enough to have its own standalone article. Criticism of article content is welcome, but it is far better to improve the content using reliable sources than to derisively declare "It's all crap!" and attempt to delete it. "It's all crap!" is also a non-neutral stance on the topic. It is clear from the literature that there are proponents of deep learning methods and undeniable successes in the competitions, there are detractors of these methods and there are people fed up with all the hype. Summarizing with due weight on all of these is the way to go. --Mark viking (talk) 19:46, 21 October 2014 (UTC)
Mark viking deep learning is not an algorithm at all. It's an umbrella term for various models, trained with novel but still very different algorithms, ranging from the supervised to the unsupervised. I actually wrote a page about the deep belief network and its training algorithm; but convolutional nets are trained in quite different ways. (Backpropagation is involved practically everywhere, though.) QVVERTYVS (hm?) 20:04, 21 October 2014 (UTC)
As I said above, Deep learning is the name of a class of algorithms. Feedforward neural nets, restricted Boltzmann machines, deep neural networks and deep belief networks are all mathematical models. How the models are trained, that is, how model parameters are chosen to optimize some loss function, is another topic entirely. People had looked at deep neural network architectures before deep learning techniques were developed, but it was the deep learning techniques--unsupervised pre-training, layer by layer--that made deep networks more practical at the time. With today's GPU based algorithms, the need for pre-training has lessened. The hype surrounding deep learning has muddied the concept to point that people like Michael Jordan claim it is just a synonym for neural network. But the backlash from current hype doesn't change the historical importance of deep learning algorithms in generating interest and good results in deep neural network models or representational learning. Schmidhuber's review has a good discussion of these issues. --Mark viking (talk) 21:55, 21 October 2014 (UTC)

Ok, agreed. Let's try to clean up the article before trying to merge it. QVVERTYVS (hm?) 09:14, 22 October 2014 (UTC)

Sounds good. I will chip in the next few days. --Mark viking (talk) 17:26, 23 October 2014 (UTC)
Good job so far guys - it's a devilish mess to unravel! That said, I have to agree with most of the negative comments - this still needs a lot of work before being reference quality. Deep learning is a very young (immature) field at a philosophical level (probably because progress is driven mostly by empirical engineering practices) and we must have faith that it will become more rational as it matures. E.g., the state of taxonomic analysis is pitiful (as substantiated out by the various comments here). As a step in the direction of rigorous philosophical treatment, I have added a fairly brief and by no means exhaustive section on *interpretation*. Clearly, the logic set forth in the added 'interpretation' section is not compatible with the subsequent (hand-wavy) 'definitions' section but I haven't got time to clear that section up at the moment. We could also do with a 'Biological Interpretation' section as well (to round things out) - if there is one. We could also do with more examples of the insights and progress that was specific to each interpretation. Furthermore, I do not support a merger with 'neural networks'. It is premature to conclude that neural networks are inherently 'deep learners' (see the 'DSP interpretation' for argumentation). In order for this field to be taken seriously, we need to step up the rigor fellas. Qazwsxedcqazws (talk) 07:54, 1 October 2015 (UTC)
How is the clean up progressing? IMHO, "Deep Learning" as a term is likely to mislead the not-so-educated audience about the "depth" of learning: the general audience does not understand that the only "deep" thing about "deep learning" is the metric depth of the neural network, and there is nothing else particularly deep about the neural networks or the associated learning methods. An extremely effective way to produce more hype!  Preceding unsigned comment added by 87.92.32.62 (talk) 07:31, 2 March 2019 (UTC)

Deep Learning Artificial Neural Networks

This article was missing a lot of stuff on deep learning neural networks. I tried to improve this a bit, but much remains to be done. (I found the so-called "official site" on deep learning pretty meager, too.) Deeper Learning (talk) 20:35, 7 December 2012 (UTC)

You could have a look at deep belief networks and deep autoencoders. p.r.newman (talk) 13:56, 20 August 2013 (UTC)
It's said in the article that neural networks with more than one hidden layer could be considered as a deep learning architecture. In this case there is at least two hidden layers (and also 3 non-linear successive transformations). Instead isn't it ONE hidden layer for two non-linear transformations ? — Preceding unsigned comment added by 163.5.218.118 (talk) 01:19, 9 March 2014 (UTC)

Streamlining

One should probably streamline repetitive statements in the introduction and the section on deep neural networks. Isn't deep learning exclusively about neural networks anyway? As far as I know, there is no other successful form of deep learning. Prof. Oundest (talk) 01:56, 11 August 2013 (UTC)

Proofreading note: "set of algorithms"

The article states that deep learning is a "set of algorithms", but it doesn't clearly identify the specific algorithms included in the set. The Transhumanist 01:40, 25 September 2013 (UTC)

How is deep learning differentiated from the family of feature learning methods it belongs to?

The article states: "Deep learning is part of a broader family of machine learning methods based on learning representations." But it doesn't explain how it differs from the rest of the feature learning family. The Transhumanist 01:40, 25 September 2013 (UTC)

"Deep learning" synonymous with "neural networks"?

The article's lead includes a blatant claim that "deep learning" is synonymous with "neural networks":

Deep learning is just a buzzword for neural nets, and neural nets are just a stack of matrix-vector multiplications, interleaved with some non-linearities. No magic there.

Ronan Collobert

This is potentially extremely confusing, as it may cause readers to wonder why there is a separate article about deep learning. The article does not justify itself with an explanation. That is, it doesn't explain how deep learning is a type of neural networks, rather than being just another name for neural networks in general.

If it is synonymous (as per the claim included in the lead), then the article violates WP:FORK. And so far, the article doesn't make it clear that its coverage isn't a content fork.

Is there any such thing as non-deep learning neural networks? If so, what are they? And, how do deep learning neural networks differ from them? The Transhumanist 01:40, 25 September 2013 (UTC)

I agree that this is confusing, but the reason for citing Collobert (one of the foremost practitioners in the current neural nets landscape) is to counterbalance the rest of the article. As I understand deep learning, it's neural nets with more hidden layers than you can train using vanilla backpropagation (because of numerical stability, the required computing time, and/or a lack of labeled training data). See e.g. this talk by Hinton, and note the difference between the title assigned to it by UBC's YouTube moderator and the actual title of the talk. QVVERTYVS (hm?) 18:22, 25 September 2013 (UTC)
Yes that quote was weirdly situated. I added some context and moved it to the last paragraph of the lead. Bhny (talk) 19:10, 25 September 2013 (UTC)
I was about to revert your edit, but then I figured that wouldn't solve anything, and I'd just be restoring my POV. Instead, I tagged the whole page as OR. I haven't seen a source that establishes the link between the neocognitron, or for that matter Schmidthuber's work, to the recent trend of "deep learning". I'll admit that, from what I've read, it fits one of the definitions that can be gleaned from it, but as this page stands, there's no WP:COMMON here. QVVERTYVS (hm?) 19:34, 25 September 2013 (UTC)
I clarified a few things concerning the contributions of Fukushima, LeCun, Schmidhuber, Hinton, and others, with references. Schmidhuber's open peer review web page http://www.idsia.ch/~juergen/firstdeeplearner.html is a great resource for references. Its acknowledgments read like a "who is who" of neural networks. Yes deeper (talk) 19:17, 29 November 2013 (UTC)
I agree with the Transhumanist and James Bowery above. Don't make it more difficult for readers to find what they are looking for; don't WP:Fork. Who, exactly, uses the term "deep learning"? Certainly not many of the people who's work is described as "deep learning". I think Hinton would say he was working on neural networks or even connectionism. I don't like people being described as being part of research project that they never would have heard of. ---- CharlesGillingham (talk) 23:00, 27 September 2014 (UTC)
Amongst others Yoshua Bengio, Goeffrey Hinton and Yann LeCun use the term "deep learning". —Kri (talk) 21:23, 5 February 2016 (UTC)
"Deep learning" may be synonymous with "neural networks" if you limit yourself to the neural networks that are used today, which are really "deep neural networks", i.e. neural networks with many hidden layers. For a long time, however, neural networks were very difficult to train unless they were very shallow (basically just one or at the maximum two hidden layers). It is first lately that we have been able to train deep neural networks efficiently, which have made it possible for neural networks to learn much more sophisticated features and abstract concepts, which is key in perception. Hence the buzzword "deep learning". So all neural networks can definitely not be counted as forms of deep learning. —Kri (talk) 20:37, 5 February 2016 (UTC)

Definition and citation

The very first sentence appears to be a definition quoted from the given paper "Representation Learning: A Review and New Perspectives".

Deep learning is a set of algorithms in machine learning that attempt to model high-level abstractions in data by using architectures composed of multiple non-linear transformations.

I didn't find the quote, though. I'd suggest to make it clear how to verify the statement. The paper for your reference: http://www.computer.org/csdl/trans/tp/2013/08/ttp2013081798.html — Preceding unsigned comment added by 87.149.191.206 (talk) 14:54, 30 April 2014 (UTC)

It's not a quote. If it were a quote, it would be in quotation marks. QVVERTYVS (hm?) 19:27, 30 April 2014 (UTC)

Errors shrink exponentially?

The article states:

[Recurrent neural networks] are trained by unfolding them into very deep feedforward networks, where a new layer is created for each time step of an input sequence processed by the network. As errors propagate from layer to layer, they shrink exponentially with the number of layers. To overcome this problem [...]

Should this instead say that the errors increase exponentially? Why would the shrinking of errors constitute a "problem"? AxelBoldt (talk) 23:53, 24 May 2014 (UTC)

What I think is meant is the the derivative of the error function in terms of the parameters shrinks as the parameters (weights) are further from the output unit. This is the "vanishing gradient" problem: parts of the net close to the input side receive very small updates, until at some point the derivatives underflow and these parts no longer get any updates. QVVERTYVS (hm?) 15:27, 25 May 2014 (UTC)
Yes, that sounds right. Maybe we can reformulate the error propagation sentence to explain it better? AxelBoldt (talk) 15:04, 26 May 2014 (UTC)

DNN is just an MLP

A deep neural network (DNN) is defined to be an artificial neural network with at least one hidden layer of units between the input and output layers

Following this definition, almost all neural nets are deep. The source for this sentence actually describes good old (shallow) multilayer perceptrons, and says "we will see later on that there are substantial benefits to using many such hidden layers, i.e. the very premise of deep learning" (bold in the original), so it actually distinguishes single-hidden layer nets from deep ones. QVVERTYVS (hm?) 10:31, 23 June 2014 (UTC)

It seems to me then that the definition of DNNs as given here in the article is wrong. It sounds rather like a requirement (i.e. it constitutes a superset for DNNs) rather than an actual definition. —Kri (talk) 11:14, 4 October 2014 (UTC)

Use in NLP

What is a Vector of Pixels?

Limitations of DNNs

The relativity of being recent

Art and AI

stupid sentence

Andrew J.R. Simpson self citations

Why are "deep neural networks" listed as a separate deep learning architecture?

Editing the copyediting

Criticism and comment on "Criticism and comment"

Pseudo Citation in Definition

Please consider and compare before reverting

Text removed as being pseudo-circular in its sourcing (back to Wikipedia)

Artificial Neural Networks section's history

The Reference to the Mordvintsev/Olah/Tyka "Inceptionism" Post

Greek translation

The introductory paragraph to this article is a f***ing trainwreck.

Rewrite

Deep Learning for Natural Language Processing (NLP)

Origin of term

Overlap with Artificial neural network

"Decision Stream" Editing Campaign

Semi-protected edit request on 14 August 2018

A Commons file used on this page has been nominated for speedy deletion

Improving the text describing the first implementation of DNNs using back propagation

Delete "Shallowing deep neural networks" section?

Google Brain referred to in 2009, but only formed in 2011

neuromorphic hardware

Image

Poster child for the reference spamming here

Year of first published learning algorithm - 1965

Snow

Related Articles

Wikiwand AI