Machine Learning

Deconvolutional Latent-Variable Model for Text Sequence Matching

Tagged: , , ,

This topic contains 0 replies, has 1 voice, and was last updated by  arXiv 2 months ago.


  • arXiv
    5 pts

    Deconvolutional Latent-Variable Model for Text Sequence Matching

    A latent-variable model is introduced for text matching, inferring sentence representations by jointly optimizing generative and discriminative objectives. To alleviate typical optimization challenges in latent-variable models for text, we employ deconvolutional networks as the sequence decoder (generator), providing learned latent codes with more semantic information and better generalization. Our model, trained in an unsupervised manner, yields stronger empirical predictive performance than a decoder based on Long Short-Term Memory (LSTM), with less parameters and considerably faster training. Further, we apply it to text sequence-matching problems. The proposed model significantly outperforms several strong sentence-encoding baselines, especially in the semi-supervised setting.

    Deconvolutional Latent-Variable Model for Text Sequence Matching
    by Dinghan Shen, Yizhe Zhang, Ricardo Henao, Qinliang Su, Lawrence Carin
    https://arxiv.org/pdf/1709.07109v1.pdf

You must be logged in to reply to this topic.