{"id":296768,"date":"2016-09-23T01:47:36","date_gmt":"2016-09-23T08:47:36","guid":{"rendered":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/?post_type=msr-research-item&#038;p=296768"},"modified":"2018-10-16T19:59:21","modified_gmt":"2018-10-17T02:59:21","slug":"learning-bilinear-model-matching-queries-documents","status":"publish","type":"msr-research-item","link":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/publication\/learning-bilinear-model-matching-queries-documents\/","title":{"rendered":"Learning bilinear model for matching queries and documents."},"content":{"rendered":"<p>The task of matching data from two heterogeneous domains naturally arises in various areas such<br \/>\nas web search, collaborative filtering, and drug design. In web search, existing work has designed<br \/>\nrelevance models to match queries and documents by exploiting either user clicks or content of<br \/>\nqueries and documents. To the best of our knowledge, however, there has been little work on principled<br \/>\napproaches to leveraging both clicks and content to learn a matching model for search. In<br \/>\nthis paper, we propose a framework for learning to match heterogeneous objects. The framework<br \/>\nlearns two linear mappings for two objects respectively, and matches them via the dot product of<br \/>\ntheir images after mapping. Moreover, when different regularizations are enforced, the framework<br \/>\nrenders a rich family of matching models. With orthonormal constraints on mapping functions,<br \/>\nthe framework subsumes Partial Least Squares (PLS) as a special case. Alternatively, with a \u21131+\u21132<br \/>\nregularization, we obtain a new model called Regularized Mapping to Latent Structures (RMLS).<br \/>\nRMLS enjoys many advantages over PLS, including lower time complexity and easy parallelization.<br \/>\nTo further understand the matching framework, we conduct generalization analysis and apply<br \/>\nthe result to both PLS and RMLS. We apply the framework to web search and implement both PLS<br \/>\nand RMLS using a click-through bipartite with metadata representing features of queries and documents.<br \/>\nWe test the efficacy and scalability of RMLS and PLS on large scale web search problems.<br \/>\nThe results show that both PLS and RMLS can significantly outperform baseline methods, while<br \/>\nRMLS substantially speeds up the learning process.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The task of matching data from two heterogeneous domains naturally arises in various areas such as web search, collaborative filtering, and drug design. In web search, existing work has designed relevance models to match queries and documents by exploiting either user clicks or content of queries and documents. To the best of our knowledge, however, [&hellip;]<\/p>\n","protected":false},"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":null,"msr_publishername":"","msr_publisher_other":"","msr_booktitle":"","msr_chapter":"","msr_edition":"Journal of Machine Learning Research (JMLR)","msr_editors":"","msr_how_published":"","msr_isbn":"","msr_issue":"","msr_journal":"Journal of Machine Learning Research (JMLR)","msr_number":"","msr_organization":"","msr_pages_string":"2519-2548","msr_page_range_start":"2519","msr_page_range_end":"2548","msr_series":"","msr_volume":"14","msr_copyright":"","msr_conference_name":"","msr_doi":"","msr_arxiv_id":"","msr_s2_paper_id":"","msr_mag_id":"","msr_pubmed_id":"","msr_other_authors":"","msr_other_contributors":"","msr_speaker":"","msr_award":"","msr_affiliation":"","msr_institution":"","msr_host":"","msr_version":"","msr_duration":"","msr_original_fields_of_study":"","msr_release_tracker_id":"","msr_s2_match_type":"","msr_citation_count_updated":"","msr_published_date":"2013-01-01","msr_highlight_text":"","msr_notes":"","msr_longbiography":"","msr_publicationurl":"http:\/\/www.jmlr.org\/papers\/volume14\/wu13a\/wu13a.pdf","msr_external_url":"","msr_secondary_video_url":"","msr_conference_url":"","msr_journal_url":"","msr_s2_pdf_url":"","msr_year":0,"msr_citation_count":0,"msr_influential_citations":0,"msr_reference_count":0,"msr_s2_match_confidence":0,"msr_microsoftintellectualproperty":true,"msr_s2_open_access":false,"msr_s2_author_ids":[],"msr_pub_ids":[],"msr_hide_image_in_river":0,"footnotes":""},"msr-research-highlight":[],"research-area":[13556],"msr-publication-type":[193715],"msr-publisher":[],"msr-focus-area":[],"msr-locale":[268875],"msr-post-option":[],"msr-field-of-study":[],"msr-conference":[],"msr-journal":[],"msr-impact-theme":[],"msr-pillar":[],"class_list":["post-296768","msr-research-item","type-msr-research-item","status-publish","hentry","msr-research-area-artificial-intelligence","msr-locale-en_us"],"msr_publishername":"","msr_edition":"Journal of Machine Learning Research (JMLR)","msr_affiliation":"","msr_published_date":"2013-01-01","msr_host":"","msr_duration":"","msr_version":"","msr_speaker":"","msr_other_contributors":"","msr_booktitle":"","msr_pages_string":"2519-2548","msr_chapter":"","msr_isbn":"","msr_journal":"Journal of Machine Learning Research (JMLR)","msr_volume":"14","msr_number":"","msr_editors":"","msr_series":"","msr_issue":"","msr_organization":"","msr_how_published":"","msr_notes":"","msr_highlight_text":"","msr_release_tracker_id":"","msr_original_fields_of_study":"","msr_download_urls":"","msr_external_url":"","msr_secondary_video_url":"","msr_longbiography":"","msr_microsoftintellectualproperty":1,"msr_main_download":"","msr_publicationurl":"http:\/\/www.jmlr.org\/papers\/volume14\/wu13a\/wu13a.pdf","msr_doi":"","msr_publication_uploader":[{"type":"url","title":"http:\/\/www.jmlr.org\/papers\/volume14\/wu13a\/wu13a.pdf","viewUrl":false,"id":false,"label_id":0}],"msr_related_uploader":"","msr_citation_count":0,"msr_citation_count_updated":"","msr_s2_paper_id":"","msr_influential_citations":0,"msr_reference_count":0,"msr_arxiv_id":"","msr_s2_author_ids":[],"msr_s2_open_access":false,"msr_s2_pdf_url":null,"msr_attachments":[{"id":0,"url":"http:\/\/www.jmlr.org\/papers\/volume14\/wu13a\/wu13a.pdf"}],"msr-author-ordering":[{"type":"text","value":"Wei Wu","user_id":0,"rest_url":false},{"type":"text","value":"Zhengdong Lu","user_id":0,"rest_url":false},{"type":"text","value":"Hang Li","user_id":0,"rest_url":false},{"type":"user_nicename","value":"wuwei","user_id":34855,"rest_url":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/microsoft-research\/v1\/researchers?person=wuwei"}],"msr_impact_theme":[],"msr_research_lab":[199560],"msr_event":[],"msr_group":[],"msr_project":[],"publication":[],"video":[],"msr-tool":[],"msr_publication_type":"article","related_content":[],"_links":{"self":[{"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-research-item\/296768","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-research-item"}],"about":[{"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-research-item"}],"version-history":[{"count":1,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-research-item\/296768\/revisions"}],"predecessor-version":[{"id":517103,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-research-item\/296768\/revisions\/517103"}],"wp:attachment":[{"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/media?parent=296768"}],"wp:term":[{"taxonomy":"msr-research-highlight","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-research-highlight?post=296768"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=296768"},{"taxonomy":"msr-publication-type","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-publication-type?post=296768"},{"taxonomy":"msr-publisher","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-publisher?post=296768"},{"taxonomy":"msr-focus-area","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-focus-area?post=296768"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=296768"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=296768"},{"taxonomy":"msr-field-of-study","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-field-of-study?post=296768"},{"taxonomy":"msr-conference","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-conference?post=296768"},{"taxonomy":"msr-journal","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-journal?post=296768"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=296768"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=296768"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}