ACMMM2012　Accepted Papers

最新推荐文章于 2023-12-11 20:14:46 发布

原创最新推荐文章于 2023-12-11 20:14:46 发布 · 4.7k 阅读

2 ·

本内容遵循CC 4.0 BY-SA版权协议

收录于

科研.学术

Full papers

Paper ID	Paper Title
1	The Acoustic Emotion Gaussians Model for Emotion-Based Music Annotation and Retrieval
47	Attribute Feedback
66	Spatial Pooling of Heterogeneous Features for Image Applications
88	A Social Network-aided Efficient Peer-to-Peer Live Streaming System
90	Sense Beauty via Face, Dressing, and/or Voice
91	Discovering Informative Social Subgraphs and Predicting Pairwise Relationships from Group Photos
110	Constraint-Optimized Keypoint Inhibition/Insertion Attack: Security Threat to Scale-Space Image Feature Extraction
111	Query-driven iterated neighborhood graph search for scalable visual indexing
113	Accelerating SURF Detector on Mobile Devices
114	Harvesting Visual Concepts for Image Search with Complex Queries
118	Whom should I Trust: Topic-sensitive Influencer Mining for Personalized Image Search
123	Query-Adaptive Shape Topic Mining for Hand-Drawn Sketch Recognition
152	“Hi, Magic Closet, Tell Me What to Wear!”
165	El-pincel – A Painter Cloud Service for Greener Web Pages
172	Discovering and Ranking Areas of Interest with Geo-tagged Images and Check-ins
183	An Interactive System of Stereoscopic Video Conversion
193	Controlling Urban Lighting by Human Motion Patterns results from a full Scale Experiment
195	Unsupervised Face-Name Association via Commute Distance
198	A Smile Can Reveal Your Age: Enabling Facial Dynamics in Age Estimation
215	Understanding screen contents for building a high performance, real time screen sharing system
223	Online Crowdsourcing Subjective Image Quality Assessment
239	SymCity: Feature Selection by Symmetry for Large Scale Image Retrieval
259	Annotating Web Images using NOVA: NOn-conVex group spArsity
261	MOGAT: Mobile Games with Auditory Training for Children with Cochlear Implants
262	Scalar Quantization for Large Scale Image Search
270	Multi-View Learning from Imperfect Tagging
275	Don’t Ask Me What I’m Like, Just Watch and Listen
290	Knowledge Adaptation for Ad Hoc Multimedia Event Detection with Few Examplars
303	Image Annotation by Semantic Sparse Recoding of Visual Content
312	Image Understanding and Responsive Design for Automating Magazine Layout
319	Scalable mining of small visual objects
330	Leveraging High-level and Low-level Features for Multimedia Event Detection
332	A Bag-of-Objects Retrieval Model for Web Image Search
338	On Shape and the Computability of Emotions
340	Propagation-Based Social-Aware Replication for Social Video Contents
345	Hybrid Social Media Network
355	A Unified Submodular Framework towards Video Pooling and Hashing
356	GreenTube: Power Optimization for Mobile Video Streaming via Dynamic Cache Management
357	MusicScore: Mobile Music Composition for Practice and Fun
360	Jetway: Minimizing Costs on Inter-Datacenter Video Traffic
368	Context-Aware Mobile Music Recommendation for Daily Activities
374	Snap-and-Ask: Answering Multimodal Question by Naming Visual Instance
387	IMShare: Instantly Sharing Your Mobile Images by Search-based Reconstruction
388	Low Bitrate Source-filter Model Based Compression of Vibrotactile Texture Signals in Haptic Teleoperation
391	Embedding Spatial Context into Inverted File for Large-Scale Image Search
395	Modeling the QoE of Rate Changes in SKYPE/SILK VoIP Calls
403	Dinner of Luciérnaga- An interactive Play with iPhone App in Theater
415	Visual Knowledge Transfer among Multiple Cameras for People Counting with Occlusion Handling
451	When Video Search Goes Wrong: Predicting Query Failure Using Search Engine Logs and Visual Search Results
454	Mining In-Class Social Networks for Large-Scale Pedagogical Analysis
464	Action Recognition for Human-Marionette Interaction
468	Finding Perfect Rendezvous On the Go: Accurate Mobile Visual Localization and Its Applications to Routing
469	SocialTransfer: Cross-Domain Transfer Learning from Social Streams for Media Applications
483	PaperVideo: Multiple Videos In Physical Space
499	Interactive Data-Driven Discovery of Temporal Behavior Models From Events In Media Streams
510	Joint statistical analysis of images and keywords with applications in semantic image enhancement
532	MoViMash: Online Mobile Video Mashup
542	In the eye of the beholder: Employing statistical analysis and eye tracking for analyzing abstract paintings
574	Efficient Image Annotation for Automatic Sentence Generation
577	Exploratory Search of Long Surveillance Videos
579	Control of Distributed Servers for Quality-Fair Delivery of Multiple Video Streams
587	Vibrotactile Feedback of Motor Performance Errors for Enhancing Motor Learning
679	Correlated Attribute Transfer with Multi-task Graph-Guided Fusion
686	Image Colorization Using Similar Images
716	Exploiting Visual Word Co-occurrence for Image Retrieval
736	Enabling “Togetherness” in High-Quality Domestic Video Conferencing
1042	A Multimedia Analytics Framework for Browsing Image Collections in Digital Forensics

Short papers

Paper ID	Paper Title
46	Visual Query Attributes Suggestion
49	Video Saliency Detection in the Compressed Domain
52	Compact Kernel Hashing with Multiple Features
65	Dynamic Camera Calibration Method for Free-viewpoint Experience in Sport Videos
112	Similar image search with a tiny bag-of-delegates representation
141	A Robust and Efficient Shot Boundary Detection Approach Based on Fisher Criterion
179	Robust Cross-Media Transfer for Visual Event Detection
186	Online Non-feedback Image Re-ranking via Dominant Data Selection
190	A Genetic Algorithm for Audio Retargeting
194	Seam Carving with Forward Gradient Difference Maps
197	Optimal Semi-Supervised Metric Learning for Image Retrieval
229	Surveillance Video Coding via Low-Rank and Sparse Decomposition
238	Enhanced Extraction of Moving Objects in Variable Bit-Rate Video Streams
274	Low Rank Metric Learning for Social Image Retrieval
286	Improving Dense Image Correspondence Estimation with Interactive User Guidance
317	View-based 3D Object Retrieval by Bipartite Graph Matching
361	Predicting Human Activities using Spatio-Temporal Structure of Interest Points
475	Personalized Video Recommendation Through Tripartite Graph Propagation
476	Local Visual Words Coding for Low Bit Rate Mobile Visual Search
480	Can we understand van Gogh’s Mood? Learning to infer affects from images in social networks
484	Human Action Recognition and Retrieval Using Sole Depth Information
493	Clothing Genre Classification by Exploiting the Style Elements
494	AttachedShock: Facilitating Moving Targets Acquisition on Augmented Reality Devices using Goal-crossing Actions
563	MixPad: Augmenting Interactive Paper with Mice & Keyboards for Cross-media and Fine-grained Interaction with Documents
671	Attribute-assisted Reranking for Web Image Retrieval
676	Query Expansion Enhancement by Fast Binary Matching
677	Image Tag Re-ranking by Coupled Probability Transition
683	Touch Saliency
710	Local Geometry Adaptive Manifold Re-Ranking for 3D Object Retrieval
741	Recognizing Actions Using Depth Motion Maps-based Histograms of Oriented Gradients
742	Context-aware Affective Images Classification based on Bilayer Sparse Representation
744	DLMSearch: Diversified Landmark Search by Photo
765	Gabor-Based Gradient Orientation Pyramid for Kinship Verification Under Uncontrolled Environments
771	Detecting Rule of Simplicity from Photos
773	An Approach to Automatic Construction of Cinemagraphs
774	Robust Stroke-based Video Animation via Layered Motion and Correspondence
779	Fast Semantic Image Retrieval based on Random Forest
781	Sketch-based Image Retrieval on Mobile Devices Using Compact Hash Bits
786	Breaking Row-Column Shuffle Based Image Cipher
796	Human-Computer Dance Interaction with Realtime Accelerometer Control
801	Texture Optimization for Seamless View Synthesis Through Energy Minimization
812	Activity-Based Person Identification Using Sparse Coding and Discriminative Metric Learning
817	Detection Bank: An Object Detection Based Video Representation for Multimedia Event Recognition
819	Robust AAM-Based Audio-Visual Speech Recognition against Face Direction Changes
825	Towards Relevance and Saliency Ranking of Image Tags
829	Reducing Cross-Group Traffic with Cooperative Streaming Architecture
833	Mobile-Based Advertisement Information Retrieval from Images and Websites
835	Semi-Supervised Multi-Instance Multi-Label Learning for Video Annotation Task
838	On the Music Content Authentication
841	A Study on Making Camera Trajectory from Panorama Watching Manipulation
842	A user study on image browsing on touchscreens
848	Discriminative ICA Model with Reconstruction Constraint for Image Classification
853	A New Heat-Map-based Algorithm for Human Group Activity Recognition
858	Geo-Location Inference on News Articles via Multimodal pLSA
859	A method for detecting salient regions using integrated features
862	Multimedia Event Recounting with Concept based Representation
864	Critical Gameplay: Designing Games to Critique Convention
869	Search Web Images Using Objects, Backgrounds and Conditions
871	Virtual reference view generation for CBIR-based visual pose estimation
881	Detecting Viewing Directions to Landmarks for Recommendation by Large-scale User-contributed Photos
890	Sparsity Cue in Image Copy Detection
891	Deep Nonlinear Metric Learning with Independent Subspace Analysis for Face Verification
894	QoE-based Opportunistic Transmission for Video Broadcasting in Heterogeneous Circumstance
898	Color Transfer Based on Multiscale Gradient-aware Decomposition and Color Distribution Mapping
899	What is Happening: Annotating Images with Verbs
904	On Sparse and Low-Rank Matrix Decomposition for Singing Voice Separation
914	Memorable Basis: Towards Human-Centralized Sparse Representation
920	Near-Duplicate Video Retrieval Based on Clustering by Multiple Sequence Alignment
923	Social Tag Alignment with Image Regions by Sparse Reconstructions
926	Neighborhood Preserving Hashing for Fast Similarity Search
932	Detecting Text in the Real World
933	Hybrid Generative-Discriminative Recognition of Human Action in 3D Joint Space
935	Smooth and efficient crowd transformation
936	Face Image Super-Resolution via Nearest Feature Line
937	Face Photo Retrieval by Sketch Example
944	Drive Video Summarization based on Double Articulation Structure of Driving Behavior
946	Modalities Consensus for Multi-Modal Constraint Propagation
948	Augmented Reality Card Game based on User-specific Information Control
950	Interactive Multimodal Social Robot for Improving Quality of Care of Elderly in Australian Nursing Homes
951	Joint Semantic Segmentation by Searching for Compatible-Competitive References
952	Plug&Touch: A Mobile Interaction Solution for Large Display via Vision-Based Hand Gesture Detection
958	Geometric context-Preserving Progressive Transmission in Mobile Visual Search
961	An Effective Multi-Clue Fusion Approach for Web Video Topic Detection
962	Secure Cloud-based Medical Data Visualization
964	State-based Steganography in Low Bit Rate Speech
968	Efficient Mobile Landmark Recognition Based on Saliency-Aware Scalable Vocabulary Tree
969	3D Fingertip and Palm Tracking in Depth Image Sequences
975	Parsing Collective Behaviors by Hierarchical Model with Varying Structure
976	Supervised Cross-collection Topic Modeling
978	Markov-based Image Forensics for Photographic Copying from Printed Picture
979	Secure Content Sharing for Social Network Using Fingerprinting and Encryption in the TSH Transform Domain
981	PDSS: Patch-Descriptor-Similarity Space for Effective Face Verification
986	Correlation-based burstiness for logo retrieval
989	Large-Scale Simultaneous Multi-Object Recognition and Localization via Bottom Up Search-Based Approach
990	From Speech to Personality: Mapping Voice Quality and Intonation into Personality Differences
991	Predicting the Conflict Level in Television Political Debates: an Approach Based on Crowdsourcing, Nonverbal Communication and Gaussian Processes
992	Sketch-based Image Retrieval on Large Scale Database
998	Using Structural Patches Tiling to Guide Human Head-Shoulder Segmentation
1000	Video Object Segmentation with Shortest Path
1001	Indoor and outdoor profiling of users in multimedia installations
1002	A study on the user perception to color variations
1003	Video Object Cosegmentation
1004	Parallel Deblocking Filtering in H.264/AVC using Multiple CPUs and GPUs
1007	ROI-Based Protection Scheme for High Definition Interactive Video Applications
1008	Community as a Connector: Associating Faces with Celebrity Names in Web Videos
1009	Toward Next Generation Coaching Tools for Court Based Racquet Sports
1014	Ulcer Detection in Wireless Capsule Endoscopy Videos
1023	Advanced Downlink LTE Radio Resource Management for HTTP-Streaming
1028	Dynamic Vocabularies for Web-based Concept Detection by Trend Discovery
1033	Comparison of prediction-based fusion and feature-level fusion across different learning models
1036	Digiti Sonus: An Interactive Fingerprint Sonification
1045	Conversationally-inspired Stylometric Features for Authorship Attribution in Instant Messaging
1050	Coherent Image Selection Using a Fast Approximation to the Generalized Traveling Salesman Problem
1056	Depth Estimation for Semi-Automatic 2D to 3D Conversion
1059	Predicting Domain Adaptivity: Redo or Recycle?
1065	Social Event Detection: Finding Events through the Social Interaction Graph
1068	Music/Speech Classification Using High-level Features Derived from fMRI Brain Imaging
1069	Bilingual Analysis of Song Lyrics and Audio Words
1074	Self-Paced Dictionary Learning for Image Classification
1077	Bridging Music and Image: A Preliminary Study with Multiple Rank-CCA Learning
1080	Energy-Aware Adaptations in Mobile 3D Graphics
1081	ITEM: Immersive Telepresence for Entertainment and Meeting with Commodity Setup
1084	Predicting Participants in Public Events using Stock Photos
1088	Name That Room: Room identification using acoustic features in a recording
1090	Extending the Life Log to Non-human Subjects: Ambient Storytelling for Human-Object Relationships
1118	Enhancing Visual Dominance by Semantics-Preserving Image Recomposition
1119	PRiSMA: Searching Images in Parallel
1127	Client-side backprojection of presentation slides into educational video