CityX: Controllable Procedural Content Generation for Unbounded 3D Cities

Read original: arXiv:2407.17572 - Published 8/7/2024 by Shougao Zhang, Mengqi Zhou, Yuxi Wang, Chuanchen Luo, Rongyu Wang, Yiwei Li, Xucheng Yin, Zhaoxiang Zhang, Junran Peng

CityX: Controllable Procedural Content Generation for Unbounded 3D Cities

Overview

Presents a novel framework called City𝒳 for controllable procedural content generation of 3D cities
Enables the creation of large, unbounded 3D cities with a high degree of user control over the city's appearance and layout
Builds on recent advancements in generative modeling and procedural content generation

Plain English Explanation

City𝒳 is a new system that allows users to easily create large, detailed 3D virtual cities from scratch. Rather than manually modeling every building, street, and park, City𝒳 uses advanced algorithms to automatically generate the city content based on the user's high-level instructions.

The key innovation of City𝒳 is the high degree of control it provides over the final city. Users can specify the overall style, layout, and features they want, and the system will generate a unique city tailored to those preferences. This allows for the creation of diverse, customized cities that cater to the user's specific needs, whether for a video game, architectural visualization, or urban planning application.

Under the hood, City𝒳 uses a combination of machine learning and procedural techniques to generate the city content. It learns patterns from examples of real-world cities, allowing it to create plausible and realistic urban environments. At the same time, the user can interactively guide the generation process, tweaking parameters to achieve their desired outcome.

Technical Explanation

City𝒳 is a novel framework for controllable procedural content generation of 3D cities. It builds upon recent advancements in generative modeling and procedural content generation to enable the creation of large, unbounded 3D cities with a high degree of user control.

The system utilizes a hierarchical, compositional generative model that learns the structure and patterns of cities from example data. This allows it to generate plausible city layouts, building placements, and other urban features. Crucially, the model is designed to be highly configurable, enabling users to specify high-level parameters like the city's overall style, density, and amenities.

The generation process is further enhanced through the use of procedural techniques. These algorithms generate detailed 3D geometry, textures, and other visual elements on the fly, ensuring that the final city is not only realistic but also highly varied and unique.

Throughout the generation process, City𝒳 provides users with extensive control mechanisms. They can interactively adjust parameters, add or remove specific features, and even directly manipulate the city's layout to achieve their desired outcome. This allows for the creation of diverse, customized cities that cater to a wide range of applications, from video games and architectural visualizations to urban planning and analysis.

Critical Analysis

The City𝒳 paper presents a compelling and well-designed framework for procedural city generation. The authors have clearly put a lot of thought into the system's architecture and the user experience, resulting in a powerful tool that offers a high degree of control and flexibility.

One potential limitation of the approach is the reliance on example data to train the underlying generative model. While this allows the system to create plausible and realistic cities, it may also limit the range of possible outcomes, as the model is ultimately constrained by the patterns present in the training data. Expanding the model to handle more diverse or unconventional city layouts could be an area for future research.

Additionally, the computational complexity of the system, especially as city size and complexity increase, may be a practical concern for some applications. The authors touch on this issue but do not provide a detailed analysis of the system's scalability and performance characteristics.

Overall, the City𝒳 paper represents a significant advancement in the field of procedural city generation and could have important implications for a wide range of applications, from urban planning and simulation to game development and architectural visualization.

Conclusion

The City𝒳 paper presents a novel framework for the controllable procedural generation of large, unbounded 3D cities. By combining generative modeling and procedural techniques, the system enables users to create highly customized urban environments with a high degree of control over the city's appearance and layout.

This research represents a significant advancement in the field of procedural content generation, with potential applications in urban planning, architectural visualization, video game development, and beyond. The ability to quickly generate diverse and realistic 3D cities could greatly streamline the design and development process in these domains, while also enabling new forms of interactive urban exploration and analysis.

As the authors note, there are still areas for further research and refinement, such as improving the system's scalability and exploring more diverse city layouts. However, the core ideas and techniques presented in the City𝒳 paper provide a solid foundation for continued progress in this exciting and rapidly evolving field.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

CityX: Controllable Procedural Content Generation for Unbounded 3D Cities

Shougao Zhang, Mengqi Zhou, Yuxi Wang, Chuanchen Luo, Rongyu Wang, Yiwei Li, Xucheng Yin, Zhaoxiang Zhang, Junran Peng

Generating a realistic, large-scale 3D virtual city remains a complex challenge due to the involvement of numerous 3D assets, various city styles, and strict layout constraints. Existing approaches provide promising attempts at procedural content generation to create large-scale scenes using Blender agents. However, they face crucial issues such as difficulties in scaling up generation capability and achieving fine-grained control at the semantic layout level. To address these problems, we propose a novel multi-modal controllable procedural content generation method, named CityX, which enhances realistic, unbounded 3D city generation guided by multiple layout conditions, including OSM, semantic maps, and satellite images. Specifically, the proposed method contains a general protocol for integrating various PCG plugins and a multi-agent framework for transforming instructions into executable Blender actions. Through this effective framework, CityX shows the potential to build an innovative ecosystem for 3D scene generation by bridging the gap between the quality of generated assets and industrial requirements. Extensive experiments have demonstrated the effectiveness of our method in creating high-quality, diverse, and unbounded cities guided by multi-modal conditions. Our project page: https://cityx-lab.github.io.

8/7/2024

SceneX:Procedural Controllable Large-scale Scene Generation via Large-language Models

Mengqi Zhou, Yuxi Wang, Jun Hou, Chuanchen Luo, Zhaoxiang Zhang, Junran Peng

Due to its great application potential, large-scale scene generation has drawn extensive attention in academia and industry. Recent research employs powerful generative models to create desired scenes and achieves promising results. However, most of these methods represent the scene using 3D primitives (e.g. point cloud or radiance field) incompatible with the industrial pipeline, which leads to a substantial gap between academic research and industrial deployment. Procedural Controllable Generation (PCG) is an efficient technique for creating scalable and high-quality assets, but it is unfriendly for ordinary users as it demands profound domain expertise. To address these issues, we resort to using the large language model (LLM) to drive the procedural modeling. In this paper, we introduce a large-scale scene generation framework, SceneX, which can automatically produce high-quality procedural models according to designers' textual descriptions.Specifically, the proposed method comprises two components, PCGBench and PCGPlanner. The former encompasses an extensive collection of accessible procedural assets and thousands of hand-craft API documents. The latter aims to generate executable actions for Blender to produce controllable and precise 3D assets guided by the user's instructions. Our SceneX can generate a city spanning 2.5 km times 2.5 km with delicate layout and geometric structures, drastically reducing the time cost from several weeks for professional PCG engineers to just a few hours for an ordinary user. Extensive experiments demonstrated the capability of our method in controllable large-scale scene generation and editing, including asset placement and season translation.

7/31/2024

📈

CityDreamer: Compositional Generative Model of Unbounded 3D Cities

Haozhe Xie, Zhaoxi Chen, Fangzhou Hong, Ziwei Liu

3D city generation is a desirable yet challenging task, since humans are more sensitive to structural distortions in urban environments. Additionally, generating 3D cities is more complex than 3D natural scenes since buildings, as objects of the same class, exhibit a wider range of appearances compared to the relatively consistent appearance of objects like trees in natural scenes. To address these challenges, we propose textbf{CityDreamer}, a compositional generative model designed specifically for unbounded 3D cities. Our key insight is that 3D city generation should be a composition of different types of neural fields: 1) various building instances, and 2) background stuff, such as roads and green lands. Specifically, we adopt the bird's eye view scene representation and employ a volumetric render for both instance-oriented and stuff-oriented neural fields. The generative hash grid and periodic positional embedding are tailored as scene parameterization to suit the distinct characteristics of building instances and background stuff. Furthermore, we contribute a suite of CityGen Datasets, including OSM and GoogleEarth, which comprises a vast amount of real-world city imagery to enhance the realism of the generated 3D cities both in their layouts and appearances. CityDreamer achieves state-of-the-art performance not only in generating realistic 3D cities but also in localized editing within the generated cities.

6/7/2024

CityCraft: A Real Crafter for 3D City Generation

Jie Deng, Wenhao Chai, Junsheng Huang, Zhonghan Zhao, Qixuan Huang, Mingyan Gao, Jianshu Guo, Shengyu Hao, Wenhao Hu, Jenq-Neng Hwang, Xi Li, Gaoang Wang

City scene generation has gained significant attention in autonomous driving, smart city development, and traffic simulation. It helps enhance infrastructure planning and monitoring solutions. Existing methods have employed a two-stage process involving city layout generation, typically using Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), or Transformers, followed by neural rendering. These techniques often exhibit limited diversity and noticeable artifacts in the rendered city scenes. The rendered scenes lack variety, resembling the training images, resulting in monotonous styles. Additionally, these methods lack planning capabilities, leading to less realistic generated scenes. In this paper, we introduce CityCraft, an innovative framework designed to enhance both the diversity and quality of urban scene generation. Our approach integrates three key stages: initially, a diffusion transformer (DiT) model is deployed to generate diverse and controllable 2D city layouts. Subsequently, a Large Language Model(LLM) is utilized to strategically make land-use plans within these layouts based on user prompts and language guidelines. Based on the generated layout and city plan, we utilize the asset retrieval module and Blender for precise asset placement and scene construction. Furthermore, we contribute two new datasets to the field: 1)CityCraft-OSM dataset including 2D semantic layouts of urban areas, corresponding satellite images, and detailed annotations. 2) CityCraft-Buildings dataset, featuring thousands of diverse, high-quality 3D building assets. CityCraft achieves state-of-the-art performance in generating realistic 3D cities.

6/10/2024